summaryrefslogtreecommitdiff
path: root/packages/gateway/src
diff options
context:
space:
mode:
authorFuwn <[email protected]>2025-09-25 16:00:59 -0700
committerFuwn <[email protected]>2025-09-25 16:00:59 -0700
commit112d4615397a1dd63f5077378cfd4ac9fab8df80 (patch)
tree15d63ed11e21a6f3792ad8595dfb086a4d7c07cf /packages/gateway/src
parentfix(gateway:aiModeration): Improve message and history context (diff)
downloadumabotdiscord-112d4615397a1dd63f5077378cfd4ac9fab8df80.tar.xz
umabotdiscord-112d4615397a1dd63f5077378cfd4ac9fab8df80.zip
fix(gateway:aiModeration): Improve hate speech analysis
Diffstat (limited to 'packages/gateway/src')
-rw-r--r--packages/gateway/src/listeners/aiModeration.ts6
1 files changed, 6 insertions, 0 deletions
diff --git a/packages/gateway/src/listeners/aiModeration.ts b/packages/gateway/src/listeners/aiModeration.ts
index 25e6a29..5d0e9db 100644
--- a/packages/gateway/src/listeners/aiModeration.ts
+++ b/packages/gateway/src/listeners/aiModeration.ts
@@ -115,6 +115,12 @@ Please analyze THIS SPECIFIC MESSAGE for any rule violations. Consider:
IMPORTANT: Only analyze the current message content. The chat history is provided for context but should not be the basis for flagging the current message. If the current message itself is innocent but appears in a conversation with inappropriate content, do NOT flag it.
+SLUR GUIDELINES:
+- HARD SLURS (with hard R, full spelling): These are prohibited and should be flagged for deletion
+- SOFT SLURS (with soft R, censored, or abbreviated): These are generally acceptable in casual conversation
+- Context matters: Consider if the word is being used in a hateful manner vs. casual/reclaimed usage
+- When in doubt, err on the side of caution for hard slurs but be more lenient with soft slurs
+
Respond with a JSON object containing:
{
"violation": boolean,