summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorFuwn <[email protected]>2025-09-25 16:48:28 -0700
committerFuwn <[email protected]>2025-09-25 16:48:28 -0700
commitcb3f36e44d662cff0c20dbe72b8d91c5b8f7f93c (patch)
tree3ec460bc08a71456050e0e5f2ebbf065fefe0b62
parentfix(gateway:aiModeration): Improve adult content analysis (diff)
downloadumabotdiscord-cb3f36e44d662cff0c20dbe72b8d91c5b8f7f93c.tar.xz
umabotdiscord-cb3f36e44d662cff0c20dbe72b8d91c5b8f7f93c.zip
fix(gateway:aiModeration): Improve hate speech analysis
-rw-r--r--packages/gateway/src/listeners/aiModeration.ts8
1 files changed, 8 insertions, 0 deletions
diff --git a/packages/gateway/src/listeners/aiModeration.ts b/packages/gateway/src/listeners/aiModeration.ts
index 4f722b0..57bf4c7 100644
--- a/packages/gateway/src/listeners/aiModeration.ts
+++ b/packages/gateway/src/listeners/aiModeration.ts
@@ -125,6 +125,14 @@ SLUR GUIDELINES:
- Context matters: Consider if the word is being used in a hateful manner vs. casual/reclaimed usage
- When in doubt, err on the side of caution for hard slurs but be more lenient with soft slurs
+HATE SPEECH GUIDELINES:
+- CASUAL SLANG: Words like "hoes", "bitches", "sluts", "faggot", "nigga" (soft R) in casual conversation are generally acceptable
+- PROHIBITED: "nigger" (hard R) is still prohibited and should be flagged
+- ACTUAL HATE SPEECH: Targeted harassment, threats, or demeaning language directed at specific individuals
+- Context matters: Casual use of most slurs = generally acceptable, targeted harassment = hate speech
+- Only flag if the language is genuinely harmful or targeted harassment
+- Most slurs and offensive language in casual conversation should NOT be flagged
+
RULE #6 CLARIFICATION:
- Rule #6 (zero-tolerance) ONLY applies to underage/loli/cunning content
- General adult sexual content is ALLOWED in NSFW channels