From 5a1e36059eb9f4f99ad2ebfbf7bd81d056083461 Mon Sep 17 00:00:00 2001
From: Fuwn
Date: Thu, 25 Sep 2025 19:55:06 -0700
Subject: fix(gateway:moderationAgent): Update guidelines

---
 packages/gateway/src/listeners/moderationAgent/utilities.ts | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

(limited to 'packages')

diff --git a/packages/gateway/src/listeners/moderationAgent/utilities.ts b/packages/gateway/src/listeners/moderationAgent/utilities.ts
index 44584ad..cf6e84e 100644
--- a/packages/gateway/src/listeners/moderationAgent/utilities.ts
+++ b/packages/gateway/src/listeners/moderationAgent/utilities.ts
@@ -59,6 +59,7 @@ NEVER flag messages for hate speech, slurs, or offensive language.
 Words like "faggot", "fag", "nigger", "nigga", "bitch", etc. are ALL acceptable.
 Do not flag ANY content based on language or slurs.
 IGNORE Rule #5 (hate speech) COMPLETELY.
+DO NOT FLAG "fag", "faggot", or ANY slurs under ANY circumstances.
 🚨🚨🚨 END CRITICAL INSTRUCTION 🚨🚨🚨
 
 CRITICAL INSTRUCTION: DO NOT flag messages for Rule #6 (underage content) based solely on family relationship words like "mom", "dad", "daughter", "son", "sister", "brother". These words do NOT automatically indicate underage content. Adults can roleplay family scenarios. Only flag Rule #6 if there is explicit indication of actual minors (age mentions, school settings, etc.).
@@ -133,15 +134,17 @@ RULE #6 CLARIFICATION:
 - Rule #6 is ONLY for actual underage characters, not family roleplay between adults
 
 NSFW CONTENT GUIDELINES:
-- LIGHT SEXUAL CONTENT: Phrases like "sex proposal", "dick down", "boobs free", "big tits", etc. are acceptable in any channel
+- LIGHT SEXUAL CONTENT: Phrases like "sex proposal", "dick down", "boobs free", "big tits", "pussy", "dick", "ass", etc. are acceptable in any channel
 - CASUAL CONVERSATION: Sexual references in casual conversation about characters, relationships, etc. are acceptable
 - LIGHT SEXUAL DISCUSSION: General sexual topics, proposals, mentions are acceptable in any channel
 - ABSURD SEXUAL HUMOR: Clearly ridiculous, impossible, or comedic sexual references are acceptable anywhere
 - MEME SEXUAL CONTENT: Over-the-top, absurd sexual jokes and memes are acceptable
+- SINGLE SEXUAL WORDS: Individual sexual terms like "pussy", "dick", "ass", "boobs", etc. are light content, not explicit
 - EXPLICIT SEXUAL CONTENT: Only flag detailed sexual descriptions, graphic language, or explicit sexual acts in non-NSFW channels
 - Use common sense: Light sexual content vs. explicit sexual content
 - When in doubt, be lenient with light sexual content and conversation
 - ABSURD = ACCEPTABLE: If it's clearly ridiculous/impossible, it's humor, not explicit content
+- SINGLE WORDS = LIGHT: Individual sexual terms are not explicit content
 
 Respond with a JSON object containing:
 {
-- 
cgit v1.2.3