AI · Bullish · arXiv CS AI · 4h ago
FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation
Researchers introduce FlexGuard, an LLM content moderation system that outputs a continuous risk score instead of a binary decision, so platforms can adjust moderation strictness as needed. The system addresses a limitation of existing guardrail models, which break down when moderation requirements differ across platforms or change over time.
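
To illustrate the general idea of strictness-adaptive moderation, here is a minimal Python sketch of how a continuous risk score might be turned into a decision under different strictness settings. This is not FlexGuard's actual interface: the `moderate` function, the `THRESHOLDS` presets, and the threshold values are all hypothetical, and the risk score is assumed to lie in [0, 1] with higher meaning riskier.

```python
# Hypothetical strictness presets. The names and values are illustrative
# assumptions, not taken from FlexGuard.
THRESHOLDS = {"strict": 0.3, "moderate": 0.5, "lenient": 0.8}


def moderate(risk_score: float, strictness: str) -> str:
    """Map a continuous risk score in [0, 1] to 'block' or 'allow'.

    A stricter setting uses a lower threshold, so more content is blocked.
    """
    if not 0.0 <= risk_score <= 1.0:
        raise ValueError("risk_score must be in [0, 1]")
    if strictness not in THRESHOLDS:
        raise ValueError(f"unknown strictness: {strictness!r}")
    return "block" if risk_score >= THRESHOLDS[strictness] else "allow"


if __name__ == "__main__":
    score = 0.6  # assumed output of some risk-scoring model
    for level in THRESHOLDS:
        print(level, moderate(score, level))
```

Under these assumed thresholds, the same score of 0.6 is blocked on a strict or moderate platform but allowed on a lenient one, without re-running or retraining the scoring model. A binary classifier, by contrast, fixes that cut-off at training time.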