AI · Neutral · arXiv – CS AI · 18h ago · 6/10
Dealing with Annotator Disagreement in Hate Speech Classification
Researchers address the overlooked problem of annotator disagreement in hate speech classification, showing that the traditional approach of discarding samples without annotator consensus produces inflated performance metrics. The study reports new state-of-the-art results for Turkish tweet classification by treating disagreement as a useful signal rather than noise, using label aggregation methods and perceived hate speech strength scores to build more robust detection systems.
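
To make the contrast concrete, here is a minimal sketch of three ways to turn multiple annotations into training labels: keeping only consensus samples, majority-vote aggregation, and a soft label. This is an illustration under stated assumptions, not the paper's method. The tweet IDs, the annotations, the tie-breaking rule, and all function names are hypothetical, and the soft label here is just the fraction of annotators voting "hateful", which may differ from the strength score the study uses.

```python
from collections import Counter

def is_consensus(labels):
    """True when every annotator gave the same label."""
    return len(set(labels)) == 1

def majority_vote(labels):
    """Most common label; ties break toward the higher label (an assumption)."""
    counts = Counter(labels)
    top = max(counts.values())
    return max(label for label, count in counts.items() if count == top)

def soft_label(labels):
    """Fraction of annotators who marked the sample as hateful (label 1)."""
    return sum(labels) / len(labels)

# Hypothetical binary annotations (1 = hateful, 0 = not), three annotators each.
annotations = {
    "t1": [1, 1, 1],
    "t2": [0, 0, 0],
    "t3": [1, 0, 1],  # annotators disagree
    "t4": [0, 1, 0],  # annotators disagree
}

# Traditional filtering: the contested samples t3 and t4 are dropped entirely.
consensus_only = {k: v[0] for k, v in annotations.items() if is_consensus(v)}

# Aggregation keeps every sample, either as a hard or a soft label.
majority = {k: majority_vote(v) for k, v in annotations.items()}
soft = {k: soft_label(v) for k, v in annotations.items()}
```

In this toy data, consensus-only filtering keeps two of the four samples, and they are exactly the ones annotators found easy. An evaluation set built that way never tests the model on contested cases, which is one plausible reading of why the summary describes the resulting metrics as inflated.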