AINeutralarXiv – CS AI · 11h ago6/10
🧠
ToxSyn-PT: A Synthetic Fine-Grained Dataset of Minority-Targeted Toxic Language in Portuguese
Researchers introduce ToxSyn-PT, a large-scale Portuguese dataset for detecting hate speech targeting minority groups, featuring fine-grained annotations and non-toxic counterexamples absent in existing datasets. The study reveals that hate speech detection models trained on social media fail to generalize to minority-specific contexts, exposing critical gaps in current evaluation metrics and highlighting the need for specialized datasets in non-English languages.
🏢 Hugging Face