AINeutralarXiv – CS AI · Mar 176/10
🧠
MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection
Researchers released MALINT, the first human-annotated English dataset for detecting disinformation and its malicious intent, developed with expert fact-checkers. The study benchmarked 12 language models and introduced intent-based inoculation techniques that improved zero-shot disinformation detection across six datasets, five LLMs, and seven languages.
🧠 Llama