AINeutralarXiv โ CS AI ยท 9h ago6/10
๐ง
MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection
Researchers released MALINT, the first human-annotated English dataset for detecting disinformation and its malicious intent, developed with expert fact-checkers. The study benchmarked 12 language models and introduced intent-based inoculation techniques that improved zero-shot disinformation detection across six datasets, five LLMs, and seven languages.
๐ง Llama