βBack to feed
π§ AIβͺ NeutralImportance 6/10
MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection
arXiv β CS AI|Arkadiusz Modzelewski, Witold Sosnowski, Eleni Papadopulos, Elisa Sartori, Tiziano Labruna, Giovanni Da San Martino, Adam Wierzbicki|
π€AI Summary
Researchers released MALINT, the first human-annotated English dataset for detecting disinformation and its malicious intent, developed with expert fact-checkers. The study benchmarked 12 language models and introduced intent-based inoculation techniques that improved zero-shot disinformation detection across six datasets, five LLMs, and seven languages.
Key Takeaways
- βMALINT is the first English corpus specifically designed to capture both disinformation and the malicious intent behind it.
- βResearchers tested 12 language models including BERT and Llama 3.3 on binary and multilabel intent classification tasks.
- βIntent-based inoculation, inspired by psychology research, integrates intent analysis to improve disinformation detection.
- βThe approach showed improvements in zero-shot disinformation detection across multiple languages and datasets.
- βThe MALINT dataset has been released publicly to support further research in intent-aware disinformation detection.
Mentioned in AI
Models
LlamaMeta
#disinformation-detection#llm-research#dataset-release#malicious-intent#fact-checking#bert#llama#nlp#inoculation-theory
Read Original βvia arXiv β CS AI
Act on this with AI
Stay ahead of the market.
Connect your wallet to an AI agent. It reads balances, proposes swaps and bridges across 15 chains β you keep full control of your keys.
Related Articles