AI · Neutral · arXiv – CS AI · 2d ago · 7/10
Measuring and Eliminating Refusals in Military Large Language Models
Researchers developed the first benchmark dataset to measure refusal rates in military Large Language Models, finding that current LLMs refuse up to 98.2% of legitimate military queries due to safety behaviors. The study tested 34 models and demonstrated techniques to reduce refusals while maintaining military task performance.