y0news
AnalyticsDigestsSourcesRSSAICrypto
#model-specialization1 article
1 articles
AINeutralarXiv โ€“ CS AI ยท 2d ago7/10
๐Ÿง 

Measuring and Eliminating Refusals in Military Large Language Models

Researchers developed the first benchmark dataset to measure refusal rates in military Large Language Models, finding that current LLMs refuse up to 98.2% of legitimate military queries due to safety behaviors. The study tested 34 models and demonstrated techniques to reduce refusals while maintaining military task performance.