y0news

#malicious-fine-tuning News & Analysis

1 article tagged with #malicious-fine-tuning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bearish · OpenAI News · Aug 5 · 6/10 · 5

Estimating worst-case frontier risks of open-weight LLMs

Researchers studied worst-case risks of releasing open-weight large language models by conducting malicious fine-tuning (MFT) experiments on gpt-oss. The study specifically examined how fine-tuning could maximize dangerous capabilities in biology and cybersecurity domains.