AI · Neutral · arXiv – CS AI · Mar 5 · 7/10
Inference-Time Toxicity Mitigation in Protein Language Models
Researchers developed Logit Diff Amplification (LDA) as an inference-time safety mechanism for protein language models to prevent toxic protein generation. The method reduces predicted toxicity rates while maintaining biological plausibility and structural viability, addressing dual-use safety concerns in AI-driven protein design.
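The summary does not spell out how LDA adjusts the model's outputs, so the following is only a minimal sketch of one common form of logit-difference steering, not the paper's confirmed formulation. It assumes two sets of next-token logits over the same amino-acid vocabulary: one from a baseline model and one from a hypothetical toxicity-prone reference model. The function names, the `alpha` parameter, and the update rule `base + alpha * (base - toxic)` are all illustrative assumptions.

```python
import math


def amplify_logit_diff(base_logits, toxic_logits, alpha=1.0):
    """Push the base logits away from a toxicity-prone reference.

    ASSUMPTION: the rule base + alpha * (base - toxic) is an illustrative
    form of logit-difference amplification; the summary does not state
    the exact formula used in the paper.
    """
    if len(base_logits) != len(toxic_logits):
        raise ValueError("logit vectors must cover the same vocabulary")
    return [b + alpha * (b - t) for b, t in zip(base_logits, toxic_logits)]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Toy three-token vocabulary: the hypothetical toxic reference strongly
# prefers token 1, so amplification lowers that token's probability.
base = [1.0, 1.0, 1.0]
toxic = [1.0, 3.0, 1.0]
steered = amplify_logit_diff(base, toxic, alpha=1.0)
steered_probs = softmax(steered)
```

With `alpha=0.0` the sketch returns the baseline logits unchanged. Larger values steer harder away from the reference model's preferences, which is where any trade-off against biological plausibility would be expected to show up.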