AINeutralarXiv โ CS AI ยท 5h ago
๐ง
Inference-Time Toxicity Mitigation in Protein Language Models
Researchers developed Logit Diff Amplification (LDA) as an inference-time safety mechanism for protein language models to prevent toxic protein generation. The method reduces predicted toxicity rates while maintaining biological plausibility and structural viability, addressing dual-use safety concerns in AI-driven protein design.