AINeutralarXiv โ CS AI ยท 7h ago7/10
๐ง
Safety-Guided Flow (SGF): A Unified Framework for Negative Guidance in Safe Generation
Researchers introduce Safety-Guided Flow (SGF), a unified probabilistic framework that combines control barrier functions with negative guidance approaches to improve safety in AI-generated content. The framework identifies a critical time window during the denoising process where strong negative guidance is most effective for preventing harmful outputs.