AIBullisharXiv โ CS AI ยท 19h ago7/10
๐ง
Sysformer: Safeguarding Frozen Large Language Models with Adaptive System Prompts
Researchers developed Sysformer, a novel approach to safeguard large language models by adapting system prompts rather than fine-tuning model parameters. The method achieved up to 80% improvement in refusing harmful prompts while maintaining 90% compliance with safe prompts across 5 different LLMs.