AINeutralarXiv โ CS AI ยท 17h ago6/10
๐ง
ContextBench: Modifying Contexts for Targeted Latent Activation
Researchers have developed ContextBench, a new benchmark for evaluating methods that generate targeted inputs to trigger specific behaviors in language models. The study introduces enhanced Evolutionary Prompt Optimization techniques that better balance effectiveness in activating AI model features while maintaining linguistic fluency.