AINeutralarXiv โ CS AI ยท 3d ago7/10
๐ง
Curveball Steering: The Right Direction To Steer Isn't Always Linear
Researchers propose 'Curveball steering', a nonlinear method for controlling large language model behavior that outperforms traditional linear approaches. The study challenges the Linear Representation Hypothesis by showing that LLM activation spaces have substantial geometric distortions that require geometry-aware interventions.