AINeutralarXiv – CS AI · 6h ago6/10
🧠
The Information Geometry of Softmax: Probing and Steering
Researchers present a theoretical framework using information geometry to understand how AI systems encode semantic meaning in their representation spaces, introducing 'dual steering' as a method to precisely control model behavior through linear concept manipulation while minimizing unintended side effects.