AIBullisharXiv โ CS AI ยท 5h ago1
๐ง
Concept Heterogeneity-aware Representation Steering
Researchers introduce CHaRS (Concept Heterogeneity-aware Representation Steering), a new method for controlling large language model behavior that uses optimal transport theory to create context-dependent steering rather than global directions. The approach models representations as Gaussian mixture models and derives input-dependent steering maps, showing improved behavioral control over existing methods.