y0news
#representation-steering · 1 article
AI · Bullish · arXiv – CS AI · 5h ago

Concept Heterogeneity-aware Representation Steering

Researchers introduce CHaRS (Concept Heterogeneity-aware Representation Steering), a method for controlling large language model behavior that uses optimal transport theory to derive context-dependent steering instead of a single global steering direction. The approach models representations as Gaussian mixture models and derives input-dependent steering maps, which the authors report give better behavioral control than existing methods.
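To make the idea of an input-dependent steering map concrete, here is a minimal sketch. It is not the paper's implementation. It assumes the source and target representations are each a Gaussian mixture whose components are paired one-to-one, moves each pair with the standard closed-form optimal transport map between two Gaussians, and blends those maps using the input's posterior responsibilities under the source mixture. All function names (`psd_sqrt`, `gaussian_ot_map`, `responsibilities`, `steer`) and the one-to-one pairing are illustrative assumptions.

```python
import numpy as np

def psd_sqrt(mat):
    """Symmetric square root of a positive semidefinite matrix."""
    vals, vecs = np.linalg.eigh(mat)
    return (vecs * np.sqrt(np.clip(vals, 0.0, None))) @ vecs.T

def gaussian_ot_map(m_src, cov_src, m_tgt, cov_tgt):
    """Closed-form OT map between two Gaussians, returned as T(x) = A x + b."""
    root = psd_sqrt(cov_src)
    inv_root = np.linalg.inv(root)
    A = inv_root @ psd_sqrt(root @ cov_tgt @ root) @ inv_root
    return A, m_tgt - A @ m_src

def responsibilities(x, weights, means, covs):
    """Posterior probability of each source component given x."""
    logp = []
    for w, m, c in zip(weights, means, covs):
        diff = x - m
        _, logdet = np.linalg.slogdet(c)
        maha = diff @ np.linalg.solve(c, diff)
        logp.append(np.log(w) - 0.5 * (logdet + maha))
    logp = np.array(logp)
    logp -= logp.max()  # subtract the max for numerical stability
    p = np.exp(logp)
    return p / p.sum()

def steer(x, weights, src_means, src_covs, tgt_means, tgt_covs):
    """Hypothetical steering map: responsibility-weighted blend of per-component OT maps.

    Assumes component k of the source mixture is paired with component k of the target.
    """
    r = responsibilities(x, weights, src_means, src_covs)
    out = np.zeros_like(x, dtype=float)
    for k in range(len(weights)):
        A, b = gaussian_ot_map(src_means[k], src_covs[k], tgt_means[k], tgt_covs[k])
        out += r[k] * (A @ x + b)
    return out

# Toy example: two well-separated clusters that should move in opposite directions.
weights = np.array([0.5, 0.5])
src_means = np.array([[0.0, 0.0], [20.0, 0.0]])
tgt_means = np.array([[0.0, 3.0], [20.0, -3.0]])
covs = np.array([np.eye(2), np.eye(2)])
steered = steer(src_means[0], weights, src_means, covs, tgt_means, covs)
```

In this toy setup a single global direction would have to move both clusters the same way, whereas the blended map shifts an input near the first cluster up and an input near the second cluster down.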