AI · Bullish · arXiv – CS AI · 1d ago · 6/10
Navigating the Concept Space of Language Models
Researchers have developed Concept Explorer, a scalable interactive system for exploring features from sparse autoencoders (SAEs) trained on large language models. The tool uses hierarchical neighborhood embeddings to organize thousands of SAE features into interpretable concept clusters, making it easier to discover and analyze the concepts that language models represent.