🧠 AI🟢 BullishImportance 7/10

Capabilities Ain't All You Need: Measuring Propensities in AI

arXiv – CS AI|Daniel Romero-Alvarado, Fernando Mart\'inez-Plumed, Lorenzo Pacchiardi, Hugo Save, Siddhesh Milind Pawar, Behzad Mehrbakhsh, Pablo Antonio Moreno Casares, Ben Slater, Paolo Bova, Peter Romero, Zachary R. Tyler, Jonathan Prunty, Luning Sun, Jose Hernandez-Orallo|March 2, 2026 at 05:00 AM|25 views

🤖AI Summary

Researchers introduce the first formal framework for measuring AI propensities - the tendencies of models to exhibit particular behaviors - going beyond traditional capability measurements. The new bilogistic approach successfully predicts AI behavior on held-out tasks and shows stronger predictive power when combining propensities with capabilities than using either measure alone.

Key Takeaways

→Traditional AI evaluation focuses on capabilities but neglects propensities, which are crucial for determining performance and safety outcomes.
→The new bilogistic framework measures when a model's propensity falls within an 'ideal band' for optimal performance.
→Propensities measured on one benchmark successfully predict behavior on completely different tasks.
→Combining propensity and capability measurements provides stronger predictive power than using either alone.
→The framework uses LLMs with task-agnostic rubrics to estimate the limits of ideal propensity ranges.