AIBullishHugging Face Blog ยท Feb 186/106
๐ง
IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST
IBM and UC Berkeley collaborated to develop IT-Bench and MAST diagnostic tools to identify and analyze failure points in enterprise AI agent deployments. The research addresses critical gaps in understanding why AI agents underperform in real-world business environments compared to controlled testing scenarios.