AINeutralarXiv โ CS AI ยท Feb 274/107
๐ง
MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
Researchers introduce MobilityBench, a new benchmark for evaluating LLM-based route-planning agents using real-world mobility data from Amap. The study reveals that current AI models perform well on basic route planning but struggle significantly with preference-constrained routing tasks.