#real-world-data News & Analysis

2 articles tagged with #real-world-data. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bearish · arXiv – CS AI · Apr 13 · 6/10

Towards Real-world Human Behavior Simulation: Benchmarking Large Language Models on Long-horizon, Cross-scenario, Heterogeneous Behavior Traces

Researchers introduce OmniBehavior, a benchmark for evaluating large language models' ability to simulate real-world human behavior across complex, long-horizon scenarios. The study finds that current LLMs struggle with authentic behavioral simulation and are systematically biased toward homogenized, overly positive personas, failing to capture individual differences and realistic long-tail behaviors.

AI · Neutral · arXiv – CS AI · Feb 27 · 4/10 · 7

MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios

Researchers introduce MobilityBench, a new benchmark for evaluating LLM-based route-planning agents using real-world mobility data from Amap. The study finds that current AI models perform well on basic route planning but struggle significantly with preference-constrained routing tasks.