#av-validation News & Analysis

2 articles tagged with #av-validation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles

AIBullisharXiv – CS AI · May 277/10

🧠

ScenePilot: Controllable Boundary-Driven Critical Scenario Generation for Autonomous Driving

ScenePilot is a new framework for generating safety-critical scenarios to test autonomous driving systems by targeting the boundary between physically feasible and infeasible situations. Using constrained reinforcement learning combined with physical feasibility constraints, the method achieves 6.2 percentage points higher collision rates while maintaining physical validity, enabling more effective stress testing of AV safety systems.

AINeutralarXiv – CS AI · Apr 146/10

🧠

General-purpose LLMs as Models of Human Driver Behavior: The Case of Simplified Merging

Researchers evaluated whether general-purpose LLMs (OpenAI o3 and Google Gemini 2.5 Pro) can model human driving behavior in autonomous vehicle safety testing by embedding them as standalone driver agents in a simplified merging scenario. While both models reproduced some human-like behaviors, they failed to consistently capture responses to dynamic velocity cues and diverged significantly on safety metrics, suggesting LLMs show promise as ready-to-use behavior models but require further validation.

🏢 OpenAI🧠 o1🧠 o3