AINeutralarXiv โ CS AI ยท 14h ago6/10
๐ง
SLALOM: Simulation Lifecycle Analysis via Longitudinal Observation Metrics for Social Simulation
Researchers introduce SLALOM, a validation framework addressing the credibility crisis of LLM-based social simulations by shifting focus from outcome accuracy to process fidelity. The framework uses Dynamic Time Warping to compare simulated trajectories against empirical data across intermediate checkpoints, enabling quantitative assessment of whether simulations achieve realistic social mechanisms rather than merely correct endpoints.