AI · Neutral · arXiv – CS AI · 7h ago · 6/10
Characterizing Software Aging in GPU-Based LLM Serving Systems
Researchers conducted a 216-hour empirical study of software aging in GPU-based LLM serving systems and found statistically significant memory leaks across deployments. Memory degradation rates vary substantially with the serving runtime and configuration. The study also establishes a reproducible framework for studying aging patterns in systems that combine Python hosts and CUDA devices.