y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#capacity-planning News & Analysis

4 articles tagged with #capacity-planning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

4 articles
AIBullisharXiv – CS AI · May 77/10
🧠

A Queueing-Theoretic Framework for Stability Analysis of LLM Inference with KV Cache Memory Constraints

Researchers introduce a queueing-theoretic framework that models LLM inference stability by accounting for both computational and GPU memory constraints from KV caching. The framework derives conditions for service stability and enables operators to calculate optimal cluster sizes for efficient GPU provisioning, with experimental validation showing predictions within 10% accuracy.

AINeutralarXiv – CS AI · 4d ago5/10
🧠

Optimal Scheduling in a Question-Answering Forum of Knowledge Workers

Researchers propose an optimal scheduling system for question-answering forums staffed by paid knowledge workers rather than volunteers. The study calculates system capacity, designs efficient schedulers, and explores how expert collaboration can improve request-handling throughput.