y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#gpu-inference News & Analysis

2 articles tagged with #gpu-inference. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles
AI × CryptoBullishBlockonomi · May 27/10
🤖

Nebius (NBIS) Stock Surges 12% Following $643M Eigen AI Acquisition Announcement

Nebius announced a $643 million acquisition of Eigen AI to strengthen its GPU inference capabilities and expand operations in the United States, triggering an 11.76% surge in NBIS stock price. The deal signals intensifying consolidation in the AI infrastructure sector as companies compete for computational resources and market positioning.

AIBullisharXiv – CS AI · May 17/10
🧠

Predictive Multi-Tier Memory Management for KV Cache in Large-Scale GPU Inference

Researchers present a unified system for optimizing KV cache memory management in large-scale GPU inference, addressing three critical inefficiencies through architecture-aware sizing, multi-tier memory hierarchy spanning CPU to NVMe storage, and predictive eviction policies. The approach achieves 70-84% cache hit rates and projects 1.4-2.1x improvements in latency and 1.7-2.9x throughput gains while reducing costs by 47% compared to existing solutions.