y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#caching News & Analysis

4 articles tagged with #caching. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

4 articles
AIBullisharXiv โ€“ CS AI ยท Mar 37/104
๐Ÿง 

BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching

Researchers have developed BWCache, a training-free method that accelerates Diffusion Transformer (DiT) video generation by up to 6ร— through block-wise feature caching and reuse. The technique exploits computational redundancy in DiT blocks across timesteps while maintaining visual quality, addressing a key bottleneck in real-world AI video generation applications.

AIBullishOpenAI News ยท Jan 227/107
๐Ÿง 

Scaling PostgreSQL to power 800 million ChatGPT users

OpenAI successfully scaled PostgreSQL to handle millions of queries per second to support 800 million ChatGPT users. The scaling was achieved through strategic implementation of database replicas, caching systems, rate limiting mechanisms, and workload isolation techniques.

AIBullisharXiv โ€“ CS AI ยท Mar 166/10
๐Ÿง 

Asynchronous Verified Semantic Caching for Tiered LLM Architectures

Researchers introduce Krites, an asynchronous caching system for Large Language Models that uses LLM judges to verify cached responses, improving efficiency without changing serving decisions. The system increases the fraction of requests served with curated static answers by up to 3.9 times while maintaining unchanged critical path latency.

AIBullisharXiv โ€“ CS AI ยท Mar 36/103
๐Ÿง 

MeanCache: From Instantaneous to Average Velocity for Accelerating Flow Matching Inference

MeanCache introduces a training-free caching framework that accelerates Flow Matching inference by using average velocities instead of instantaneous ones. The framework achieves 3.59X to 4.56X acceleration on major AI models like FLUX.1, Qwen-Image, and HunyuanVideo while maintaining superior generation quality compared to existing caching methods.