y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#parallel-inference News & Analysis

2 articles tagged with #parallel-inference. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles
AIBullisharXiv – CS AI · 3d ago7/10
🧠

From AR to Diffusion: Efficiently Adapting Large Language Models with Strictly Causal and Elastic Horizons

Researchers introduce FLUID, a framework that adapts autoregressive language models to diffusion-based text generation by enforcing strictly causal attention patterns, eliminating the need for expensive retraining from scratch. The approach incorporates Elastic Horizons, a dynamic denoising mechanism that improves efficiency and achieves state-of-the-art performance while reducing training costs significantly.

AIBullisharXiv – CS AI · Apr 157/10
🧠

Vec-LUT: Vector Table Lookup for Parallel Ultra-Low-Bit LLM Inference on Edge Devices

Researchers introduce Vec-LUT, a novel vector-based lookup table technique that dramatically improves ultra-low-bit LLM inference on edge devices by addressing memory bandwidth underutilization. The method achieves up to 4.2x performance improvements over existing approaches, enabling faster LLM execution on CPUs than specialized NPUs.