y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#npu News & Analysis

3 articles tagged with #npu. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles
AIBullisharXiv โ€“ CS AI ยท Mar 267/10
๐Ÿง 

DVM: Real-Time Kernel Generation for Dynamic AI Models

Researchers have developed DVM, a real-time compiler for dynamic AI models that uses bytecode virtual machine technology to significantly speed up compilation times. The system achieves up to 11.77x better operator/model efficiency and up to 5 orders of magnitude faster compilation compared to existing solutions like TorchInductor and PyTorch.

AIBullisharXiv โ€“ CS AI ยท Apr 76/10
๐Ÿง 

MUXQ: Mixed-to-Uniform Precision MatriX Quantization via Low-Rank Outlier Decomposition

Researchers propose MUXQ, a new quantization technique for large language models that addresses activation outliers through low-rank decomposition. The method enables efficient INT8 quantization while maintaining accuracy close to FP16, making it suitable for edge device deployment with NPU-based hardware.

๐Ÿข Perplexity
AIBullisharXiv โ€“ CS AI ยท Mar 126/10
๐Ÿง 

Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis

Researchers introduce EvoKernel, a self-evolving AI framework that addresses the 'Data Wall' problem in deploying Large Language Models for kernel synthesis on data-scarce hardware platforms like NPUs. The system uses memory-based reinforcement learning to improve correctness from 11% to 83% and achieves 3.60x speedup through iterative refinement.