y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#edge-inference News & Analysis

4 articles tagged with #edge-inference. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

4 articles
AIBullisharXiv – CS AI · 4d ago7/10
🧠

The Rescue Effect: Spatio-Semantic Early Exit Bypasses Quantization Collapse in CLIP

Researchers address a critical failure mode in quantized Vision-Language Models by proposing LRA-EE, a technique that uses early exit strategies to bypass noise-saturated layers in INT8 CLIP. The method improves zero-shot classification accuracy by 2.44 percentage points while reducing computational load by 13.4%, demonstrating that selective layer utilization can recover performance lost to quantization-induced representation collapse.

AIBullisharXiv – CS AI · May 117/10
🧠

EULER-ADAS: Energy-Efficient & SIMD-Unified Logarithmic-Posit Engine for Precision-Reconfigurable Approximate ADAS Acceleration

EULER-ADAS is a specialized neural compute engine that uses bounded-Posit arithmetic to accelerate Advanced Driver-Assistance Systems (ADAS) inference on edge devices. The architecture achieves up to 71.9% power reduction and 10x better energy efficiency compared to conventional Posit implementations while maintaining near-FP32 accuracy, demonstrating practical viability for real-time autonomous driving applications.

AIBullisharXiv – CS AI · Apr 147/10
🧠

EdgeCIM: A Hardware-Software Co-Design for CIM-Based Acceleration of Small Language Models

EdgeCIM presents a specialized hardware-software framework designed to accelerate Small Language Model inference on edge devices by addressing memory-bandwidth bottlenecks inherent in autoregressive decoding. The system achieves significant performance and energy improvements over existing mobile accelerators, reaching 7.3x higher throughput than NVIDIA Orin Nano on 1B-parameter models.

🏢 Nvidia