AI · Bullish · arXiv – CS AI · 14h ago · 7/10
EdgeCIM: A Hardware-Software Co-Design for CIM-Based Acceleration of Small Language Models
EdgeCIM presents a specialized hardware-software framework designed to accelerate Small Language Model inference on edge devices by addressing memory-bandwidth bottlenecks inherent in autoregressive decoding. The system achieves significant performance and energy improvements over existing mobile accelerators, reaching 7.3x higher throughput than NVIDIA Orin Nano on 1B-parameter models.
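As a rough illustration of why autoregressive decoding tends to be memory-bandwidth bound: at batch size 1, each generated token requires streaming approximately all model weights from memory once, so dividing memory bandwidth by the total weight size gives an upper bound on tokens per second. The sketch below is not taken from the EdgeCIM paper; the function name `decode_tokens_per_second_bound`, the 50 GB/s bandwidth, and the 8-bit weight format are hypothetical values chosen only to make the arithmetic concrete.

```python
def decode_tokens_per_second_bound(num_params, bytes_per_param, bandwidth_bytes_per_s):
    """Upper bound on batch-1 decode throughput when weight traffic dominates.

    Assumes every weight is read from memory once per generated token and
    ignores KV-cache traffic, activations, and compute time.
    """
    if num_params <= 0 or bytes_per_param <= 0 or bandwidth_bytes_per_s <= 0:
        raise ValueError("all arguments must be positive")
    weight_bytes = num_params * bytes_per_param
    return bandwidth_bytes_per_s / weight_bytes


if __name__ == "__main__":
    # Hypothetical setup: 1B parameters, 1 byte per weight, 50 GB/s of bandwidth.
    bound = decode_tokens_per_second_bound(1e9, 1, 50e9)
    print(f"{bound:.1f} tokens/s upper bound")
```

Under these assumptions the bound depends only on bandwidth and weight size, not on arithmetic throughput, which is the kind of bottleneck that compute-in-memory designs aim to relieve by reducing weight movement.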
Nvidia