AIBullisharXiv – CS AI · May 46/10
🧠Researchers present Space-XNet, a framework for efficiently deploying mixture-of-experts language models across satellite constellations using optimized expert placement strategies. The approach achieves a threefold latency reduction compared to conventional methods, addressing key challenges in executing energy-intensive AI workloads in space where computing and communication resources are severely constrained.
AINeutralAI News · Apr 136/10
🧠Enterprise security leaders face growing challenges securing edge AI deployments as models like Google Gemma 4 proliferate beyond traditional cloud infrastructure. Organizations built robust cloud security perimeters but now struggle to govern AI workloads running on distributed edge systems, requiring new governance approaches.
AINeutralarXiv – CS AI · Apr 136/10
🧠Researchers demonstrate that applying Bayesian inference to Spiking Neural Networks (SNNs) for speech processing smooths the irregular loss landscape caused by threshold-based spike generation. Testing on speech datasets shows improved performance metrics and more regular predictive landscapes compared to deterministic approaches.
AIBullishMarkTechPost · Mar 166/10
🧠IBM has released Granite 4.0 1B Speech, a compact multilingual speech-language model optimized for automatic speech recognition and translation. The model is specifically designed for enterprise and edge deployments where memory efficiency, low latency, and compute optimization are critical alongside performance quality.
AIBullisharXiv – CS AI · Mar 166/10
🧠Researchers introduce DART, a new framework for early-exit deep neural networks that achieves up to 3.3x speedup and 5.1x lower energy consumption while maintaining accuracy. The system uses input difficulty estimation and adaptive thresholds to optimize AI inference for resource-constrained edge devices.
AIBullisharXiv – CS AI · Mar 126/10
🧠Researchers developed a new continual learning framework for human activity recognition (HAR) in IoT wearable devices that prevents AI models from forgetting previous tasks when learning new ones. The method uses gated adaptation to achieve 77.7% accuracy while reducing forgetting from 39.7% to 16.2%, training only 2% of parameters.
AIBullisharXiv – CS AI · Mar 37/106
🧠Researchers developed TinyVLM, the first framework enabling zero-shot object detection on microcontrollers with less than 1MB memory. The system achieves real-time inference at 26 FPS on STM32H7 and over 1,000 FPS on MAX78000, making AI vision capabilities practical for resource-constrained edge devices.
AINeutralarXiv – CS AI · Mar 27/1017
🧠Researchers introduce RooflineBench, a framework for measuring performance capabilities of Small Language Models on edge devices using operational intensity analysis. The study reveals that sequence length significantly impacts performance, model depth causes efficiency regression, and structural improvements like Multi-head Latent Attention can unlock better hardware utilization.
AIBullishAI News · Mar 115/10
🧠ADLINK Technology has partnered with Under Control Robotics (Noble Machines) to develop smart robots for dangerous industrial environments. The collaboration will integrate ADLINK's edge AI platforms with Noble Machines' autonomy software to create general-purpose robots for manufacturing and engineering facilities.
AINeutralGoogle Research Blog · Oct 153/104
🧠The article appears to discuss Coral NPU as a comprehensive platform for Edge AI applications. However, the provided article body only contains 'Generative AI' without substantive content to analyze.