y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#edge-computing News & Analysis

124 articles tagged with #edge-computing. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

124 articles
AIBullisharXiv – CS AI · 3d ago7/10
🧠

BlazeEdit: Generalist Image Editing on Mobile Devices with Image-to-Image Diffusion Models

Google researchers unveiled BlazeEdit, a 195M-parameter image-to-image diffusion model optimized for on-device mobile deployment, eliminating text-conditioning to handle object removal, outpainting, tone correction, relighting, and sticker generation. The model completes inference in 290ms on Pixel 10 while maintaining competitive quality, advancing the trend toward privacy-preserving edge AI.

AINeutralarXiv – CS AI · 3d ago7/10
🧠

Benchmarking Fairness in Spiking Neural Networks: Data Bias, Spurious Features, and Hardware Effects

Researchers introduce the first systematic fairness benchmark for Spiking Neural Networks (SNNs), revealing that biased training data causes 23% higher false positive rates for underrepresented groups, while hardware constraints amplify accuracy gaps by up to 41% in edge deployments. The study demonstrates that existing bias mitigation strategies fail under resource constraints, establishing the need for co-designed approaches that balance fairness with hardware efficiency.

AIBullisharXiv – CS AI · 3d ago7/10
🧠

Beyond Motion Primitives: Behavioral Activity Recognition from Head-Mounted IMU

Researchers from Harvard's AI and Robotics Lab have developed HiT-HAR, a hierarchical deep learning model that enables AR smart glasses to recognize complex human behaviors beyond basic motion primitives using only head-mounted IMU sensors. The team created a 160K-sample dataset and demonstrated that architectural choices exploiting temporal context outperform simple model scaling, advancing the feasibility of always-on behavioral context awareness for augmented reality applications.

AIBullisharXiv – CS AI · 3d ago7/10
🧠

Bandwidth-Efficient and Privacy-Preserving Edge-Cloud Many-to-Many Speech Translation

Researchers introduce ESRT, a privacy-preserving edge-cloud framework for multilingual speech-to-text translation that processes voice data locally while transmitting only compressed features to the cloud. The system achieves state-of-the-art performance across 45 languages while reducing bandwidth requirements by 10x and preventing voiceprint leakage.

AIBullisharXiv – CS AI · 4d ago7/10
🧠

MobileMoE: Scaling On-Device Mixture of Experts

Researchers present MobileMoE, a family of sub-billion parameter Mixture-of-Experts language models optimized for on-device deployment that achieve 2-4x efficiency gains over dense models while matching or exceeding performance. The work establishes new on-device scaling laws and delivers the first practical MoE inference implementation on smartphones, with 1.8-3.8x faster performance than existing mobile baselines.

AIBullisharXiv – CS AI · 4d ago7/10
🧠

StreamSplit: Continuous Audio Representation Learning via Uncertainty-Guided Adaptive Splitting

StreamSplit introduces a novel framework enabling continuous contrastive learning on edge devices by dynamically partitioning computation between local and cloud resources. Using reinforcement learning and uncertainty guidance, the system reduces latency by up to 4.7x and bandwidth by 77.1% while maintaining near-server accuracy, making distributed AI inference practical for resource-constrained hardware.

AIBullisharXiv – CS AI · 4d ago7/10
🧠

MobileExplorer: Accelerating On-Device Inference for Mobile GUI Agents via Online Exploration

MobileExplorer is a new framework that enables faster on-device inference for mobile GUI agents by leveraging parallel exploration of UI elements during model reasoning time. The system reduces latency by 23% while maintaining or improving task success rates, addressing privacy and network dependency concerns in mobile AI applications.

AIBullisharXiv – CS AI · May 127/10
🧠

SAFformer:Improving Spiking Transformer via Active Predictive Filtering

Researchers introduce SAFformer, a novel Spiking Transformer architecture that improves energy efficiency and accuracy by adopting an active predictive filtering paradigm inspired by brain mechanisms. The model achieves state-of-the-art performance on image recognition benchmarks while consuming significantly less power than conventional approaches.

AI × CryptoBullisharXiv – CS AI · May 127/10
🤖

Privacy-Preserving Federated Learning: Integrating Zero-Knowledge Proofs in Scalable Distributed Architectures

Researchers present a novel federated learning architecture that integrates Zero-Knowledge Proofs to validate distributed machine learning computations while preserving privacy. The system addresses model poisoning attacks and scalability bottlenecks, achieving 94.2% accuracy retention across 1,000 parallel nodes—bridging cryptographic security with high-performance distributed AI.

AIBullisharXiv – CS AI · May 97/10
🧠

SANet: A Semantic-aware Agentic AI Networking Framework for Cross-layer Optimization in 6G

Researchers propose SANet, a semantic-aware agentic AI networking framework designed to optimize 6G wireless networks through collaborative AI agents that autonomously manage cross-layer network functions. The framework achieves 14.61% performance gains while reducing computational requirements to 44.37% of existing solutions, demonstrating practical efficiency improvements for next-generation telecommunications infrastructure.

AIBullishBlockonomi · May 87/10
🧠

Akamai (AKAM) Stock Rockets 23% Following Massive $1.8B AI Cloud Contract

Akamai Technologies secured a $1.8 billion AI infrastructure contract with a frontier model provider, triggering a 23% premarket surge in AKAM stock. The company also delivered Q1 earnings that exceeded analyst expectations, signaling strong execution in the competitive AI cloud services market.

AIBullisharXiv – CS AI · May 77/10
🧠

LAWS: Learning from Actual Workloads Symbolically -- A Self-Certifying Parametrized Cache Architecture for Neural Inference, Robotics, and Edge Deployment

Researchers introduce LAWS, a self-certifying caching architecture for neural inference that builds a library of expert functions with formal error bounds, enabling efficient deployment across LLMs, robotics, and edge devices. The system generalizes both Mixture-of-Experts and KV prefix caching while providing mathematically verifiable performance guarantees without requiring ground truth validation.

AIBullisharXiv – CS AI · May 47/10
🧠

RadLite: Multi-Task LoRA Fine-Tuning of Small Language Models for CPU-Deployable Radiology AI

Researchers demonstrate that small language models (3-4B parameters) can achieve strong multi-task radiology performance through LoRA fine-tuning, enabling deployment on consumer-grade CPUs without GPUs. The RadLite system, trained on 162K samples across 9 radiology tasks, shows dramatic performance improvements over zero-shot baselines and can be quantized to 1.8-2.4GB for practical clinical deployment.

AIBullisharXiv – CS AI · Apr 147/10
🧠

SVD-Prune: Training-Free Token Pruning For Efficient Vision-Language Models

SVD-Prune introduces a training-free token pruning method for Vision-Language Models using Singular Value Decomposition to reduce computational overhead. The approach maintains model performance while drastically reducing vision tokens to 16-32, addressing efficiency challenges in multimodal AI systems without requiring retraining.

AIBullisharXiv – CS AI · Apr 137/10
🧠

Ge$^\text{2}$mS-T: Multi-Dimensional Grouping for Ultra-High Energy Efficiency in Spiking Transformer

Researchers introduce Ge²mS-T, a novel Spiking Vision Transformer architecture that optimizes energy efficiency while maintaining training and inference performance through multi-dimensional grouped computation. The approach addresses fundamental limitations in existing SNN paradigms by balancing memory overhead, learning capability, and energy consumption simultaneously.

AIBullisharXiv – CS AI · Mar 277/10
🧠

LLM4AD: Large Language Models for Autonomous Driving -- Concept, Review, Benchmark, Experiments, and Future Trends

Researchers have published a comprehensive review of Large Language Models for Autonomous Driving (LLM4AD), introducing new benchmarks and conducting real-world experiments on autonomous vehicle platforms. The paper explores how LLMs can enhance perception, decision-making, and motion control in self-driving cars, while identifying key challenges including latency, security, and safety concerns.

AIBullishTechCrunch – AI · Mar 267/10
🧠

Mistral releases a new open-source model for speech generation

Mistral has released a new open-source speech generation model that is lightweight enough to run on mobile devices including smartwatches and smartphones. This represents a significant advancement in making AI speech capabilities more accessible and portable for edge computing applications.

AIBullisharXiv – CS AI · Mar 267/10
🧠

The Cognitive Firewall:Securing Browser Based AI Agents Against Indirect Prompt Injection Via Hybrid Edge Cloud Defense

Researchers developed the Cognitive Firewall, a hybrid edge-cloud defense system that protects browser-based AI agents from indirect prompt injection attacks. The three-stage architecture reduces attack success rates to below 1% while maintaining 17,000x faster response times compared to cloud-only solutions by processing simple attacks locally and complex threats in the cloud.

AIBullisharXiv – CS AI · Mar 177/10
🧠

HO-SFL: Hybrid-Order Split Federated Learning with Backprop-Free Clients and Dimension-Free Aggregation

Researchers propose HO-SFL (Hybrid-Order Split Federated Learning), a new framework that enables memory-efficient fine-tuning of large AI models on edge devices by eliminating backpropagation on client devices while maintaining convergence speed comparable to traditional methods. The approach significantly reduces communication costs and memory requirements for distributed AI training.

AIBullisharXiv – CS AI · Mar 177/10
🧠

PrototypeNAS: Rapid Design of Deep Neural Networks for Microcontroller Units

PrototypeNAS is a new zero-shot neural architecture search method that rapidly designs and optimizes deep neural networks for microcontroller units without requiring extensive training. The system uses a three-step approach combining structural optimization, ensemble zero-shot proxies, and Hypervolume subset selection to identify efficient models within minutes that can run on resource-constrained edge devices.

AIBullisharXiv – CS AI · Mar 177/10
🧠

SPARQ: Spiking Early-Exit Neural Networks for Energy-Efficient Edge AI

SPARQ introduces a unified framework combining spiking neural networks, quantization-aware training, and reinforcement learning-guided early exits for energy-efficient edge AI. The system achieves up to 5.15% higher accuracy than conventional quantized SNNs while reducing system energy consumption by over 330 times and cutting synaptic operations by over 90%.

AIBullisharXiv – CS AI · Mar 117/10
🧠

Meissa: Multi-modal Medical Agentic Intelligence

Researchers have developed Meissa, a lightweight 4B-parameter medical AI model that brings advanced agentic capabilities offline for healthcare applications. The system matches frontier models like GPT in medical benchmarks while operating with 25x fewer parameters and 22x lower latency, addressing privacy and cost concerns in clinical settings.

🧠 Gemini
AINeutralarXiv – CS AI · Mar 97/10
🧠

Real-Time AI Service Economy: A Framework for Agentic Computing Across the Continuum

Researchers propose a framework for decentralized resource allocation in real-time AI services across device-edge-cloud infrastructure. The study shows that dependency graph topology determines whether price-based allocation can work at scale, with hierarchical structures enabling stable pricing while complex dependencies cause instability.

Page 1 of 5Next →