40 articles tagged with #energy-efficiency. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AI · Bullish · arXiv – CS AI · 2d ago · 7/10
🧠Researchers present MoEITS, a novel algorithm for simplifying Mixture-of-Experts large language models while maintaining performance and reducing computational costs. The method outperforms existing pruning techniques across multiple benchmark models including Mixtral 8×7B and DeepSeek-V2-Lite, addressing the energy and resource efficiency challenges of deploying advanced LLMs.
AI · Bullish · arXiv – CS AI · 2d ago · 7/10
🧠EdgeCIM presents a specialized hardware-software framework designed to accelerate Small Language Model inference on edge devices by addressing memory-bandwidth bottlenecks inherent in autoregressive decoding. The system achieves significant performance and energy improvements over existing mobile accelerators, reaching 7.3x higher throughput than NVIDIA Orin Nano on 1B-parameter models.
🏢 Nvidia
AI · Bullish · arXiv – CS AI · 2d ago · 7/10
🧠Researchers have developed PAS-Net, a physics-aware spiking neural network that dramatically reduces power consumption in wearable IMU-based human activity recognition systems. The architecture achieves state-of-the-art accuracy while cutting energy consumption by up to 98% through sparse integer operations and an early-exit mechanism, establishing a new standard for ultra-low-power edge computing on battery-constrained devices.
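The early-exit idea can be sketched in a few lines: accumulate integer spike counts over timesteps and stop as soon as one class clearly dominates, so easy inputs consume only a fraction of the timesteps (and thus energy). This is an illustrative toy, not PAS-Net's actual architecture; the threshold, minimum-evidence rule, and spike counts below are made up.

```python
import numpy as np

def early_exit_inference(spikes, threshold=0.7, min_spikes=4):
    """Accumulate integer spike counts per class over timesteps; exit as
    soon as one class holds >= `threshold` of all spikes seen so far."""
    acc = np.zeros(spikes.shape[1])
    for t in range(spikes.shape[0]):
        acc += spikes[t]
        total = acc.sum()
        if total >= min_spikes and acc.max() / total >= threshold:
            return int(acc.argmax()), t + 1   # predicted class, steps used
    return int(acc.argmax()), spikes.shape[0]

# Toy spike train (timesteps x classes): class 2 fires most often.
spikes = np.array([
    [0, 1, 2],
    [1, 0, 3],
    [0, 0, 3],
    [0, 1, 2],
])
cls, steps = early_exit_inference(spikes)   # exits after 2 of 4 timesteps
```

Skipping the remaining timesteps is where the saving comes from: a spiking accelerator only spends energy on the sparse integer additions it actually performs.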
AI · Bullish · arXiv – CS AI · 3d ago · 7/10
🧠EquiformerV3, an advanced SE(3)-equivariant graph neural network, achieves significant improvements in efficiency, expressivity, and generality for 3D atomistic modeling. The new version delivers 1.75x speedup, introduces architectural innovations like SwiGLU-S² activations and smooth-cutoff attention, and achieves state-of-the-art results on major molecular modeling benchmarks including OC20 and OMat24.
AI · Bullish · arXiv – CS AI · 3d ago · 7/10
🧠Researchers introduce Ge²mS-T, a novel Spiking Vision Transformer architecture that optimizes energy efficiency while maintaining training and inference performance through multi-dimensional grouped computation. The approach addresses fundamental limitations in existing SNN paradigms by balancing memory overhead, learning capability, and energy consumption simultaneously.
AI · Bullish · arXiv – CS AI · Mar 17 · 7/10
🧠SPARQ introduces a unified framework combining spiking neural networks, quantization-aware training, and reinforcement-learning-guided early exits for energy-efficient edge AI. The system achieves up to 5.15% higher accuracy than conventional quantized SNNs while reducing system energy consumption by a factor of more than 330 and cutting synaptic operations by over 90%.
AI · Neutral · arXiv – CS AI · Mar 17 · 7/10
🧠Researchers propose shifting from large monolithic AI models to domain-specific superintelligence (DSS) societies due to unsustainable energy costs and physical constraints of current generative AI scaling approaches. The alternative involves smaller, specialized models working together through orchestration agents, potentially enabling on-device deployment while maintaining reasoning capabilities.
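The orchestration idea amounts to a thin router in front of several small specialist models. A minimal sketch, in which the `classify` function and the specialist names are hypothetical stand-ins rather than anything proposed in the paper:

```python
def make_orchestrator(specialists, classify):
    """Route each query to a small domain-specific model instead of one
    large monolithic model. `specialists` maps domain name -> callable;
    `classify` picks the domain, falling back to a general model."""
    def answer(query):
        domain = classify(query)
        model = specialists.get(domain, specialists["general"])
        return model(query)
    return answer

# Stand-in "models" that just label their output.
specialists = {
    "math":    lambda q: f"[math-SLM] {q}",
    "code":    lambda q: f"[code-SLM] {q}",
    "general": lambda q: f"[general-SLM] {q}",
}

def classify(q):
    if "integral" in q:
        return "math"
    if "python" in q.lower():
        return "code"
    return "general"

ask = make_orchestrator(specialists, classify)
```

Because each specialist is small, individual models could plausibly run on-device, with only the routing step shared.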
AI · Bullish · arXiv – CS AI · Mar 16 · 7/10
🧠Researchers developed an SRAM-based compute-in-memory accelerator for spiking neural networks that uses a linear-decay approximation instead of exponential decay, achieving a 1.1x to 16.7x reduction in energy consumption. The innovation addresses the bottleneck of neuron-state updates in neuromorphic computing by performing in-place decay directly within the memory arrays.
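The trade-off is easy to see numerically: an exponential leak multiplies the membrane potential by e^(-dt/τ) each step, which is costly to realize inside a memory array, while a linear leak is a fixed subtraction that can be applied in place. A minimal sketch, with constants chosen so the two slopes match initially; these are not the paper's hardware parameters:

```python
import math

def exp_decay(v, dt=1.0, tau=10.0):
    # Standard leaky-integrate-and-fire decay: multiply by exp(-dt/tau).
    return v * math.exp(-dt / tau)

def linear_decay(v, step=0.095, v0=1.0):
    # Linear approximation: subtract a fixed fraction of the initial value
    # (and floor at zero), which maps to a cheap in-place subtraction.
    return max(v - step * v0, 0.0)

v_exp, v_lin = 1.0, 1.0
for _ in range(5):
    v_exp = exp_decay(v_exp)    # -> exp(-0.5) ~ 0.607
    v_lin = linear_decay(v_lin) # -> 1 - 5 * 0.095 = 0.525
```

Over short decay windows the two curves stay close, which is what makes the cheaper update an acceptable approximation.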
AI · Bullish · arXiv – CS AI · Mar 11 · 7/10
🧠Researchers have developed DendroNN, a novel neural network architecture inspired by brain dendrites that achieves up to 4x higher energy efficiency than current neuromorphic hardware for spatiotemporal event-based computing. The system uses spike sequence detection and a unique rewiring training method to process temporal data without requiring gradients or recurrent connections.
AI · Bullish · arXiv – CS AI · Mar 6 · 7/10
🧠A research paper presents a 10-year roadmap for coordinated AI and hardware co-development, targeting 1000x efficiency improvements in AI training and inference by 2035. The vision emphasizes energy efficiency over raw compute scaling, proposing integrated solutions across algorithms, architectures, and systems to enable sustainable AI deployment from cloud to edge environments.
AI · Bullish · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers developed a joint hardware-workload co-optimization framework for in-memory computing accelerators that can efficiently support multiple neural network workloads rather than just single specialized models. The framework achieved significant energy-delay-area product reductions of up to 76.2% and 95.5% compared to baseline methods when optimizing across multiple workloads.
AI · Bullish · The Register – AI · Mar 4 · 7/10
🧠A UK datacenter successfully demonstrated the ability to reduce AI workload power consumption by 40% on demand, showcasing flexible power management capabilities. This development highlights the potential for datacenters to better manage energy usage and grid stability while maintaining AI operations.
AI · Bullish · arXiv – CS AI · Mar 4 · 6/10
🧠Researchers propose a heterogeneous computing framework for Mixture-of-Experts AI models that combines analog in-memory computing with digital processing to improve energy efficiency. The approach identifies noise-sensitive experts for digital computation while running the majority on analog hardware, eliminating the need for costly retraining of large models.
AI · Bullish · arXiv – CS AI · Mar 4 · 6/10
🧠Researchers developed TinyIceNet, a compact AI model for real-time sea-ice mapping from satellite SAR imagery, designed specifically for on-board FPGA processing in space. The system achieves a 75.216% F1 score while consuming 50% less energy than GPU baselines, demonstrating practical AI deployment for maritime navigation in polar regions.
AI · Bullish · arXiv – CS AI · Mar 4 · 7/10
🧠Researchers propose FAST, a new DNN-free framework for coreset selection that compresses large datasets into representative subsets for training deep neural networks. The method uses frequency-domain distribution matching and achieves 9.12% average accuracy improvement while reducing power consumption by 96.57% compared to existing methods.
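A rough, DNN-free flavor of frequency-domain distribution matching: represent each sample by its FFT magnitude and greedily pick the subset whose mean spectrum tracks the full dataset's. This is an illustrative sketch, not FAST's actual algorithm; the greedy objective, dataset, and sizes are made up.

```python
import numpy as np

rng = np.random.default_rng(42)

def spectral_signature(x):
    # Per-sample frequency-domain feature: magnitude of the real FFT.
    return np.abs(np.fft.rfft(x, axis=-1))

def select_coreset(data, k):
    """Greedy, DNN-free selection: at each step pick the sample whose
    inclusion keeps the running mean of spectral signatures closest to
    the full dataset's mean signature."""
    sigs = spectral_signature(data)
    target = sigs.mean(axis=0)
    chosen, acc = [], np.zeros_like(target)
    remaining = list(range(len(data)))
    for step in range(k):
        best = min(remaining,
                   key=lambda i: np.linalg.norm((acc + sigs[i]) / (step + 1) - target))
        chosen.append(best)
        acc += sigs[best]
        remaining.remove(best)
    return chosen

data = rng.normal(size=(200, 64))   # toy "dataset": 200 signals of length 64
subset = select_coreset(data, k=20)
```

No network forward passes are needed at selection time, which is where the power saving in a method of this shape would come from.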
AI · Bullish · arXiv – CS AI · Mar 3 · 7/10
🧠Researchers developed ZeroDVFS, a system that uses Large Language Models to optimize power management in embedded systems without requiring extensive profiling. The system achieves 7.09 times better energy efficiency and enables zero-shot deployment for new workloads in under 5 seconds through LLM-based code analysis.
AI · Bullish · IEEE Spectrum – AI · Jan 27 · 7/10
🧠Researchers at Lawrence Berkeley National Laboratory have developed thermodynamic computing techniques that could generate AI images using one ten-billionth the energy of current methods. The approach uses physical circuits that respond to natural thermal noise instead of energy-intensive digital neural networks, though the technology remains rudimentary compared to existing AI image generators like DALL-E.
AI · Bullish · MIT News – AI · Dec 11 · 7/10
🧠Researchers have developed a new approach to improve microelectronics energy efficiency by stacking multiple active components made from new materials on the back end of computer chips. This innovation aims to reduce energy waste during computational processes.
AI · Bullish · OpenAI News · Oct 13 · 7/10
🧠OpenAI and Broadcom announced a multi-year strategic partnership to deploy 10 gigawatts of OpenAI-designed AI accelerators by 2029. The collaboration will focus on co-developing next-generation systems and Ethernet solutions for scalable, energy-efficient AI infrastructure.
Crypto · Bullish · Ethereum Foundation Blog · May 18 · 7/10
⛓️Ethereum is transitioning to the Proof-of-Stake consensus mechanism in the coming months, which is expected to reduce its energy consumption by approximately 99.95%. The Beacon Chain has been operational for several months, providing real-world data on the energy-efficiency gains the merge will deliver.
$ETH
AI · Neutral · arXiv – CS AI · 2d ago · 6/10
🧠ConfigSpec introduces a profiling-based framework for optimizing distributed LLM inference across edge-cloud systems using speculative decoding. The research reveals that no single configuration can simultaneously optimize throughput, cost efficiency, and energy efficiency—requiring dynamic, device-aware configuration selection rather than fixed deployments.
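Speculative decoding itself is easy to sketch: a cheap draft model proposes a few tokens, the target model verifies them in one pass, and the accepted prefix (plus one corrected token) is kept. The toy below uses deterministic stand-in "models" and greedy acceptance; real systems accept probabilistically, and a profiling layer like ConfigSpec's would sit on top of this loop, tuning k and device placement.

```python
def speculative_decode(target_next, draft_next, prompt, n_tokens, k=4):
    """Toy greedy speculative decoding. `target_next`/`draft_next` map a
    token context to the next token; the draft runs k cheap steps, then
    the target checks all k positions at once."""
    seq = list(prompt)
    while len(seq) - len(prompt) < n_tokens:
        # Draft proposes k tokens autoregressively (cheap).
        proposal, ctx = [], list(seq)
        for _ in range(k):
            t = draft_next(ctx)
            proposal.append(t)
            ctx.append(t)
        # Target verifies the k positions (conceptually one batched pass).
        accepted, ctx = [], list(seq)
        for t in proposal:
            expected = target_next(ctx)
            if t == expected:
                accepted.append(t)
                ctx.append(t)
            else:
                accepted.append(expected)   # correct the first mismatch
                break
        seq.extend(accepted)
    return seq[:len(prompt) + n_tokens]

# Deterministic stand-ins: the "target" follows a fixed pattern and the
# "draft" agrees with it most of the time.
target = lambda ctx: (len(ctx) * 7) % 10
draft = lambda ctx: (len(ctx) * 7) % 10 if len(ctx) % 5 else 0
out = speculative_decode(target, draft, [1, 2], 8)
```

Greedy verification guarantees the output matches what the target model alone would produce; the win is that the target runs fewer sequential passes.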
AI · Bullish · arXiv – CS AI · 2d ago · 6/10
🧠Researchers demonstrate that quantization and local inference techniques can reduce LLM energy consumption and carbon emissions by up to 45% without sacrificing performance. The findings address growing sustainability concerns surrounding generative AI deployment, offering practical optimization strategies for resource-constrained environments.
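The quantization half of that result rests on a simple mechanism: storing weights as int8 plus a single fp32 scale cuts memory size and traffic roughly 4x with bounded rounding error. A minimal symmetric per-tensor sketch (the matrix size and weight distribution are illustrative, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(7)

def quantize_int8(w):
    # Symmetric per-tensor int8 quantization: one fp32 scale plus int8
    # weights replace the fp32 weights (4x smaller in memory).
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

w = rng.normal(0, 0.02, size=(256, 256)).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)

# Worst-case rounding error is half a quantization step: scale / 2.
rel_err = np.abs(w - w_hat).max() / np.abs(w).max()
```

Moving 4x fewer bytes per weight is a large share of the energy saving, since DRAM traffic dominates the cost of LLM inference on small devices.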
AI · Neutral · arXiv – CS AI · 2d ago · 6/10
🧠Researchers present the first systematic study of performance-energy trade-offs in multi-request LLM inference workflows, using NVIDIA A100 GPUs and vLLM/Parrot serving systems. The study identifies batch size as the most impactful optimization lever, though effectiveness varies by workload type, and reveals that workflow-aware scheduling can reduce energy consumption under power constraints.
🏢 Nvidia
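Why batch size dominates the energy picture: a GPU's static power draw is paid per second regardless of load, so amortizing it over more concurrent tokens drives joules-per-token down sharply until throughput saturates. A toy cost model makes the shape of the curve visible; all constants here are hypothetical, not measured A100 numbers.

```python
def tokens_per_second(batch, t_base=0.05, t_per_seq=0.002):
    # Simple model: a batched decode step costs a fixed latency plus a
    # small per-sequence increment, so throughput rises sublinearly.
    step_time = t_base + t_per_seq * batch
    return batch / step_time

def energy_per_token(batch, static_power=250.0, dynamic_per_token=0.05):
    # Static power (watts) divided by throughput (tokens/s) gives the
    # amortized static joules per token; dynamic cost adds on top.
    return static_power / tokens_per_second(batch) + dynamic_per_token

e1 = energy_per_token(1)     # joules/token at batch size 1
e32 = energy_per_token(32)   # far lower: static power amortized 32 ways
```

Under a model like this, batching is the biggest single lever, but it interacts with latency targets, which is why workload-aware scheduling still matters.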
AI · Bullish · arXiv – CS AI · Apr 7 · 6/10
🧠Researchers developed SpikeVPR, a bio-inspired visual place recognition system using event-based cameras and spiking neural networks that achieves comparable performance to deep networks while using 50x fewer parameters and consuming 30-250x less energy. The neuromorphic approach enables real-time deployment on mobile platforms for autonomous robot navigation.
AI · Bullish · arXiv – CS AI · Apr 7 · 6/10
🧠Researchers outline how neuromorphic computing could overcome energy efficiency limits in classical CMOS technology for AI applications. The approach requires co-design across materials, circuits, and algorithms to achieve brain-inspired compute-in-memory architectures.