AIBullisharXiv – CS AI · 6d ago7/10
🧠Researchers present alpha-beta-CROWN, a neural network verification framework that enables formal verification of learning-based controllers in safety-critical systems. The tool addresses scalability challenges in verifying controller properties like stability and safety by computing certified bounds on nonlinear functions and using GPU parallelization for complex verification tasks.
AIBullisharXiv – CS AI · Mar 177/10
🧠Researchers developed SToRM, a new framework that reduces computational costs for autonomous driving systems using multi-modal large language models by up to 30x while maintaining performance. The system uses supervised token reduction techniques to enable real-time end-to-end driving on standard GPUs without sacrificing safety or accuracy.
AIBullisharXiv – CS AI · Mar 57/10
🧠Researchers have developed Sim2Sea, a comprehensive framework that successfully bridges the simulation-to-reality gap for autonomous maritime vessel navigation in congested waters. The system uses GPU-accelerated parallel simulation, dual-stream spatiotemporal policy, and targeted domain randomization to achieve zero-shot transfer from simulation to real-world deployment on a 17-ton unmanned vessel.
AIBullisharXiv – CS AI · Mar 46/102
🧠Researchers developed GPUTOK, a GPU-accelerated tokenizer for large language models that processes text significantly faster than existing CPU-based solutions. The optimized version shows 1.7x speed improvement over tiktoken and 7.6x over HuggingFace's GPT-2 tokenizer while maintaining output quality.
AINeutralarXiv – CS AI · 4d ago6/10
🧠ScheduleStream introduces a GPU-accelerated framework for Task and Motion Planning & Scheduling (TAMPAS) that enables bimanual and humanoid robots to coordinate parallel arm movements efficiently. The system models temporal dynamics through hybrid durative actions and produces more optimized schedules than traditional TAMP algorithms that typically move one arm at a time.
AINeutralarXiv – CS AI · May 125/10
🧠Researchers have developed GPU-accelerated versions of the Boruta feature selection algorithm, significantly improving computational efficiency for processing large-scale datasets while maintaining accuracy comparable to the original CPU-based method. The two variants—Boruta-Permut and Boruta-TreeImp—demonstrate that GPU acceleration offers a cost-effective solution for machine learning workflows on high-dimensional data.
AIBullisharXiv – CS AI · Apr 206/10
🧠Researchers introduced cuNNQS-SCI, a fully GPU-accelerated framework that solves a critical scalability bottleneck in neural network quantum state methods for solving complex quantum systems. The system achieves 2.32X speedup over previous CPU-GPU hybrid approaches while maintaining chemical accuracy, demonstrating 90%+ parallel efficiency across 64 GPUs.
🏢 Nvidia
AIBullisharXiv – CS AI · Mar 176/10
🧠Researchers developed Temporal Aggregated Convolution (TAC) to accelerate spiking neural networks by aggregating spike frames before convolution, achieving 13.8x speedup on rate-coded data. The study reveals that optimal temporal aggregation strategies depend on data type - collapsing temporal dimensions for rate-coded data while preserving them for event-based data.
🏢 Nvidia
AIBullishHugging Face Blog · Jul 216/105
🧠NVIDIA has partnered with Hugging Face to integrate NIM (NVIDIA Inference Microservices) to accelerate large language model deployment and inference. This collaboration aims to make AI model deployment more efficient and accessible through optimized GPU acceleration on the Hugging Face platform.
AIBullishHugging Face Blog · Oct 226/105
🧠Transformers.js v3 has been released with major upgrades including WebGPU support for enhanced performance, new AI models and tasks capabilities. This update represents a significant advancement in browser-based machine learning infrastructure.
AINeutralHugging Face Blog · Aug 84/107
🧠The article appears to be a technical guide focused on optimizing multi-GPU training for machine learning models, specifically covering ND-Parallel acceleration techniques. This represents educational content aimed at AI practitioners and developers looking to improve computational efficiency in distributed training environments.