AINeutralarXiv – CS AI · May 297/10
🧠BioArc introduces a neural architecture search framework that systematically discovers optimal model architectures for biological foundation models, moving beyond generic adaptation of NLP and computer vision models. The research identifies design principles and proposes methods to predict architectures for new biological tasks, providing foundational methodology for next-generation biology-focused AI systems.
AIBullisharXiv – CS AI · May 277/10
🧠Researchers introduce JetViT, a hybrid Vision Transformer architecture that maintains accuracy of state-of-the-art models while delivering up to 1.79x faster throughput and 44.81% lower latency on high-resolution images. The innovation uses post-training attention search to convert full-attention models into efficient hybrid variants by strategically replacing redundant attention blocks.
🏢 Nvidia
AIBullisharXiv – CS AI · May 117/10
🧠XiYOLO is a new energy-efficient object detection framework that uses neural architecture search and scaling techniques to optimize AI models for edge devices with strict power constraints. The system achieves 20-53% energy reductions compared to YOLOv12 baselines across GPU and NPU deployments while maintaining competitive accuracy metrics.
AI × CryptoBullishCrypto Briefing · May 37/10
🤖Ben Fielding discusses how neural architecture search (NAS) automates deep learning model design, emphasizes the necessity of horizontal scaling in distributed systems, and explores blockchain security's role in strengthening consensus algorithms. The convergence of machine learning and blockchain represents a transformative shift comparable to MapReduce's impact on distributed computing.
AIBullisharXiv – CS AI · Mar 177/10
🧠PrototypeNAS is a new zero-shot neural architecture search method that rapidly designs and optimizes deep neural networks for microcontroller units without requiring extensive training. The system uses a three-step approach combining structural optimization, ensemble zero-shot proxies, and Hypervolume subset selection to identify efficient models within minutes that can run on resource-constrained edge devices.
AIBullisharXiv – CS AI · 3d ago6/10
🧠Researchers introduce a differentiable Neural Architecture Search framework that jointly optimizes LLM architecture and mixed-precision quantization, achieving 1.4x faster inference speeds or 6% higher accuracy compared to sequential optimization approaches. This compression technique addresses the critical challenge of deploying large language models on edge devices without requiring extensive GPU training.
AIBullisharXiv – CS AI · 5d ago6/10
🧠M-DESIGN, a new retrieval-augmented framework, addresses the inefficiency gap between expensive neural architecture search and suboptimal model retrieval by dynamically leveraging historical evidence from prior tasks to discover near-optimal network modifications. Tested on 67,760 graph neural networks across 22 datasets, the method achieves state-of-the-art performance in 79% of cases under computational constraints.
AINeutralarXiv – CS AI · 6d ago6/10
🧠EARLY (Evolutionary Algorithm for Reservoir Learning and Yielding) introduces an automated method for optimizing Echo State Networks by evolving both topology and hyperparameters using evolutionary algorithms. The framework demonstrates that evolved architectures outperform random search baselines and adapt their complexity based on task difficulty, suggesting potential for creating reusable neural network structures across diverse temporal learning problems.
AINeutralarXiv – CS AI · May 296/10
🧠Researchers introduce RAISE, a comprehensive framework for optimizing retrieval-augmented generation (RAG) systems by treating architecture design as a hyperparameter search problem. The study evaluates 13 optimization algorithms across seven datasets, revealing that RAG performance is highly task-dependent and no single optimization strategy universally outperforms others, highlighting the need for systematic rather than heuristic-based configuration approaches.
🏢 Meta
AIBullisharXiv – CS AI · May 286/10
🧠Researchers have developed Regression Language Models (RLMs) that use frozen LLM encoders to predict numeric code execution outcomes across multiple programming languages and domains. A 300M parameter model demonstrates strong performance predicting memory footprint, GPU latency, neural network accuracy, and hardware platform performance without domain-specific feature engineering.
AINeutralarXiv – CS AI · May 126/10
🧠CardiacNAS presents an evolutionary neural architecture search framework that optimizes cardiac MRI segmentation models for both accuracy and computational efficiency. The approach achieves 93.22% dice similarity with only 3.58M parameters, demonstrating how resource-aware AI design can enable deployment of medical imaging models on resource-constrained environments.
AIBullisharXiv – CS AI · May 76/10
🧠Researchers introduce Delta-Code Generation, a method where fine-tuned LLMs generate compact code diffs to modify existing neural architectures rather than creating complete models from scratch. The approach achieves significantly higher validity rates (66-75%) and accuracy (64-66%) compared to baseline full-generation methods while reducing output by 75-85%, demonstrating a more efficient paradigm for LLM-driven neural architecture search.
AIBullishLil'Log (Lilian Weng) · Aug 66/10
🧠Neural Architecture Search (NAS) automates the design of neural network architectures to find optimal topologies for specific tasks. The approach systematically explores network architecture spaces through three key components: search space, search algorithms, and child model evolution strategies, potentially discovering better performing models than human-designed architectures.
AINeutralarXiv – CS AI · Mar 34/105
🧠Researchers propose SEval-NAS, a new evaluation mechanism for neural architecture search that converts architectures to strings and predicts performance metrics like accuracy, latency, and memory usage. The method shows particular strength in predicting hardware costs and can be integrated into existing NAS frameworks with minimal changes.