#neural-architecture-search News & Analysis

23 articles tagged with #neural-architecture-search. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

23 articles

AIBullisharXiv – CS AI · Jun 257/10

🧠

On-Device Neural Architecture Search

Researchers propose a Neural Architecture Search (NAS) system that runs directly on edge devices like Raspberry Pi to automatically design optimized neural networks for real-time sensor data analysis. Validated on sign language recognition and fault diagnosis tasks, the approach achieves superior performance with significantly lower memory requirements compared to existing methods, enabling personalized AI models that adapt to individual users without cloud dependency.

AIBullisharXiv – CS AI · Jun 107/10

🧠

LLM-Guided Neural Architecture Search for Robust Co-Design of Physical Neural Networks

Researchers introduce UH-NAS, an LLM-guided neural architecture search framework that optimizes neural networks for unconventional hardware platforms by co-designing for accuracy and hardware-specific constraints like energy efficiency and physical imperfections. The approach demonstrates superior performance on optical computing hardware compared to existing methods, advancing the practical deployment of AI on emerging computing substrates.

🏢 Meta

AINeutralCrypto Briefing · Jun 67/10

🧠

OpenAI’s next model is being designed by AI itself, says SoftBank’s Masayoshi Son

SoftBank's Masayoshi Son claims OpenAI's next AI model is being designed by AI itself rather than humans, suggesting accelerated development toward superintelligence. This development could fundamentally reshape industries and societal structures within years if the claim proves accurate.

🏢 OpenAI

AINeutralarXiv – CS AI · May 297/10

🧠

BioArc: Discovering Optimal Neural Architectures for Biological Foundation Models

BioArc introduces a neural architecture search framework that systematically discovers optimal model architectures for biological foundation models, moving beyond generic adaptation of NLP and computer vision models. The research identifies design principles and proposes methods to predict architectures for new biological tasks, providing foundational methodology for next-generation biology-focused AI systems.

AIBullisharXiv – CS AI · May 277/10

🧠

JetViT: Efficient High-Resolution Vision Transformer with Post-Training Attention Search

Researchers introduce JetViT, a hybrid Vision Transformer architecture that maintains accuracy of state-of-the-art models while delivering up to 1.79x faster throughput and 44.81% lower latency on high-resolution images. The innovation uses post-training attention search to convert full-attention models into efficient hybrid variants by strategically replacing redundant attention blocks.

🏢 Nvidia

AIBullisharXiv – CS AI · May 117/10

🧠

XiYOLO: Energy-Aware Object Detection via Iterative Architecture Search and Scaling

XiYOLO is a new energy-efficient object detection framework that uses neural architecture search and scaling techniques to optimize AI models for edge devices with strict power constraints. The system achieves 20-53% energy reductions compared to YOLOv12 baselines across GPU and NPU deployments while maintaining competitive accuracy metrics.

AI × CryptoBullishCrypto Briefing · May 37/10

🤖

Ben Fielding: Neural architecture search automates deep learning, the shift to horizontal scaling is essential, and blockchain security enhances consensus algorithms | Unchained

Ben Fielding discusses how neural architecture search (NAS) automates deep learning model design, emphasizes the necessity of horizontal scaling in distributed systems, and explores blockchain security's role in strengthening consensus algorithms. The convergence of machine learning and blockchain represents a transformative shift comparable to MapReduce's impact on distributed computing.

AIBullisharXiv – CS AI · Mar 177/10

🧠

PrototypeNAS: Rapid Design of Deep Neural Networks for Microcontroller Units

PrototypeNAS is a new zero-shot neural architecture search method that rapidly designs and optimizes deep neural networks for microcontroller units without requiring extensive training. The system uses a three-step approach combining structural optimization, ensemble zero-shot proxies, and Hypervolume subset selection to identify efficient models within minutes that can run on resource-constrained edge devices.

AINeutralarXiv – CS AI · Jun 235/10

🧠

Neural Architecture Search of Sample Reweighting Networks for Complex Distribution Shift

Researchers enhance Meta-Weight-Net (MW-Net), a neural network for sample reweighting under distribution shifts, by applying neural architecture search to optimize its structure. The improved approach better handles combined label noise and class imbalance problems that degrade standard MW-Net performance, demonstrating effectiveness on CIFAR-10 and CIFAR-100 datasets.

AIBullisharXiv – CS AI · Jun 106/10

🧠

Importance-Aware Scheduling for High-Dimensional Hyperparameter Optimization

Researchers propose Greedy Importance First (GIF), a novel hyperparameter optimization strategy that uses importance-based scheduling to improve efficiency in high-dimensional ML/DL model training. The method outperforms established optimizers like TPE and BOHB on high-dimensional benchmarks by focusing computational resources on the most impactful hyperparameters.

AIBullisharXiv – CS AI · Jun 106/10

🧠

Towards Autonomous Accelerator Design: FPGA Accelerator Generation with SECDA

Researchers have developed SECDA-DSE, a framework that integrates Large Language Models into FPGA accelerator design to automate hardware-software co-design exploration. The system successfully generated three different accelerator designs that were synthesized and executed on actual FPGA hardware, demonstrating LLM-guided design space exploration can reduce development time while capturing architecture-specific trade-offs.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Seq103: A Unified Neuroevolution Framework for Compact Sequence Architecture Discovery

Seq103 introduces a unified neuroevolution framework that automatically discovers compact neural network architectures for sequence tasks, achieving 81-87% of baseline accuracy while using 11-3,200x fewer parameters. The framework applies the same evolutionary search pipeline to both recurrent and feedforward sequence classification, offering significant efficiency gains for resource-constrained deployments.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Differentiable Efficient Operator Search

Researchers propose Efficient Operator Search, a differentiable framework that automates the design of token-reduction operators for multimodal foundation models. The approach unifies previously distinct manual techniques like pruning and merging into a shared search space, discovering hybrid operators that achieve better accuracy-efficiency trade-offs than hand-designed baselines.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Surrogate Neural Architecture Codesign Package (SNAC-Pack)

SNAC-Pack is an open-source AutoML framework that automates neural architecture design for FPGA deployment by combining hardware-aware search with quantization and pruning. The tool reduces design cycles from months to hours while matching or exceeding baseline performance on tasks like jet classification and quantum computing applications.

AIBullisharXiv – CS AI · Jun 46/10

🧠

LLM Compression with Jointly Optimizing Architectural and Quantization choices

Researchers introduce a differentiable Neural Architecture Search framework that jointly optimizes LLM architecture and mixed-precision quantization, achieving 1.4x faster inference speeds or 6% higher accuracy compared to sequential optimization approaches. This compression technique addresses the critical challenge of deploying large language models on edge devices without requiring extensive GPU training.

AIBullisharXiv – CS AI · Jun 26/10

🧠

Beyond Model Base Retrieval: Weaving Knowledge to Master Fine-grained Neural Network Design

M-DESIGN, a new retrieval-augmented framework, addresses the inefficiency gap between expensive neural architecture search and suboptimal model retrieval by dynamically leveraging historical evidence from prior tasks to discover near-optimal network modifications. Tested on 67,760 graph neural networks across 22 datasets, the method achieves state-of-the-art performance in 79% of cases under computational constraints.

AINeutralarXiv – CS AI · Jun 16/10

🧠

Evolutionary Algorithm for Reservoir Learning and Yielding

EARLY (Evolutionary Algorithm for Reservoir Learning and Yielding) introduces an automated method for optimizing Echo State Networks by evolving both topology and hyperparameters using evolutionary algorithms. The framework demonstrates that evolved architectures outperform random search baselines and adapt their complexity based on task difficulty, suggesting potential for creating reusable neural network structures across diverse temporal learning problems.

AINeutralarXiv – CS AI · May 296/10

🧠

RAISE: RAG Design as an Architecture Search Problem

Researchers introduce RAISE, a comprehensive framework for optimizing retrieval-augmented generation (RAG) systems by treating architecture design as a hyperparameter search problem. The study evaluates 13 optimization algorithms across seven datasets, revealing that RAG performance is highly task-dependent and no single optimization strategy universally outperforms others, highlighting the need for systematic rather than heuristic-based configuration approaches.

🏢 Meta

AIBullisharXiv – CS AI · May 286/10

🧠

Regression Language Models for Code

Researchers have developed Regression Language Models (RLMs) that use frozen LLM encoders to predict numeric code execution outcomes across multiple programming languages and domains. A 300M parameter model demonstrates strong performance predicting memory footprint, GPU latency, neural network accuracy, and hardware platform performance without domain-specific feature engineering.

AINeutralarXiv – CS AI · May 126/10

🧠

Resource-Aware Evolutionary Neural Architecture Search for Cardiac MRI Segmentation

CardiacNAS presents an evolutionary neural architecture search framework that optimizes cardiac MRI segmentation models for both accuracy and computational efficiency. The approach achieves 93.22% dice similarity with only 3.58M parameters, demonstrating how resource-aware AI design can enable deployment of medical imaging models on resource-constrained environments.

AIBullisharXiv – CS AI · May 76/10

🧠

Delta-Based Neural Architecture Search: LLM Fine-Tuning via Code Diffs

Researchers introduce Delta-Code Generation, a method where fine-tuned LLMs generate compact code diffs to modify existing neural architectures rather than creating complete models from scratch. The approach achieves significantly higher validity rates (66-75%) and accuracy (64-66%) compared to baseline full-generation methods while reducing output by 75-85%, demonstrating a more efficient paradigm for LLM-driven neural architecture search.

AIBullishLil'Log (Lilian Weng) · Aug 66/10

🧠

Neural Architecture Search

Neural Architecture Search (NAS) automates the design of neural network architectures to find optimal topologies for specific tasks. The approach systematically explores network architecture spaces through three key components: search space, search algorithms, and child model evolution strategies, potentially discovering better performing models than human-designed architectures.

AINeutralarXiv – CS AI · Mar 34/105

🧠

SEval-NAS: A Search-Agnostic Evaluation for Neural Architecture Search

Researchers propose SEval-NAS, a new evaluation mechanism for neural architecture search that converts architectures to strings and predicts performance metrics like accuracy, latency, and memory usage. The method shows particular strength in predicting hardware costs and can be integrated into existing NAS frameworks with minimal changes.