273 articles tagged with #optimization. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AINeutralarXiv โ CS AI ยท Mar 95/10
๐ง Researchers revisited Best-of-N (BoN) sampling for AI alignment and found it's actually optimal when evaluated using win-rate metrics rather than expected true reward. They propose a variant that eliminates reward-hacking vulnerabilities while maintaining optimal performance.
AINeutralarXiv โ CS AI ยท Mar 94/10
๐ง Researchers propose a new reinforcement learning approach for large language models that optimizes for subsets of future rewards rather than full sequences. The method enables comparison of different policy classes and shows varying effectiveness across different conversational AI alignment tasks.
AINeutralarXiv โ CS AI ยท Mar 64/10
๐ง Researchers propose ASFL, an adaptive split federated learning framework that optimizes machine learning model training across wireless networks by splitting computation between clients and central servers. The framework reduces training delay by up to 75% and energy consumption by 80% compared to baseline approaches while maintaining faster convergence rates.
AIBullisharXiv โ CS AI ยท Mar 54/10
๐ง Researchers have developed RADAR, a neural framework that enables AI routing systems to handle asymmetric distance problems in vehicle routing. The system uses advanced mathematical techniques including SVD and Sinkhorn normalization to better solve real-world logistics challenges.
AIBullisharXiv โ CS AI ยท Mar 54/10
๐ง Researchers developed MasCOR, a machine-learning framework for optimizing e-fuel production systems that combines design and operational decisions under renewable energy uncertainty. The system demonstrates near-optimal performance with significantly lower computational costs than traditional mathematical programming approaches.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers propose UrbanHuRo, a two-layer human-robot collaboration framework that jointly optimizes different urban services like delivery and sensing. The system demonstrated 29.7% improvement in sensing coverage and 39.2% increase in courier income while reducing overdue orders through coordinated optimization of heterogeneous services.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers present AutoQD, a new AI method that automatically discovers diverse behavioral policies without requiring hand-crafted descriptors. The approach uses mathematical embeddings of policy occupancy measures to enable Quality-Diversity optimization algorithms to find varied high-performing solutions in reinforcement learning tasks.
AINeutralarXiv โ CS AI ยท Mar 54/10
๐ง Researchers analyzed the implicit bias of the Jordan-Kinderlehrer-Otto (JKO) scheme, a time-discretization method for Wasserstein gradient flow used in optimizing energy functionals over probability measures. They found that the JKO scheme adds a deceleration term at second order that corresponds to canonical implicit biases like Fisher information for entropy and kinetic energy for Riemannian gradient descent.
AINeutralarXiv โ CS AI ยท Mar 44/103
๐ง Researchers propose QuADD (Quantization-aware Dataset Distillation), a new framework that jointly optimizes dataset compression and precision to create more efficient synthetic training datasets. The method integrates differentiable quantization within the distillation process, achieving better accuracy per bit than existing approaches on image classification and 3GPP beam management tasks.
AINeutralarXiv โ CS AI ยท Mar 44/103
๐ง Researchers introduce SynthCharge, a parametric generator for creating diverse electric vehicle routing problem instances with feasibility screening. The tool addresses limitations in existing benchmark datasets by producing scalable, verifiable instances to enable better evaluation of learning-based routing optimization models.
AINeutralarXiv โ CS AI ยท Mar 44/105
๐ง Researchers propose a novel non-parametric method for robust counterfactual inference in Markov Decision Processes that computes tight bounds across all compatible causal models. The approach provides closed-form expressions instead of requiring exponentially complex optimization problems, making it highly efficient and scalable for real-world applications.
AINeutralarXiv โ CS AI ยท Mar 44/102
๐ง Researchers introduce iJKOnet, a new method combining the JKO framework with inverse optimization to learn population dynamics from evolutionary snapshots. The approach uses adversarial training without restrictive architectural requirements and demonstrates improved performance over existing JKO-based methods.
AINeutralarXiv โ CS AI ยท Mar 44/102
๐ง Researchers developed AIGB-Pearl, a new AI-driven auto-bidding system that combines generative planning with policy optimization to improve advertising performance. The system addresses limitations of existing offline reinforcement learning methods by incorporating a trajectory evaluator and safe exploration mechanisms beyond static datasets.
AINeutralarXiv โ CS AI ยท Mar 44/102
๐ง Researchers propose ML-LCA framework to integrate machine learning-based materials discovery with lifecycle assessment for sustainable-by-design materials. The framework addresses the current inefficiency where environmental impacts are evaluated only after resources are invested in potentially unsustainable solutions.
AINeutralarXiv โ CS AI ยท Mar 34/104
๐ง Researchers analyzed scaling laws for signSGD optimization in machine learning, comparing it to standard SGD under a power-law random features model. The study identifies unique effects in signSGD that can lead to steeper compute-optimal scaling laws than SGD in noise-dominant regimes.
AINeutralarXiv โ CS AI ยท Mar 34/104
๐ง Researchers developed a new analysis of KL-regularized multi-armed bandits (MABs) using KL-UCB algorithm, achieving near-optimal regret bounds. The study provides the first high-probability regret bound with linear dependence on the number of arms and establishes matching lower bounds, offering comprehensive understanding across all regularization regimes.
$NEAR
AIBullisharXiv โ CS AI ยท Mar 34/103
๐ง Researchers propose Astral, a new neural network training method for physics-informed neural networks (PiNNs) that uses error majorants instead of residual minimization. The method provides direct upper bounds on errors and demonstrates faster convergence with more reliable error estimation across various partial differential equations.
AIBullisharXiv โ CS AI ยท Mar 34/103
๐ง Researchers developed a Wavelet-Enhanced Convolutional Network to improve tidal current speed forecasting by learning multi-periodic patterns in tidal data. The model achieved a 10-step average Mean Absolute Error of 0.025, demonstrating at least 1.44% error reduction compared to baseline methods.
AINeutralarXiv โ CS AI ยท Mar 34/104
๐ง Researchers developed a quantum annealing approach to solve staff allocation problems across multiple educational sites in Italy. The study demonstrates quantum optimization methods can efficiently handle complex resource allocation tasks in real-world educational scheduling scenarios.
AIBullisharXiv โ CS AI ยท Mar 35/105
๐ง Researchers introduce ADE-CoT (Adaptive Edit-CoT), a new test-time scaling framework that improves image editing efficiency by 2x while maintaining superior performance. The system uses dynamic resource allocation, edit-specific verification, and opportunistic stopping to optimize the image editing process compared to traditional methods.
AINeutralarXiv โ CS AI ยท Feb 274/105
๐ง Researchers published a comprehensive survey on Neural Routing Solvers (NRSs) that use deep learning to solve vehicle routing problems. The study introduces a new hierarchical taxonomy based on heuristic principles and proposes an improved evaluation pipeline that reveals gaps in current research methodologies.
AINeutralarXiv โ CS AI ยท Feb 274/105
๐ง Researchers introduce Causal Computational Asymmetry (CCA), a new method for identifying causal relationships by training neural networks in both directions and determining causality based on which direction converges faster during optimization. The method achieved 26/30 correct causal identifications across synthetic benchmarks and is embedded in a broader Causal Compression Learning framework.
AINeutralarXiv โ CS AI ยท Feb 274/109
๐ง Researchers propose PASTN, a lightweight neural network for large-scale traffic flow prediction that uses positional-aware embeddings and temporal attention mechanisms. The model demonstrates improved efficiency and effectiveness across various geographical scales from counties to entire states.
AINeutralarXiv โ CS AI ยท Feb 274/106
๐ง Researchers have introduced LLM4AD, a unified Python platform that leverages large language models for algorithm design across optimization, machine learning, and scientific discovery domains. The platform features modular components, comprehensive evaluation tools, and extensive support resources including tutorials and a graphical user interface to facilitate LLM-assisted algorithm development.
AIBullishApple Machine Learning ยท Feb 244/103
๐ง Researchers introduce depyf, a new tool designed to make PyTorch 2.x's compiler more transparent for machine learning researchers. The tool decompiles bytecode back into readable source code, helping researchers better understand and utilize the compiler's optimization capabilities.