AINeutralarXiv – CS AI · May 97/10
🧠Researchers propose BehaviorGuard, an online defense framework against backdoor attacks in deep reinforcement learning that detects malicious behavior by analyzing action distribution shifts rather than relying on reward anomalies or model fine-tuning. The approach works in both single and multi-agent DRL environments and demonstrates superior efficacy and efficiency compared to existing defense methods.
AIBullisharXiv – CS AI · Mar 47/102
🧠Researchers propose MIStar, a memory-enhanced improvement search framework using heterogeneous graph neural networks for flexible job-shop scheduling problems in smart manufacturing. The approach significantly outperforms traditional heuristics and state-of-the-art deep reinforcement learning methods in optimizing production schedules.
$NEAR
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers introduce PIRS (Physics-Informed Reward Shaping), a method that improves deep reinforcement learning controllers for building energy management by replacing ad-hoc comfort metrics with ISO 7730 Predicted Mean Vote (PMV) standards. Tested on CityLearn v2.1.2, PIRS demonstrates competitive performance against manual baselines while substantially outperforming non-physics-grounded approaches in load ramping and peak demand metrics.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers introduce Global PSRO, an improved algorithm for computing Nash equilibria in two-player zero-sum games by using Population Exploitability metrics to guide strategy expansion more efficiently than existing methods. The approach reduces computational requirements while achieving better approximations of equilibrium solutions, advancing game-theoretic AI applications.
AIBullisharXiv – CS AI · 3d ago6/10
🧠Researchers developed SMamba-DDPG, a deep reinforcement learning framework that models how pedestrians behave differently when interacting with autonomous vehicles versus human-driven vehicles. The study found that pedestrians react faster to AVs and adopt lower crossing speeds, with AV interactions showing lower conflict rates than HDV scenarios.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers propose a novel framework for visualizing latent motion phase structures in deep reinforcement learning locomotion policies by extending clustering features beyond state observations to include actions and next states. The method successfully identifies clearer phase transition patterns across three MuJoCo environments, advancing interpretability of neural network-based control policies.
AINeutralarXiv – CS AI · 4d ago6/10
🧠A comprehensive benchmark study reveals that properly calibrated rule-based autoscalers outperform six mainstream deep reinforcement learning algorithms on cost in adaptive resource control tasks. The research challenges assumptions about DRL superiority, identifying baseline calibration and reward engineering as greater bottlenecks than algorithm selection.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers propose R2E-IG, a deep reinforcement learning model using mixture-of-experts architecture to improve vehicle routing problem solutions across different data distributions. The approach combines residual-refined expert modules with instance-level gating and dynamic weight adaptation training, achieving competitive performance on both standard and out-of-distribution test cases.
AINeutralarXiv – CS AI · 4d ago6/10
🧠This academic survey examines deep reinforcement learning (DRL) approaches for optimizing computational offloading in vehicular edge computing systems. The research classifies existing DRL strategies across learning paradigms, system architectures, and optimization objectives while identifying challenges in scalability and coordination for next-generation intelligent transportation systems.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers propose DRLHQ, a deep reinforcement learning approach with heterogeneous query attention mechanisms to solve capacitated location-routing problems (CLRPs) and their open variants. This marks the first end-to-end learning framework for CLRPs, demonstrating superior performance over traditional and DRL-based baselines on benchmark datasets.
AIBullisharXiv – CS AI · Mar 176/10
🧠Researchers developed a Hierarchical Takagi-Sugeno-Kang Fuzzy Classifier System that converts opaque deep reinforcement learning agents into human-readable IF-THEN rules, achieving 81.48% fidelity in tests. The framework addresses the critical explainability problem in AI systems used for safety-critical applications by providing interpretable rules that humans can verify and understand.
AIBullisharXiv – CS AI · Mar 45/102
🧠Researchers developed a deep reinforcement learning approach to optimize beam management in millimeter-wave radio access networks, achieving up to 16% throughput improvements and 3-7x latency reduction. The method uses adaptive beam selection based on real-time observations to enhance multi-user MIMO performance in practical network setups.
AIBullisharXiv – CS AI · Mar 37/107
🧠Researchers have developed a new framework that combines Large Language Models (LLMs) with Deep Reinforcement Learning to improve data efficiency, interpretability, and cross-environment transferability. The approach uses LLMs to map natural language instructions into executable rules and create semantically annotated options for better skill reuse and constraint monitoring.
AIBullisharXiv – CS AI · Mar 27/1011
🧠Researchers developed a deep reinforcement learning approach using heterogeneous graph networks to solve Flexible Job Shop Scheduling Problems with limited buffers and material kitting constraints. The method outperforms traditional heuristics by improving buffer utilization and decision quality through better modeling of complex dependencies in production scheduling.
AIBullisharXiv – CS AI · Mar 26/1015
🧠Researchers developed LACE-RL, a deep reinforcement learning framework that optimizes serverless computing by balancing cold-start latency and carbon emissions. The system dynamically adjusts keep-alive durations based on real-time carbon intensity and workload patterns, achieving 51.69% fewer cold starts and 77.08% lower idle carbon emissions compared to static policies.
AIBullisharXiv – CS AI · Feb 276/104
🧠Researchers propose an agentic AI framework using multiple LLM-based agents to optimize cell-free Open RAN networks through intent-driven automation. The system reduces active radio units by 42% in energy-saving mode while cutting memory usage by 92% through parameter-efficient fine-tuning.
AINeutralarXiv – CS AI · Mar 44/103
🧠Researchers propose DRL-GS, a deep reinforcement learning algorithm that optimizes network topology design by combining a verifier, graph neural network, and DRL agent. The approach addresses limitations of traditional heuristic methods by efficiently searching large topology spaces while incorporating management constraints.
$LINK
AINeutralarXiv – CS AI · Mar 34/103
🧠A research paper surveys the application of deep reinforcement learning (DRL) to network intrusion detection systems, finding that while DRL shows promise and occasionally outperforms traditional methods, many technologies remain underexplored. The study identifies key challenges including training efficiency, minority attack detection, and dataset imbalances, while proposing integration with generative methods for improved performance.
AINeutralOpenAI News · Nov 214/103
🧠The article title references benchmarking safe exploration techniques in deep reinforcement learning, which is a critical area of AI research focused on developing algorithms that can learn while avoiding harmful or dangerous actions. However, no article body content was provided for analysis.
AINeutralOpenAI News · Nov 153/105
🧠This appears to be an academic research paper exploring count-based exploration methods in deep reinforcement learning. The article body is empty, preventing detailed analysis of the research findings or methodology.
AINeutralHugging Face Blog · May 41/106
🧠The article appears to be incomplete or improperly formatted, containing only a title about deep reinforcement learning with no substantive content in the body. Without proper article content, no meaningful analysis of deep reinforcement learning concepts, applications, or market implications can be provided.