AI · Neutral · arXiv – CS AI · 9h ago · 6/10
Learning to replenish: A hybrid deep reinforcement learning for dynamic inventory management in the pharmaceutical supply chains
Researchers propose a hybrid deep reinforcement learning algorithm (A3C DPPO) to optimize inventory replenishment in pharmaceutical supply chains, where demand is unpredictable, lead times vary, and products have limited shelf lives. The approach reduces costs relative to benchmark methods while maintaining service levels, and it is validated on real-world pharmaceutical data.