AIBullisharXiv – CS AI · May 126/10
🧠Researchers introduce QD-LLM, a framework that evolves lightweight prompt embeddings (~32K parameters) to steer frozen large language models toward diverse outputs without fine-tuning. The approach outperforms existing quality-diversity optimization methods by 46.4% in coverage and demonstrates practical applications in test generation and training data improvement.
🧠 Llama
AINeutralarXiv – CS AI · Apr 206/10
🧠Researchers introduce TeLAPA, a continual reinforcement learning framework that maintains diverse policy archives instead of relying on single-model preservation, addressing the loss of plasticity problem where retained policies fail to serve as effective starting points for rapid adaptation across new tasks.
AIBullisharXiv – CS AI · Mar 166/10
🧠Researchers developed Q-DIG, a red-teaming method that uses Quality Diversity techniques to identify diverse language instruction failures in Vision-Language-Action models for robotics. The approach generates adversarial prompts that expose vulnerabilities in robot behavior and improves task success rates when used for fine-tuning.
AINeutralarXiv – CS AI · Mar 54/10
🧠Researchers present AutoQD, a new AI method that automatically discovers diverse behavioral policies without requiring hand-crafted descriptors. The approach uses mathematical embeddings of policy occupancy measures to enable Quality-Diversity optimization algorithms to find varied high-performing solutions in reinforcement learning tasks.
AINeutralarXiv – CS AI · Mar 24/106
🧠Researchers developed QD-MAPPER, a framework using Quality Diversity algorithms and Neural Cellular Automata to automatically generate diverse maps for evaluating Multi-Agent Path Finding (MAPF) algorithms. This addresses the limitation of testing MAPF algorithms on fixed, human-designed maps that may not cover all scenarios and could lead to overfitting.
AINeutralarXiv – CS AI · Mar 24/106
🧠Researchers propose Mixed Guidance Graph Optimization (MGGO) to improve multi-agent pathfinding systems by optimizing both edge directions and weights in guidance graphs. The paper introduces two MGGO methods, including one using Quality Diversity algorithms with neural networks, to provide stricter guidance for agent movement in lifelong scenarios.