AINeutralarXiv – CS AI · Mar 96/10
🧠Researchers have developed ContextBench, a new benchmark for evaluating methods that generate targeted inputs to trigger specific behaviors in language models. The study introduces enhanced Evolutionary Prompt Optimization techniques that better balance effectiveness in activating AI model features while maintaining linguistic fluency.
AIBullisharXiv – CS AI · Mar 66/10
🧠Researchers propose EvoTool, a new framework that optimizes AI agent tool-use policies through evolutionary algorithms rather than traditional gradient-based methods. The system decomposes agent policies into four modules and uses blame attribution and targeted mutations to improve performance, showing over 5-point improvements on benchmarks.
🧠 GPT-4
AIBullisharXiv – CS AI · Mar 36/108
🧠Researchers introduced RAISE, a training-free evolutionary framework that improves text-to-image generation by adaptively refining outputs based on prompt complexity. The system achieves state-of-the-art alignment scores while reducing computational costs by 30-80% compared to existing methods.
AIBullisharXiv – CS AI · Mar 36/105
🧠Researchers introduce CEMMA, a co-evolutionary framework for improving AI safety alignment in multimodal large language models. The system uses evolving adversarial attacks and adaptive defenses to create more robust AI systems that better resist jailbreak attempts while maintaining functionality.
AINeutralarXiv – CS AI · May 124/10
🧠Researchers present RDEx-CASK, an enhanced optimization algorithm that extends RDEx-CSOP with three modifications targeting stagnation issues in constrained single-objective optimization. The method introduces Cauchy-sampled scale factors, a small feasible-only archive, and per-individual stagnation counters that trigger adaptive parameter adjustments, achieving competitive performance on CEC benchmark problems.
AINeutralarXiv – CS AI · Mar 174/10
🧠Researchers developed an evolutionary transfer learning approach to adapt chess AI heuristics for Dragonchess, a 3D chess variant. While direct transfers from Stockfish failed, evolutionary optimization using CMA-ES significantly improved AI performance in this complex multi-layer game environment.
AIBullisharXiv – CS AI · Mar 115/10
🧠Researchers present GenePlan, a framework that uses large language models with evolutionary algorithms to generate domain-specific planners for classical planning tasks in PDDL. The system achieved a 0.91 SAT score across eight benchmark domains, nearly matching state-of-the-art performance while significantly outperforming other LLM-based approaches.
🧠 GPT-4
AIBullisharXiv – CS AI · Mar 115/10
🧠Researchers developed CMA-ES-IG, a new algorithm that helps robots learn user preferences more effectively by incorporating user experience considerations. The algorithm suggests perceptually distinct and informative robot behaviors for users to rank, showing improved scalability, computational efficiency, and user satisfaction compared to existing methods.
AINeutralarXiv – CS AI · Mar 34/105
🧠Researchers have developed Lilium, an automated evolutionary method that uses AI to improve skull-face overlay accuracy in forensic identification of skeletal remains. The system employs a Differential Evolution algorithm with 3D cone-based representation to model soft-tissue variability and outperforms existing state-of-the-art methods.