AINeutralarXiv โ CS AI ยท 4h ago5
๐ง Researchers have developed an agentic LLM framework using Retrieval-Augmented Generation to automate adverse media screening for anti-money laundering compliance in financial institutions. The system addresses high false-positive rates in traditional keyword-based approaches by implementing multi-step web searches and computing Adverse Media Index scores to distinguish between high-risk and low-risk individuals.
AIBullisharXiv โ CS AI ยท 4h ago4
๐ง Researchers introduce RF-Agent, a framework that uses Large Language Models as agents to automatically design reward functions for control tasks through Monte Carlo Tree Search. The method improves upon existing approaches by better utilizing historical feedback and enhancing search efficiency across 17 diverse low-level control tasks.
AINeutralarXiv โ CS AI ยท 4h ago5
๐ง Researchers introduce Jailbreak Foundry (JBF), a system that automatically converts AI jailbreak research papers into executable code modules for standardized testing. The system successfully reproduced 30 attacks with high accuracy and reduces implementation code by nearly half while enabling consistent evaluation across multiple AI models.
AIBullisharXiv โ CS AI ยท 4h ago4
๐ง Researchers propose SafeGen-LLM, a new approach to enhance safety in robotic task planning by combining supervised fine-tuning with policy optimization guided by formal verification. The system demonstrates superior safety generalization across multiple domains compared to existing classical planners, reinforcement learning methods, and base large language models.
AIBullisharXiv โ CS AI ยท 4h ago4
๐ง Researchers introduce CowPilot, a framework that combines autonomous AI agents with human collaboration for web navigation tasks. The system achieved 95% success rate while requiring humans to perform only 15.2% of total steps, demonstrating effective human-AI cooperation for complex web tasks.
AIBullisharXiv โ CS AI ยท 4h ago6
๐ง Researchers introduce CoMind, a multi-agent AI system that leverages community knowledge to automate machine learning engineering tasks. The system achieved a 36% medal rate on 75 past Kaggle competitions and outperformed 92.6% of human competitors in eight live competitions, establishing new state-of-the-art performance.
AIBullisharXiv โ CS AI ยท 4h ago4
๐ง Researchers have developed Radiologist Copilot, an AI agentic framework that orchestrates specialized tools to complete the entire radiology reporting workflow beyond simple report generation. The system integrates image localization, interpretation, template selection, report composition, and quality control to support radiologists throughout the comprehensive reporting process.
AIBullisharXiv โ CS AI ยท 4h ago5
๐ง Researchers introduced AC3 (Actor-Critic for Continuous Chunks), a new reinforcement learning framework that addresses challenges in long-horizon robotic manipulation tasks with sparse rewards. The system uses continuous action chunks with stabilization mechanisms and achieved superior performance on 25 benchmark tasks using minimal demonstrations.
AIBullisharXiv โ CS AI ยท 4h ago7
๐ง Researchers developed LIA, a supervised fine-tuning approach using DeepSeek-R1-Distill-Llama-8B to automatically assign software issues to developers. The system achieved up to 187.8% improvement over the base model and 211.2% better performance than existing methods in developer recommendation accuracy.
AIBullisharXiv โ CS AI ยท 4h ago4
๐ง Researchers have developed GenAI-Net, a generative AI framework that automates the design of chemical reaction networks (CRNs) for synthetic biology applications. The system can automatically generate biomolecular circuits for various functions including logic gates, oscillators, and classifiers, potentially accelerating the development of biomanufacturing and therapeutic technologies.
AIBullisharXiv โ CS AI ยท 4h ago8
๐ง Researchers developed DECO, a multimodal diffusion transformer for bimanual robot manipulation that integrates vision, proprioception, and tactile signals. The system achieved 72.25% success rate on complex manipulation tasks, with a 21% improvement over baseline methods when tested on over 2,000 robot rollouts.
AINeutralarXiv โ CS AI ยท 4h ago4
๐ง Researchers have developed a hierarchical AI agent system that can automatically modify urban planning layouts using natural language instructions and GeoJSON data. The system decomposes editing tasks into geometric operations across multiple spatial levels and includes validation mechanisms to ensure spatial consistency during multi-step urban modifications.
$MATIC
AIBullisharXiv โ CS AI ยท 4h ago1
๐ง Researchers introduce Channel-of-Mobile-Experts (CoME), a new AI agent architecture that uses four specialized experts to handle different reasoning stages for mobile device automation. The system employs progressive training strategies and information gain-driven optimization to improve mobile agent performance on complex tasks.
AINeutralarXiv โ CS AI ยท 4h ago1
๐ง Researchers have developed MEDIC, a neural network framework for Data Quality Monitoring (DQM) in particle physics experiments that uses machine learning to automatically detect detector anomalies and identify malfunctioning components. The simulation-driven approach using modified Delphes detector simulation represents an initial step toward comprehensive ML-based DQM systems for future particle detectors.