y0news
#code-generation7 articles
7 articles
AIBullisharXiv โ€“ CS AI ยท 4h ago2
๐Ÿง 

SWE-Hub: A Unified Production System for Scalable, Executable Software Engineering Tasks

Researchers introduce SWE-Hub, a comprehensive system for generating scalable, executable software engineering tasks for training AI agents. The platform addresses current limitations in AI software development by providing unified environment automation, bug synthesis, and diverse task generation across multiple programming languages.

AIBullisharXiv โ€“ CS AI ยท 4h ago2
๐Ÿง 

MIST-RL: Mutation-based Incremental Suite Testing via Reinforcement Learning

Researchers propose MIST-RL, a reinforcement learning framework that improves AI code generation by creating more efficient test suites. The method achieves 28.5% higher fault detection while using 19.3% fewer test cases, demonstrating significant improvements in AI code verification efficiency.

AIBullisharXiv โ€“ CS AI ยท 4h ago1
๐Ÿง 

What Papers Don't Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction

Researchers propose a new framework called 'method' that addresses the challenge of automated paper reproduction by recovering tacit knowledge that academic papers leave implicit. The graph-based agent framework achieves 10.04% performance gap against official implementations, improving over baselines by 24.68% across 40 recent papers.

$LINK
AIBullisharXiv โ€“ CS AI ยท 4h ago1
๐Ÿง 

RepoRepair: Leveraging Code Documentation for Repository-Level Automated Program Repair

RepoRepair is a new AI-powered automated program repair system that uses hierarchical code documentation to fix bugs across entire software repositories. The system achieves a 45.7% repair rate on SWE-bench Lite at $0.44 per fix by leveraging LLMs like DeepSeek-V3 and Claude-4 for fault localization and code repair.

AIBullisharXiv โ€“ CS AI ยท 4h ago2
๐Ÿง 

Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision

Researchers developed a new inference-time safety mechanism for code-generating AI models that uses retrieval-augmented generation to identify and fix security vulnerabilities in real-time. The approach leverages Stack Overflow discussions to guide AI code revision without requiring model retraining, improving security while maintaining interpretability.

AIBullisharXiv โ€“ CS AI ยท 4h ago1
๐Ÿง 

LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models

Researchers propose Likelihood-Free Policy Optimization (LFPO), a new framework for improving Diffusion Large Language Models by bypassing likelihood computation issues that plague existing methods. LFPO uses geometric velocity rectification to optimize denoising logits directly, achieving better performance on code and reasoning tasks while reducing inference time by 20%.

AINeutralarXiv โ€“ CS AI ยท 4h ago0
๐Ÿง 

Agentic Scientific Simulation: Execution-Grounded Model Construction and Reconstruction

Researchers introduce JutulGPT, an AI agent system for physics-based simulation that addresses the problem of underspecified natural language descriptions in scientific modeling. The system uses an execution-grounded approach where the simulator validates physical accuracy, but reveals limitations in tracking tacit assumptions made through simulator defaults.