#automated-repair News & Analysis

3 articles tagged with #automated-repair. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles

AIBullisharXiv – CS AI · Jun 236/10

🧠

From Empirical Evaluation to Context-Aware Enhancement: Repairing Regression Errors with LLMs

Researchers introduce RegressionBug4APR, a benchmark of 200 real-world Java and Python regression bugs, to evaluate automated program repair (APR) techniques. The study finds that traditional APR tools fail entirely on regression bugs, while LLM-based approaches show promise, achieving 1.6x better results when enhanced with bug-inducing change context.

AIBullisharXiv – CS AI · Apr 106/10

🧠

Fine-grained Approaches for Confidence Calibration of LLMs in Automated Code Revision

Researchers propose fine-grained confidence calibration methods for large language models in automated code revision tasks, addressing the limitation of traditional global calibration approaches. By applying local Platt-scaling to task-specific confidence scores, the study demonstrates improved calibration accuracy across multiple code repair and refinement tasks, enabling developers to better trust LLM outputs.

AIBearisharXiv – CS AI · Mar 126/10

🧠

Why LLMs Fail: A Failure Analysis and Partial Success Measurement for Automated Security Patch Generation

A research study analyzing 319 LLM-generated security patches found that only 24.8% achieve full correctness, with most failures due to semantic misunderstanding rather than syntax errors. LLMs preserve functionality well but struggle significantly with security fixes, with success rates varying dramatically by vulnerability type.