y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#bug-detection News & Analysis

4 articles tagged with #bug-detection. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

4 articles
AIBullisharXiv โ€“ CS AI ยท Mar 277/10
๐Ÿง 

Model2Kernel: Model-Aware Symbolic Execution For Safe CUDA Kernels

Researchers developed Model2Kernel, a system that automatically detects memory safety bugs in CUDA kernels used for large language model inference. The system discovered 353 previously unknown bugs across popular platforms like vLLM and Hugging Face with only nine false positives.

๐Ÿข Hugging Face
AINeutralarXiv โ€“ CS AI ยท Apr 66/10
๐Ÿง 

GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers

Researchers introduced GBQA, a new benchmark with 30 games and 124 verified bugs to test whether large language models can autonomously discover software bugs. The best-performing model, Claude-4.6-Opus, only identified 48.39% of bugs, highlighting the significant challenges in autonomous bug detection.

๐Ÿง  Claude
AINeutralarXiv โ€“ CS AI ยท Mar 27/1019
๐Ÿง 

Once4All: Skeleton-Guided SMT Solver Fuzzing with LLM-Synthesized Generators

Researchers developed Once4All, an LLM-assisted fuzzing framework for testing SMT solvers that addresses syntax validity issues and computational overhead. The system found 43 confirmed bugs in leading solvers Z3 and cvc5, with 40 already fixed by developers.