AIBearishArs Technica – AI · 4d ago7/10
🧠Microsoft-packaged software repositories were compromised for the second time in weeks with 73 malicious packages containing credential-stealing malware that automatically executes when opened by AI agents. This represents a significant supply chain vulnerability affecting automated development workflows and highlights growing threats to AI-driven software development practices.
AIBearisharXiv – CS AI · 5d ago7/10
🧠Researchers have discovered that large language models generate code with recurring, predictable vulnerabilities that can be exploited through a black-box attack called FSTab. The technique achieves up to 94% attack success by identifying patterns in LLM-generated software without requiring access to source code, raising critical security concerns for production systems relying on AI code generation.
🧠 GPT-5🧠 Claude🧠 Gemini
AIBearisharXiv – CS AI · Jun 47/10
🧠Researchers discovered that incidental contextual cues in prompts systematically steer LLM code generation toward different algorithms, even when all outputs are functionally correct. Across 46,535 experiments, subtle variations in wording and metadata produced algorithm-choice shifts up to 100 percentage points, creating unpredictable performance and security outcomes in production code.
AINeutralCrypto Briefing · Jun 17/10
🧠Nvidia CEO Jensen Huang reported that AI-generated code commits on GitHub surged to 1.4 billion in 2026, tripling from previous levels. While this demonstrates significant productivity gains from AI-assisted development, it raises substantial questions about code quality, security vulnerabilities, and the adequacy of current review processes.
🏢 Nvidia
AIBullisharXiv – CS AI · May 287/10
🧠Researchers introduce VULPO, an on-policy LLM optimization framework for vulnerability detection that achieves 203% improvement over baseline models by incorporating context-aware reasoning and multidimensional reward signals. The approach combines a new ContextVul dataset with specialized fine-tuning to create more effective security analysis tools that reason through complex code interactions.
AIBearisharXiv – CS AI · May 117/10
🧠A comprehensive survey of 87 machine learning vulnerability detection studies reveals that the field has stalled despite a decade of research, trapped in self-reinforcing feedback loops that optimize for narrow, artificial problems. Researchers identify twelve interconnected pain points spanning datasets, formulations, metrics, and evaluation approaches that perpetuate focus on binary C/C++ function-level classification while neglecting vulnerability type prediction, multilingual support, and broader detection granularities.
AINeutralarXiv – CS AI · Mar 46/105
🧠Researchers propose Human-Certified Module Repositories (HCMRs) as a new framework to ensure trustworthy software development in the AI era. The system combines human oversight with automated analysis to certify and curate reusable code modules, addressing growing security concerns as AI increasingly generates and assembles software components.
AIBullishOpenAI News · Oct 307/106
🧠OpenAI has launched Aardvark, an AI-powered autonomous security researcher that can find, validate, and help fix software vulnerabilities at scale. The system is currently in private beta with early testing available through sign-up.
AINeutralarXiv – CS AI · Jun 46/10
🧠Researchers conducted a reproducibility study of Vul-RAG, a RAG-based framework for detecting software vulnerabilities using LLMs, and found that while results are reproducible with open-weight models, performance plateaus around 0.30 pairwise accuracy regardless of model sophistication. The findings suggest that simply scaling up model capacity does not substantially improve vulnerability detection capabilities.
AINeutralarXiv – CS AI · May 276/10
🧠ConVer is a compositional verification tool that leverages large language models and contract synthesis to formally verify C programs more efficiently than traditional bounded model checking. The tool achieves 82-96% success on simple benchmarks and 67% on complex programs, demonstrating significant progress in automated software verification despite limitations on recursive and loop-intensive code.
AI × CryptoNeutralarXiv – CS AI · May 276/10
🤖Researchers propose a cryptographic registry provenance system to prevent dependency confusion attacks in software ecosystems by requiring mandatory publisher signatures, cryptographic registry identity, registry countersignatures, and consumer-side enforcement. Analysis of eight major ecosystems reveals none currently implement all four defense layers, leaving package managers vulnerable to attacks that exploit the lack of provenance verification.
GeneralBearishArs Technica – AI · May 186/10
📰Bug bounty platforms are being overwhelmed by low-quality AI-generated submissions that waste time and resources, straining corporate vulnerability disclosure programs. This surge reflects broader challenges in maintaining security reward schemes as AI tools democratize report generation without improving actual security research quality.
AIBullisharXiv – CS AI · May 126/10
🧠Researchers introduce VulTriage, an LLM-based framework that enhances vulnerability detection in source code through triple-path context augmentation combining control flow analysis, vulnerability knowledge retrieval, and semantic summarization. The approach achieves state-of-the-art results on benchmark datasets and demonstrates strong generalization to low-resource scenarios.
AINeutralThe Register – AI · Apr 136/10
🧠Linux 7.0 has been released as Linus Torvalds explores how AI could enhance bug detection and streamline the kernel development process. The milestone reflects the open-source community's growing interest in leveraging AI tools to improve software quality and development workflows.
AI × CryptoBullishCrypto Briefing · Apr 117/10
🤖Gavriel Cohen discusses how open-source projects drive AI innovation through community collaboration, highlighting NanoClaw's rapid growth as a case study. The analysis covers the commercial viability of AI-native service companies with high-margin potential and addresses critical security vulnerabilities in modern software architecture that developers must prioritize.
AIBearisharXiv – CS AI · Mar 126/10
🧠A research study analyzing 319 LLM-generated security patches found that only 24.8% achieve full correctness, with most failures due to semantic misunderstanding rather than syntax errors. LLMs preserve functionality well but struggle significantly with security fixes, with success rates varying dramatically by vulnerability type.
AINeutralarXiv – CS AI · Mar 96/10
🧠Researchers have developed ESAA-Security, a new architecture for conducting secure, verifiable audits of AI-generated code using structured agent workflows rather than unstructured LLM conversations. The system creates an immutable audit trail through event-sourcing and produces comprehensive security reports across 26 tasks and 95 executable checks.
AIBullisharXiv – CS AI · Mar 26/1012
🧠Researchers developed Hybrid Class-Aware Selective Replay (Hybrid-CASR), a continual learning method that improves AI-based software vulnerability detection by addressing catastrophic forgetting in temporal scenarios. The method achieved 0.667 Macro-F1 score while reducing training time by 17% compared to baseline approaches on CVE data from 2018-2024.