AIBearisharXiv – CS AI · 2d ago7/10
🧠Researchers present DEPO, a reinforcement learning algorithm that enables large language models to evade AI-text detectors through paraphrasing while maintaining semantic fidelity. The constrained optimization approach treats detector evasion as the primary objective with semantic preservation as an explicit constraint, demonstrating robust performance across multiple detectors and datasets.
AIBearisharXiv – CS AI · 6d ago7/10
🧠Researchers evaluated LLM-generated peer reviews for scientific papers using ACL Rolling Review data, finding limited alignment between LLM and human reviews while discovering that authors can strategically game LLM feedback to improve paper scores by up to 35%. The study highlights emerging risks in automated academic review systems as both reviewers and authors increasingly leverage language models.
AIBearisharXiv – CS AI · May 127/10
🧠A new threat called Agentic Denominator Gaming could exploit AI conferences' stable acceptance rates by flooding submissions with low-quality papers generated by automated agents, inflating the denominator to boost legitimate papers' acceptance odds without intending publication of the spam itself. This systemic vulnerability exposes academic peer review to coordinated attacks that would degrade review quality and increase reviewer burnout while requiring institutional policy reforms beyond technical solutions.
AIBearisharXiv – CS AI · May 77/10
🧠A comprehensive bibliometric audit reveals that academic papers evaluating large language models systematically lag behind frontier AI capabilities by a median of 10.85 points on the Epoch AI Capabilities Index, with this gap widening at 5.53 points annually. The study finds that most papers fail to disclose critical configuration details and make broad claims about "AI" capabilities rather than specific tested models, distorting how AI progress is understood in policy and media.
🧠 GPT-4🧠 GPT-5🧠 Claude
AIBearisharXiv – CS AI · Mar 46/102
🧠Researchers developed a method to detect AI-generated content at scale and found that 6.5-16.9% of peer reviews at major AI conferences after ChatGPT's release were substantially modified by LLMs. The study reveals concerning patterns where AI-generated reviews correlate with lower reviewer confidence, last-minute submissions, and reduced engagement in rebuttals.
AINeutralarXiv – CS AI · 2d ago6/10
🧠A pilot study of 24 college students found that constraining LLM access to limited prompts preserves student authorship confidence and perceived ownership while maintaining essay quality, suggesting that moderate restrictions rather than outright bans may optimize AI assistance in educational settings.
AINeutralarXiv – CS AI · May 286/10
🧠Researchers develop a game-theoretic framework modeling how students collectively adopt responsible or opportunistic AI use in academic assessments. The study reveals that small, well-designed changes to assessment incentives can trigger rapid behavioral shifts toward ethical AI practices, whereas policy statements alone typically fail to change behavior.
AIBearishArs Technica – AI · May 156/10
🧠arXiv, the preprint repository for scientific papers, has implemented a policy banning AI-generated content submissions, with violators facing year-long submission bans. A moderator announced the enforcement on social media, signaling the platform's effort to maintain research integrity amid growing concerns about low-quality AI-generated submissions flooding academic repositories.
AIBearisharXiv – CS AI · May 126/10
🧠A new benchmarking framework reveals that AI tools in academic research excel at exploration and summaries but fail at precision tasks requiring exact information extraction. The study demonstrates that explainable AI features are inadequate, forcing researchers to manually verify outputs, and literature review tools lack reproducibility and transparency for systematic research.
🏢 xAI
AINeutralarXiv – CS AI · May 96/10
🧠Researchers propose IntraGuard, a defense framework that embeds hidden safeguards into PDF manuscripts to detect when AI chatbots are used to generate peer reviews instead of human experts. The system achieves 84% success rate in disrupting AI-generated reviews while maintaining transparency for legitimate human reviewers, addressing growing concerns about academic integrity as LLMs proliferate.
AINeutralarXiv – CS AI · May 46/10
🧠A text mining analysis of academic literature reveals that ChatGPT research in programming education emphasizes pedagogical implementation and student engagement while underexploring assessment design and institutional governance. The literature positions ChatGPT ambivalently—as both a valuable learning aid and a source of academic integrity risks—signaling the need for stronger frameworks around responsible AI integration in education.
🧠 ChatGPT
AINeutralarXiv – CS AI · Apr 146/10
🧠A large-scale survey of 457 software engineering researchers reveals that generative AI adoption is widespread in academic research, concentrated primarily in writing and early-stage tasks. While researchers perceive significant productivity gains, persistent concerns about accuracy, bias, and lack of governance frameworks highlight the need for clearer guidelines on responsible AI integration in academic practice.
AINeutralarXiv – CS AI · Apr 146/10
🧠Researchers have introduced C-ReD, a Chinese benchmark dataset for detecting AI-generated text that addresses gaps in model diversity and data homogeneity. The dataset, derived from real-world prompts, demonstrates reliable in-domain detection and strong generalization to unseen language models, with resources publicly available on GitHub.
AINeutralarXiv – CS AI · Apr 106/10
🧠Researchers introduce AI-Sinkhole, an AI-agent augmented DNS-blocking framework that dynamically detects and temporarily blocks LLM chatbot services during proctored exams to prevent academic integrity violations. The system uses quantized LLMs for semantic classification and Pi-Hole for network-wide DNS blocking, achieving robust cross-lingual detection with F1-scores exceeding 0.83.
AINeutralFortune Crypto · Mar 64/10
🧠Economists have developed 'Macro Buddy,' a chatbot designed to help students learn and reason rather than cheat, as survey data shows 90% of 1,100 U.S. college students reported using generative AI in 2025. The tool represents an educational approach to integrating AI into academic settings while addressing concerns about academic integrity.
AINeutralarXiv – CS AI · Mar 54/10
🧠A study of 29 STEM faculty members reveals mixed adoption of generative AI tools in higher education, with educators using AI for content generation and curriculum design while expressing concerns about academic integrity and assessment validity. The research highlights the need for institutional support and rethinking of pedagogical approaches to effectively integrate AI technologies into educational settings.