AI · Bullish · arXiv – CS AI · May 27 · 7/10
🧠Researchers introduce E3, an automated review assistant that identifies technical concerns in research papers with 90.2% recall, outperforming human reviewers and leading AI models. The system detects unsupported claims, missing ablations, weak baselines, and validity threats. It was evaluated on 100 ICLR 2026 papers using a contamination-resistant backtesting protocol.
🏢 OpenAI · 🏢 Anthropic · 🧠 GPT-5
AI · Neutral · arXiv – CS AI · Feb 27 · 7/10 · 7
🧠A research paper introduces the concept of 'vibe researching', in which AI agents autonomously execute entire research pipelines from idea to submission using specialized skills. The study finds that AI agents excel at speed and methodological tasks but struggle with theoretical originality and tacit knowledge, so the boundary for delegating work in research workflows is cognitive rather than sequential.
AI · Neutral · arXiv – CS AI · 6d ago · 6/10
🧠Researchers introduce AutoSci, an AI-driven system designed to automate the full scientific research lifecycle by managing literature review, experiments, manuscript writing, and peer review responses. The system uses a memory-centric architecture with four specialized modules to maintain structured knowledge, execute research workflows, and continuously improve its procedures through feedback.
AI · Neutral · arXiv – CS AI · May 28 · 6/10
🧠Researchers introduce CiteCheck, a hybrid framework that detects when large language models fabricate or corrupt scientific citations by combining scholarly database retrieval with structured LLM verification. The system achieves 88.7% macro-F1 on a new 982-citation physics benchmark, outperforming GPT, Claude, and Gemini. It targets a critical reliability problem as LLMs become integrated into scientific research workflows.
🧠 Claude · 🧠 Gemini
AI · Bullish · Google Research Blog · May 19 · 6/10
🧠Empirical Research Assistance (ERA) represents a significant advancement in AI-assisted scientific research, transitioning from academic publication to practical computational discovery tools. The development demonstrates how machine learning can accelerate the research process across scientific disciplines, with implications for both the academic and technology sectors.
AI · Neutral · arXiv – CS AI · May 11 · 6/10
🧠Researchers introduce CoCoReviewBench, a new benchmark dataset of 3,900 papers from ICLR and NeurIPS designed to reliably evaluate AI review systems. The benchmark addresses critical gaps in current evaluation methods by prioritizing correctness over mere overlap with human reviews, revealing that existing AI reviewers struggle with hallucinations and reasoning accuracy.
AI · Neutral · arXiv – CS AI · May 1 · 6/10
🧠Researchers introduce RPC-Bench, a large-scale benchmark containing 15,000 human-verified question-answer pairs designed to evaluate how well AI models understand research papers. Testing reveals that even the strongest models like GPT-5 achieve only 68.2% accuracy on comprehension tasks, dropping significantly when conciseness is factored in, exposing critical gaps in academic document understanding.
🧠 GPT-5
AI · Bullish · arXiv – CS AI · Mar 9 · 6/10
🧠A comprehensive survey examines how large multimodal language models are transforming scientific research across five key areas: literature search, idea generation, content creation, multimodal artifact production, and peer review evaluation. The research highlights both the potential for AI-assisted scientific discovery and the ethical concerns regarding research integrity and misuse of generative models.
AI · Bullish · arXiv – CS AI · Mar 3 · 6/10 · 4
🧠Researchers developed a framework that improves AI-generated research ideas by incorporating relevant data during the ideation process. The approach increased idea feasibility by 20% and overall quality by 7%, with human studies confirming that data-augmented AI assistance helps researchers generate higher-quality ideas.
AI · Bullish · OpenAI News · Jan 27 · 6/10 · 7
🧠Prism is a new free LaTeX-native workspace that integrates GPT-5.2 to help researchers write, collaborate, and conduct research in a unified platform. The tool aims to streamline academic and research workflows by combining document preparation with AI-powered reasoning capabilities.
AI · Bullish · OpenAI News · May 30 · 6/10 · 4
🧠OpenAI has launched an affordable AI offering specifically designed for universities to help them integrate artificial intelligence technology into their campus operations responsibly. This education-focused initiative aims to make AI more accessible to academic institutions while ensuring proper governance and implementation.
AI · Neutral · Google Research Blog · Sep 30 · 5/10 · 6
🧠AlphaEvolve is a new AI system designed to advance theoretical computer science research by serving as a research partner. The system focuses on algorithms and theory, potentially accelerating discoveries in fundamental areas of computer science.