#academic-ai News & Analysis

14 articles tagged with #academic-ai. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

14 articles

AIBullisharXiv – CS AI · Jun 97/10

🧠

Advancing Mathematics Research with AI-Driven Formal Proof Search

Researchers demonstrated that AI-driven formal proof systems can autonomously solve open mathematics problems, resolving 9 Erdős problems and 44 OEIS conjectures at modest computational cost. This breakthrough validates LLMs as practical research tools when combined with formal verification systems like Lean, marking the first large-scale evaluation of this approach on genuinely open problems.

AIBullisharXiv – CS AI · May 277/10

🧠

E3: Issue-Level Backtesting for Automated Research Critique

Researchers introduce E3, an automated review assistant that identifies technical concerns in research papers with 90.2% recall—outperforming human reviewers and leading AI models. The system detects unsupported claims, missing ablations, weak baselines, and validity threats, with evaluation conducted on 100 ICLR 2026 papers using a contamination-resistant backtesting protocol.

🏢 OpenAI🏢 Anthropic🧠 GPT-5

AINeutralarXiv – CS AI · Feb 277/107

🧠

Vibe Researching as Wolf Coming: Can AI Agents with Skills Replace or Augment Social Scientists?

A research paper introduces the concept of 'vibe researching' where AI agents can autonomously execute entire research pipelines from idea to submission using specialized skills. The study analyzes how AI agents excel at speed and methodological tasks but struggle with theoretical originality and tacit knowledge, creating a cognitive rather than sequential delegation boundary in research workflows.

AIBullishCrypto Briefing · Jun 106/10

🧠

Stanford University deploys Marlowe DGX SuperPOD with 248 Nvidia GPUs for research access

Stanford University has deployed a Marlowe DGX SuperPOD equipped with 248 Nvidia GPUs to support research initiatives, enhancing the institution's computational capabilities and reducing dependence on cloud infrastructure. The deployment signals a broader trend of academic institutions investing in on-premises AI infrastructure to maintain research independence and efficiency.

🏢 Nvidia

AINeutralarXiv – CS AI · Jun 16/10

🧠

AutoSci: A Memory-Centric Agentic System for the Full Scientific Research Lifecycle

Researchers introduce AutoSci, an AI-driven system designed to automate the full scientific research lifecycle by managing literature review, experiments, manuscript writing, and peer review responses. The system uses a memory-centric architecture with four specialized modules to maintain structured knowledge, execute research workflows, and continuously improve its procedures through feedback.

AINeutralarXiv – CS AI · May 286/10

🧠

CiteCheck: Retrieval-Grounded Detection of LLM Citation Hallucinations in Scientific Text

Researchers introduce CiteCheck, a hybrid framework that detects when large language models fabricate or corrupt scientific citations by combining scholarly database retrieval with structured LLM verification. The system achieves 88.7% macro-F1 on a new 982-citation physics benchmark, outperforming GPT, Claude, and Gemini, addressing a critical reliability problem as LLMs become integrated into scientific research workflows.

🧠 Claude🧠 Gemini

AIBullishGoogle Research Blog · May 196/10

🧠

Empirical Research Assistance (ERA): From Nature publication to catalyzing Computational Discovery

Empirical Research Assistance (ERA) represents a significant advancement in AI-assisted scientific research, transitioning from academic publication to practical computational discovery tools. The development demonstrates how machine learning can accelerate the research process across scientific disciplines, with implications for both the academic and technology sectors.

AINeutralarXiv – CS AI · May 116/10

🧠

CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers

Researchers introduce CoCoReviewBench, a new benchmark dataset of 3,900 papers from ICLR and NeurIPS designed to reliably evaluate AI review systems. The benchmark addresses critical gaps in current evaluation methods by prioritizing correctness over mere overlap with human reviews, revealing that existing AI reviewers struggle with hallucinations and reasoning accuracy.

AINeutralarXiv – CS AI · May 16/10

🧠

RPC-Bench: A Fine-grained Benchmark for Research Paper Comprehension

Researchers introduce RPC-Bench, a large-scale benchmark containing 15,000 human-verified question-answer pairs designed to evaluate how well AI models understand research papers. Testing reveals that even the strongest models like GPT-5 achieve only 68.2% accuracy on comprehension tasks, dropping significantly when conciseness is factored in, exposing critical gaps in academic document understanding.

🧠 GPT-5

AIBullisharXiv – CS AI · Mar 96/10

🧠

Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation

A comprehensive survey examines how large multimodal language models are transforming scientific research across five key areas: literature search, idea generation, content creation, multimodal artifact production, and peer review evaluation. The research highlights both the potential for AI-assisted scientific discovery and the ethical concerns regarding research integrity and misuse of generative models.

AIBullisharXiv – CS AI · Mar 36/104

🧠

Augmenting Research Ideation with Data: An Empirical Investigation in Social Science

Researchers developed a framework that improves AI-generated research ideas by incorporating relevant data during the ideation process. The approach increased idea feasibility by 20% and overall quality by 7%, with human studies confirming that data-augmented AI assistance helps researchers generate higher-quality ideas.

AIBullishOpenAI News · Jan 276/107

🧠

Introducing Prism

Prism is a new free LaTeX-native workspace that integrates GPT-5.2 to help researchers write, collaborate, and conduct research in a unified platform. The tool aims to streamline academic and research workflows by combining document preparation with AI-powered reasoning capabilities.

AIBullishOpenAI News · May 306/104

🧠

OpenAI for Education

OpenAI has launched an affordable AI offering specifically designed for universities to help them integrate artificial intelligence technology into their campus operations responsibly. This education-focused initiative aims to make AI more accessible to academic institutions while ensuring proper governance and implementation.

AINeutralGoogle Research Blog · Sep 305/106

🧠

AI as a research partner: Advancing theoretical computer science with AlphaEvolve

AlphaEvolve represents a new AI system designed to advance theoretical computer science research by serving as a research partner. The system focuses on algorithms and theory, potentially accelerating discoveries in fundamental computer science areas.