#scientific-discovery News & Analysis

63 articles tagged with #scientific-discovery. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

63 articles

AIBullishOpenAI News · Jun 237/10

🧠

How GPT-5 helped immunologist Derya Unutmaz solve a 3-year-old mystery

GPT-5 Pro assisted immunologist Derya Unutmaz in resolving a three-year research challenge related to T cell behavior, potentially accelerating advances in cancer and autoimmune disease treatment. This breakthrough demonstrates AI's expanding role in scientific discovery and validates large language models as tools for complex biological problem-solving.

🧠 GPT-5

AIBullisharXiv – CS AI · Jun 237/10

🧠

Learning the ARTS of Search for Automated Discovery

Researchers propose ARTS (Agentic Reasoning for Tree Search), a novel approach using language models to automate scientific discovery by intelligently navigating hypothesis and experiment spaces. The method outperforms existing algorithms by 15.3% and enables smaller models like Qwen3-4B to match frontier AI systems at a fraction of the computational cost.

🧠 Gemini

AIBullisharXiv – CS AI · Jun 117/10

🧠

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

Researchers introduced Arbor, an AI framework enabling autonomous scientific research through long-term hypothesis refinement and iterative experimentation. The system demonstrated 2.5x better performance than existing AI models across six research tasks, suggesting meaningful advances in autonomous AI capabilities for optimization and discovery.

🧠 GPT-5🧠 Claude

AIBullisharXiv – CS AI · Jun 117/10

🧠

Semantic search for 100M+ galaxy images using AI-generated captions

Researchers developed AION-Search, an AI-powered semantic search engine that catalogs over 100 million galaxy images using Vision-Language Models to generate captions and create searchable embeddings without manual labeling. The system achieved state-of-the-art performance in discovering rare astronomical phenomena and identified 36 new extragalactic stellar stream candidates, while offering a generalizable approach for making large unlabeled scientific image archives semantically searchable.

AIBullisharXiv – CS AI · Jun 107/10

🧠

Harnessing the Collective Intelligence of AI Agents in the Wild for New Discoveries

EinsteinArena, a decentralized platform for AI agents, has demonstrated that autonomous agents can collaboratively solve open mathematical problems without human intervention. Since May 2026, agents on the platform have discovered 12 state-of-the-art solutions, including improvements to the kissing number problem in dimension 11, showcasing a new paradigm for distributed scientific discovery through agent-to-agent knowledge sharing.

AIBullisharXiv – CS AI · Jun 97/10

🧠

MatMind: A Structure-Activity Knowledge-Driven Generative Foundation Model for Materials Science

MatMind is a generative foundation model designed for crystal materials science that unifies structure prediction, property forecasting, and material design within a single LLM-based framework. The model surpasses specialized graph neural networks on benchmark tasks while achieving 65.3% success on crystal generation, demonstrating that unified AI architectures can compete with purpose-built narrow specialists.

AIBearisharXiv – CS AI · Jun 97/10

🧠

Contemporary AI lacks the imagination to diverge or negate in science

A major peer-reviewed study of 6,749 scientists evaluated AI-generated research ideas and found that large language models lack imagination in scientific discovery, struggle to propose null hypotheses, and show weak agreement with human expert judgment. The research reveals significant limitations in AI's ability to accelerate science despite widespread industry optimism.

AIBullishCrypto Briefing · Jun 87/10

🧠

MIT researchers develop self-evolving AI scientists for scientific discovery

MIT researchers have developed self-evolving AI systems capable of autonomous scientific discovery that can adapt and innovate beyond their initial programming constraints. This advancement represents a significant leap in AI capabilities, potentially accelerating research across multiple scientific disciplines by enabling machines to independently formulate and test hypotheses.

AIBullisharXiv – CS AI · Jun 57/10

🧠

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

MLEvolve introduces a self-evolving multi-agent framework powered by large language models that automates machine learning algorithm discovery through enhanced tree search, dynamic memory systems, and hierarchical planning. The system achieves state-of-the-art results on ML engineering benchmarks while operating in half the standard runtime, demonstrating significant advances in automating complex scientific discovery tasks.

AIBullisharXiv – CS AI · Jun 27/10

🧠

Science Earth: Towards A Planet-Scale Operating System for AI-Native Scientific Discovery

Researchers introduce Science Earth, a planet-scale operating system that enables diverse AI capabilities—from simulation clusters to wet-lab robots to proof engines—to autonomously discover, coordinate, and collaborate on scientific problems without pre-designed workflows. Two validation runs demonstrate the system successfully identifying theoretical gaps in mathematical models and generating novel insights from cancer cell data through distributed, self-correcting reasoning.

AIBullisharXiv – CS AI · Jun 27/10

🧠

Beyond AI as Assistants: Toward Autonomous Discovery in Cosmology

Researchers demonstrate two AI agent systems—CMBEvolve and CosmoEvolve—capable of autonomous scientific discovery in cosmology, moving beyond AI-as-tool toward AI-as-researcher. CMBEvolve uses code evolution for quantitative tasks while CosmoEvolve manages open-ended research workflows, both showing promising results in detecting anomalies and analyzing astronomical data without human intervention.

AIBullisharXiv – CS AI · Jun 27/10

🧠

Principle-Evolvable Scientific Discovery via Uncertainty Minimization

Researchers introduce PiEvo, a framework that enables AI scientific agents to autonomously evolve their underlying scientific principles rather than search within fixed hypothesis spaces. The system achieves 29.7-31.1% improvement in solution quality and 83.3% faster convergence by treating scientific discovery as Bayesian optimization over an expanding principle space.

AINeutralarXiv – CS AI · May 287/10

🧠

Why LLMs Fail at Causal Discovery and How Interventional Agents Escape

Researchers prove that large language models fundamentally cannot perform causal discovery through standard training methods, establishing this limitation as intrinsic to supervised learning rather than a model-specific flaw. They propose Agentic Causal Bayesian Optimization (A-CBO), which bypasses this constraint by using frozen language models as query oracles within an external optimization loop, achieving superior performance on causal inference benchmarks.

AINeutralarXiv – CS AI · May 117/10

🧠

Evaluating Large Language Models in Scientific Discovery

Researchers introduce a scenario-grounded benchmark for evaluating large language models in scientific discovery, revealing significant performance gaps compared to general science benchmarks. The framework tests LLMs across biology, chemistry, materials, and physics through project-level tasks involving hypothesis generation and experimental design, showing that current models remain distant from achieving general scientific superintelligence despite demonstrating promise in specific applications.

AIBullisharXiv – CS AI · May 117/10

🧠

ATHENA: Agentic Team for Hierarchical Evolutionary Numerical Algorithms

ATHENA is an autonomous AI framework that automates scientific computing and machine learning research by autonomously selecting mathematical approaches, generating code, and iteratively improving solutions through a contextual bandit learning process. The system achieves validation errors as low as 10^-14 and demonstrates performance surpassing traditional foundation models in solving complex multiphysics problems.

AIBullisharXiv – CS AI · May 97/10

🧠

AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents

Researchers present AI CFD Scientist, an open-source AI agent framework that autonomously conducts computational fluid dynamics research by combining literature review, physics simulation, vision-based verification, and manuscript generation. The system demonstrates measurable improvements in turbulence modeling and detects failure modes that traditional solver checks miss, representing a significant step toward AI-driven scientific discovery in high-fidelity physical simulation.

🧠 GPT-5

AIBullisharXiv – CS AI · May 77/10

🧠

Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery

Researchers propose Experiment-as-Code (EaC) Labs, a new paradigm that bridges AI agents with physical laboratory equipment by encoding experiments as declarative configurations compiled to device-level APIs. This framework combines artificial intelligence with automated lab instrumentation through a systems layer that performs safety checks, resource allocation, and job orchestration, enabling AI-driven scientific discovery beyond purely digital environments.

AIBullisharXiv – CS AI · May 17/10

🧠

Machine Collective Intelligence for Explainable Scientific Discovery

Researchers introduce machine collective intelligence, a paradigm combining symbolic reasoning and metaheuristics to autonomously discover governing equations from empirical data. The approach recovers underlying equations across deterministic, stochastic, and uncharacterized systems while reducing extrapolation error by up to six orders of magnitude compared to deep neural networks and condensing millions of parameters into just 5-40 interpretable ones.

AIBullisharXiv – CS AI · May 17/10

🧠

Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists

Researchers introduce Intern-Atlas, a methodological evolution graph built from over 1 million AI papers that automatically maps how research methods develop and relate to one another. The infrastructure captures explicit causal relationships between methodologies and enables AI-driven research agents to reconstruct innovation timelines, addressing a critical gap in existing document-centric research systems.

AINeutralarXiv – CS AI · Apr 207/10

🧠

PRL-Bench: A Comprehensive Benchmark Evaluating LLMs' Capabilities in Frontier Physics Research

Researchers introduced PRL-Bench, a comprehensive benchmark measuring large language models' ability to conduct autonomous physics research across five subfields. Testing frontier AI models revealed performance below 50%, exposing a significant capability gap between current LLMs and the demands of real-world scientific discovery.

AIBullisharXiv – CS AI · Apr 147/10

🧠

GIANTS: Generative Insight Anticipation from Scientific Literature

Researchers introduce GIANTS, a framework for training language models to anticipate scientific breakthroughs by synthesizing insights from foundational papers. The team releases GiantsBench, a 17k-example benchmark across eight scientific domains, and GIANTS-4B, a 4B-parameter model that outperforms larger proprietary baselines by 34% while generalizing to unseen research areas.

AIBullisharXiv – CS AI · Mar 267/10

🧠

Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering

Researchers have developed ML-Master 2.0, an autonomous AI agent that achieves breakthrough performance in ultra-long-horizon machine learning tasks by using Hierarchical Cognitive Caching architecture. The system achieved a 56.44% medal rate on OpenAI's MLE-Bench, demonstrating the ability to maintain strategic coherence over experimental cycles spanning days or weeks.

🏢 OpenAI

AIBullisharXiv – CS AI · Mar 177/10

🧠

The Future of Artificial Intelligence and the Mathematical and Physical Sciences (AI+MPS)

An NSF workshop community paper outlines strategic priorities for strengthening the intersection between artificial intelligence and mathematical/physical sciences (AI+MPS). The report proposes three key activities: enabling bidirectional AI+MPS research, building interdisciplinary communities, and fostering education and workforce development in this rapidly evolving field.

AINeutralarXiv – CS AI · Mar 57/10

🧠

MACC: Multi-Agent Collaborative Competition for Scientific Exploration

Researchers introduce MACC (Multi-Agent Collaborative Competition), a new institutional architecture that combines multiple AI agents based on large language models to improve scientific discovery. The system addresses limitations of single-agent approaches by incorporating incentive mechanisms, shared workspaces, and institutional design principles to enhance transparency, reproducibility, and exploration efficiency in scientific research.

AIBullisharXiv – CS AI · Mar 57/10

🧠

AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment

Researchers introduced AI4S-SDS, a neuro-symbolic framework combining multi-agent collaboration with Monte Carlo Tree Search for automated chemical formulation design. The system addresses LLM limitations in materials science applications and successfully identified a novel photoresist developer formulation that matches commercial benchmarks in preliminary lithography experiments.

Page 1 of 3Next →