#scientific-computing News & Analysis

106 articles tagged with #scientific-computing. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

106 articles

AIBearisharXiv – CS AI · Jun 257/10

🧠

Silent Failures in Physics-Informed Neural Networks: Parameter Poisoning and the Limits of Loss-Based Validation

Researchers demonstrate that Physics-Informed Neural Networks (PINNs) can achieve low training loss while producing wildly inaccurate solutions when underlying PDE parameters are corrupted, revealing a critical gap between loss minimization and physical correctness. The study proposes a post-hoc defense mechanism that sweeps residual loss across parameter values to recover true parameters without retraining, offering a practical solution across multiple PDE systems and network architectures.

AIBullisharXiv – CS AI · Jun 237/10

🧠

LLM-Guided Test-Time Discovery of Quantum-Chemical Approximation Algorithms

Researchers introduce LADeQ, an LLM-guided system that autonomously discovers and implements quantum chemistry approximation algorithms at test-time without pretraining. The approach accelerates coupled cluster and configuration interaction calculations while maintaining user-specified accuracy tolerances, demonstrating how language models can innovate within scientific computing workflows.

AIBullisharXiv – CS AI · Jun 117/10

🧠

SirenFNO: Efficient and Full Frequency Learning of Fourier Neural Operators

Researchers introduce SirenFNO, a neural network framework that improves Fourier Neural Operators by eliminating frequency truncation limitations and enabling full-spectrum learning. The approach achieves 4-15x parameter reduction while maintaining discretization invariance, with functional decomposition variants reaching up to 73x fewer parameters across multiple PDE benchmarks.

AIBullisharXiv – CS AI · Jun 97/10

🧠

SIGA: Self-Evolving Coding-Agent Adapters for Scientific Simulation

Researchers introduce SIGA, an AI adapter system that enables general coding agents to operate specialized scientific simulators without extensive domain training. The system achieves a 36x speedup compared to human experts on GEOS multiphysics simulator configuration, demonstrating that lightweight grounding layers can make general AI tools practical for scientific software.

AIBullisharXiv – CS AI · Jun 87/10

🧠

FP8 is All You Need (Part 1): Debunking Hardware FP64 as the HPC Holy Grail

A research paper challenges the long-held belief that native FP64 (double-precision) hardware is essential for scientific computing, arguing that FP8 tensor operations combined with advanced mathematical schemes can achieve equivalent accuracy at dramatically higher speeds on modern GPUs like NVIDIA's Blackwell B300.

🏢 Nvidia

AIBullisharXiv – CS AI · Jun 27/10

🧠

Towards a Physics Foundation Model

Researchers introduce the General Physics Transformer (GPhyT), a foundation model trained on 1.8 TB of simulation data that can simulate diverse physical systems without domain-specific retraining. The model demonstrates breakthrough capabilities in multi-domain physics prediction, zero-shot generalization to unseen systems, and stable long-horizon forecasting, potentially democratizing access to high-fidelity scientific simulations.

AIBullisharXiv – CS AI · May 297/10

🧠

Battery-Sim-Agent: Leveraging LLM-Agent for Inverse Battery Parameter Estimation

Researchers introduce Battery-Sim-Agent, an LLM-based framework that uses AI agents to estimate battery parameters by mimicking scientific reasoning rather than traditional black-box optimization. The system outperforms conventional methods like Bayesian optimization on benchmark tests and demonstrates practical applicability on real-world battery datasets, representing a novel approach to accelerating battery innovation through physics-informed AI reasoning.

AIBearisharXiv – CS AI · May 297/10

🧠

Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software

A physicist supervised Claude AI models over 12 days to build CLAX-PT, a physics simulation module, documenting how AI agents struggle with architectural redesign and distinguishing symptom-fixes from root-cause solutions. The study reveals that supervision design and human domain expertise, rather than model capability alone, determine whether AI-generated scientific code produces trustworthy results.

🧠 Claude

AIBullisharXiv – CS AI · May 287/10

🧠

Hybrid Neural World Models

Researchers present hybrid neural world models that use machine learning surrogates to accelerate physical dynamics simulations while maintaining accuracy at discontinuities like shocks and contacts. The approach achieves 26-72x speedups over traditional solvers while implicitly learning to identify uncertain regions without explicit training, with an optional fallback mode using classical solvers for high-confidence predictions.

AIBullisharXiv – CS AI · May 277/10

🧠

Recursive Flow Matching

Researchers introduce Recursive Flow Matching (RecFM), a generative AI framework that significantly improves the speed and accuracy of physics simulations by enforcing self-consistency across computational scales. The method achieves high-fidelity predictions in 1-4 steps with up to 20× speedup over existing diffusion models while reducing error by 15%, addressing a critical bottleneck in scientific computing.

AIBullisharXiv – CS AI · May 277/10

🧠

DGLD: Domain-Gated Latent Diffusion for the Discovery of Novel Energetic Materials

Researchers introduce Domain-Gated Latent Diffusion (DGLD), an AI method that discovered 12 novel energetic materials using generative diffusion models with quality-gated training and multi-task guidance. The breakthrough identified two lead compounds with performance metrics rivaling HMX-class materials for the first time in 15 years, validated through DFT simulations and released with open-source code.

AIBullisharXiv – CS AI · May 127/10

🧠

Towards a Virtual Neuroscientist: Autonomous Neuroimaging Analysis via Multi-Agent Collaboration

Researchers introduce NIAgent, a multi-agent AI system that automates end-to-end neuroimaging analysis by enabling specialist agents to collaboratively build and optimize executable programs. The system outperforms conventional static workflows like fMRIPrep by adapting dynamically to data and incorporating hierarchical quality control, addressing a critical bottleneck in clinical biomarker development.

AIBullisharXiv – CS AI · May 117/10

🧠

ATHENA: Agentic Team for Hierarchical Evolutionary Numerical Algorithms

ATHENA is an autonomous AI framework that automates scientific computing and machine learning research by autonomously selecting mathematical approaches, generating code, and iteratively improving solutions through a contextual bandit learning process. The system achieves validation errors as low as 10^-14 and demonstrates performance surpassing traditional foundation models in solving complex multiphysics problems.

AIBullisharXiv – CS AI · May 117/10

🧠

It Just Takes Two: Scaling Amortized Inference to Large Sets

Researchers introduce a novel training strategy for neural posterior estimation that decouples representation learning from posterior modeling, enabling amortized inference on large observation sets by training only on pairs of examples. The approach dramatically reduces computational requirements while maintaining or improving performance across diverse benchmarks, making scalable Bayesian inference practical for real-world applications.

AIBullisharXiv – CS AI · May 17/10

🧠

Heterogeneous Scientific Foundation Model Collaboration

Researchers introduce Eywa, a heterogeneous agentic framework that enables large language models to coordinate and reason across specialized scientific foundation models beyond natural language. The system improves performance on domain-specific tasks by allowing language models to guide inference over non-linguistic data modalities in physical, life, and social sciences.

AIBullisharXiv – CS AI · May 17/10

🧠

Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents

Researchers introduce CARE, a systematic methodology for engineering LLM-based agents in scientific domains through collaboration between subject-matter experts, developers, and AI helper agents. The approach replaces ad-hoc development with stage-gated phases and reusable artifacts, demonstrating measurable improvements in development efficiency and performance on complex queries.

AIBullisharXiv – CS AI · Apr 157/10

🧠

AutoSurrogate: An LLM-Driven Multi-Agent Framework for Autonomous Construction of Deep Learning Surrogate Models in Subsurface Flow

AutoSurrogate is an LLM-driven framework that automates the construction of deep learning surrogate models for subsurface flow simulation, enabling domain scientists without machine learning expertise to build high-quality models through natural language instructions. The system autonomously handles data profiling, architecture selection, hyperparameter optimization, and quality assessment while managing failure modes, demonstrating superior performance to expert-designed baselines on geological carbon storage tasks.

AIBullisharXiv – CS AI · Mar 57/10

🧠

Mozi: Governed Autonomy for Drug Discovery LLM Agents

Researchers have introduced Mozi, a dual-layer architecture designed to make AI agents more reliable for drug discovery by implementing governance controls and structured workflows. The system addresses critical issues of unconstrained tool use and poor long-term reliability that have limited LLM deployment in pharmaceutical research.

AINeutralarXiv – CS AI · Mar 57/10

🧠

End-to-end event reconstruction for precision physics at future colliders

Researchers developed an end-to-end AI-based event reconstruction system for future particle colliders that uses geometric algebra transformer networks and object condensation clustering. The system outperforms traditional rule-based algorithms by 10-20% in reconstruction efficiency and improves energy resolution by 22%, while reducing fake-particle rates by up to two orders of magnitude.

AIBullisharXiv – CS AI · Mar 46/102

🧠

AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

Researchers have developed a Bayesian adversarial multi-agent framework for AI-driven scientific code generation, featuring three coordinated LLM agents that work together to improve reliability and reduce errors. The Low-code Platform (LCP) enables non-expert users to generate scientific code through natural language prompts, demonstrating superior performance in benchmark tests and Earth Science applications.

AIBullisharXiv – CS AI · Mar 47/102

🧠

Physics-Informed Neural Networks with Architectural Physics Embedding for Large-Scale Wave Field Reconstruction

Researchers developed Physics-Embedded PINNs (PE-PINN) that achieve 10x faster convergence than standard physics-informed neural networks and orders of magnitude memory reduction compared to traditional methods for large-scale wave field reconstruction. The breakthrough enables high-fidelity electromagnetic wave modeling for wireless communications, sensing, and room acoustics applications.

AIBullisharXiv – CS AI · Mar 47/102

🧠

From Complex Dynamics to DynFormer: Rethinking Transformers for PDEs

Researchers have developed DynFormer, a new Transformer-based neural operator that improves partial differential equation (PDE) solving by incorporating physics-informed dynamics. The system achieves up to 95% reduction in relative error compared to existing methods while significantly reducing GPU memory consumption through specialized attention mechanisms for different physical scales.

AIBullishGoogle DeepMind Blog · Feb 127/108

🧠

Gemini 3 Deep Think: Advancing science, research and engineering

Gemini 3 Deep Think represents an updated specialized reasoning mode designed to tackle complex challenges in modern science, research, and engineering. The advancement focuses on enhanced problem-solving capabilities for technical and scientific applications.

AIBullishOpenAI News · Dec 117/108

🧠

Advancing science and math with GPT-5.2

OpenAI has released GPT-5.2, their most advanced model for mathematics and science applications, achieving state-of-the-art performance on benchmarks like GPQA Diamond and FrontierMath. The model demonstrates significant research capabilities, including solving open theoretical problems and generating reliable mathematical proofs.

AIBullishOpenAI News · Jan 307/107

🧠

Strengthening America’s AI leadership with the U.S. National Laboratories

OpenAI is partnering with U.S. National Laboratories to deploy its latest reasoning AI models for scientific research and breakthroughs. This collaboration aims to strengthen America's artificial intelligence leadership by leveraging the nation's premier research institutions.

Page 1 of 5Next →