y0news
🧠 AI

11,278 AI articles curated from 50+ sources with AI-powered sentiment analysis, importance scoring, and key takeaways.

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

ConfusionPrompt: Practical Private Inference for Online Large Language Models

Researchers introduce ConfusionPrompt, a privacy framework for large language models that decomposes user prompts into smaller sub-prompts mixed with pseudo-prompts before sending them to cloud servers. The method protects user privacy while maintaining higher utility than existing perturbation-based approaches, and it works with existing black-box LLMs without modification.
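
For intuition, a minimal sketch of the decompose-and-mix pattern the summary describes (the helper names and the naive sentence-level split are our assumptions, not the paper's algorithm):

```python
import random

def confusion_query(prompt: str, ask_llm, n_pseudo: int = 3) -> str:
    """Illustrative decompose-and-mix privacy pattern: split a sensitive
    prompt into sub-prompts, shuffle in decoys, recompose locally."""
    # Naive decomposition: one sub-prompt per sentence.
    sub_prompts = [s.strip() for s in prompt.split(".") if s.strip()]
    # Pseudo-prompts: plausible decoys carrying no user information.
    pseudo = [f"State a generic fact about topic {i}." for i in range(n_pseudo)]
    batch = sub_prompts + pseudo
    random.shuffle(batch)
    # The cloud LLM only ever sees the mixed batch, never the full prompt.
    answers = {q: ask_llm(q) for q in batch}
    # Recompose the final answer locally from the real sub-prompts.
    return "\n".join(answers[q] for q in sub_prompts)
```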

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

DiffSketcher is a novel AI algorithm that generates vector sketches from text prompts by leveraging pre-trained text-to-image diffusion models. The method optimizes Bézier curves using an extended Score Distillation Sampling loss and introduces a stroke initialization strategy based on attention maps, achieving superior results in sketch quality and controllability.
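
A toy version of the stroke-optimization loop: cubic Bézier control points are updated by gradient descent, with a mean-squared placeholder standing in for the extended Score Distillation Sampling loss that DiffSketcher backpropagates through a pretrained diffusion model:

```python
import torch

# 8 strokes, each a cubic Bézier with 4 control points in 2D.
ctrl = torch.randn(8, 4, 2, requires_grad=True)

def bezier_points(ctrl: torch.Tensor, n: int = 32) -> torch.Tensor:
    """Sample n points along each cubic Bézier curve."""
    t = torch.linspace(0, 1, n).view(1, n, 1)
    p0, p1, p2, p3 = (ctrl[:, i:i + 1, :] for i in range(4))
    return ((1 - t)**3 * p0 + 3 * (1 - t)**2 * t * p1
            + 3 * (1 - t) * t**2 * p2 + t**3 * p3)

opt = torch.optim.Adam([ctrl], lr=0.05)
target = torch.zeros(8, 32, 2)  # placeholder for diffusion-model guidance
for step in range(100):
    loss = ((bezier_points(ctrl) - target)**2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
```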

🧠 AI · Neutral · arXiv – CS AI · Apr 10 · 7/10

An Automated Survey of Generative Artificial Intelligence: Large Language Models, Architectures, Protocols, and Applications

A comprehensive survey of generative AI and large language models as of early 2026 has been published, covering frontier open-weight models like DeepSeek and Qwen alongside proprietary systems, with detailed analysis of architectures, deployment protocols, and applications across fifteen industry sectors.

🏢 Anthropic · 🧠 GPT-5 · 🧠 Claude
🧠 AI · Neutral · arXiv – CS AI · Apr 10 · 7/10

ATBench: A Diverse and Realistic Agent Trajectory Benchmark for Safety Evaluation and Diagnosis

Researchers introduce ATBench, a comprehensive benchmark for evaluating the safety of LLM-based agents across realistic multi-step interactions. The 1,000-trajectory dataset addresses critical gaps in existing safety evaluations by incorporating diverse risk scenarios, detailed failure classification, and long-horizon complexity that mirrors real-world deployment challenges.

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

MoBiE: Efficient Inference of Mixture of Binary Experts under Post-Training Quantization

Researchers introduce MoBiE, a novel binarization framework designed specifically for Mixture-of-Experts large language models that achieves significant efficiency gains through weight compression while maintaining model performance. The method addresses unique challenges in quantizing MoE architectures and demonstrates over 2× inference speedup with substantial perplexity reductions on benchmark models.

🏢 Perplexity
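
For context, the generic 1-bit weight binarization that schemes like this build on (W ≈ α · sign(W), with a per-row scale); MoBiE's MoE-specific method and its bit-packed kernels are not reproduced here:

```python
import torch

def binarize_expert(w: torch.Tensor):
    """Standard 1-bit weight binarization, shown per output row.
    Illustrative baseline only, not MoBiE's actual scheme."""
    alpha = w.abs().mean(dim=1, keepdim=True)   # per-row scale factor
    return alpha, torch.sign(w)                  # {-1, 0, +1} sign matrix

def binary_matmul(x: torch.Tensor, alpha: torch.Tensor,
                  sign_w: torch.Tensor) -> torch.Tensor:
    # Dense emulation; real kernels use bit-packed XNOR/popcount ops.
    return x @ (alpha * sign_w).t()

w = torch.randn(16, 64)                  # one expert's weight matrix
alpha, sign_w = binarize_expert(w)
y = binary_matmul(torch.randn(2, 64), alpha, sign_w)
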
🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

AI-Driven Research for Databases

Researchers propose AI-Driven Research for Systems (ADRS), a framework using large language models to automate database optimization by generating and evaluating hundreds of candidate solutions. By co-evolving evaluators with solutions, the team demonstrates discovery of novel algorithms achieving up to 6.8x latency improvements over existing baselines in buffer management, query rewriting, and index selection tasks.
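
The overall shape is a generate-and-evaluate search loop; a minimal sketch with a hypothetical interface (`generate` asks an LLM for candidate solutions, `evaluate` returns a latency score to minimize):

```python
def adrs_loop(generate, evaluate, rounds: int = 10, pop: int = 20):
    """Generic generate-and-evaluate search of the kind the summary
    describes. Interface is illustrative, not the paper's actual API."""
    best, best_score = None, float("inf")
    for _ in range(rounds):
        # Ask the LLM for a population of candidates, seeded by the
        # current best solution (None on the first round).
        for candidate in generate(best, n=pop):
            score = evaluate(candidate)
            if score < best_score:
                best, best_score = candidate, score
    return best, best_score
```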

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

Efficient Quantization of Mixture-of-Experts with Theoretical Generalization Guarantees

Researchers propose an expert-wise mixed-precision quantization strategy for Mixture-of-Experts models that assigns bit-widths based on router gradient changes and neuron variance. The method achieves higher accuracy than existing approaches while reducing inference memory on large-scale models like Switch Transformer and Mixtral, at minimal computational cost.
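
A toy allocation heuristic in the same spirit: rank experts by a sensitivity score (the paper derives its score from router-gradient changes and neuron variance) and grant the most sensitive experts wider bit-widths under an average-bit budget. The greedy rule below is our illustration, not the paper's assignment procedure:

```python
import numpy as np

def assign_bits(sensitivity: np.ndarray, budget_bits: float,
                choices=(2, 4, 8)) -> np.ndarray:
    """Greedy expert-wise bit allocation under an average-bit budget."""
    order = np.argsort(sensitivity)[::-1]           # most sensitive first
    bits = np.full(len(sensitivity), min(choices))  # start at lowest width
    for i in order:
        for b in sorted(choices, reverse=True):     # try widest first
            trial = bits.copy()
            trial[i] = b
            if trial.mean() <= budget_bits:
                bits[i] = b
                break
    return bits

print(assign_bits(np.array([0.9, 0.1, 0.4, 0.2]), budget_bits=4.0))
# -> [8 2 4 2]: the most sensitive expert gets 8 bits under a 4-bit average.
```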

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

Inference-Time Code Selection via Symbolic Equivalence Partitioning

Researchers propose Symbolic Equivalence Partitioning, a novel inference-time selection method for code generation that uses symbolic execution and SMT constraints to identify correct solutions without expensive external verifiers. The approach improves accuracy on HumanEval+ by 10.3% and on LiveCodeBench by 17.1% at N=10 without requiring additional LLM inference.
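
A concrete-execution stand-in for the idea: partition candidate programs into behavioral equivalence classes and keep a member of the largest class (the paper builds the partition with symbolic execution and SMT constraints rather than probe inputs):

```python
from collections import defaultdict

def select_by_equivalence(candidates, inputs):
    """Group candidate programs by their outputs on probe inputs and
    return one member of the largest equivalence class."""
    classes = defaultdict(list)
    for fn in candidates:
        signature = tuple(fn(x) for x in inputs)
        classes[signature].append(fn)
    return max(classes.values(), key=len)[0]

# Usage: pick among N sampled solutions for "square a number".
cands = [lambda x: x * x, lambda x: x ** 2, lambda x: 2 * x]
print(select_by_equivalence(cands, inputs=[0, 1, 3])(5))  # -> 25
```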

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

Distributed Interpretability and Control for Large Language Models

Researchers have developed a scalable system for interpreting and controlling large language models distributed across multiple GPUs, achieving up to 7x memory reduction and 41x throughput improvements. The method enables real-time behavioral steering of frontier LLMs like LLaMA and Qwen without fine-tuning, with results released as open-source tooling.
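
The core single-device mechanism is classic activation steering via a forward hook; the paper's contribution is scaling this (and the interpretability side) across model shards on many GPUs, which this sketch does not attempt:

```python
import torch

def add_steering_hook(layer: torch.nn.Module, direction: torch.Tensor,
                      alpha: float = 4.0):
    """Add a scaled steering direction to a layer's output at inference
    time, with no fine-tuning. Returns the hook handle for removal."""
    def hook(_module, _inputs, output):
        # Transformer blocks often return tuples; steer the hidden states.
        hidden = output[0] if isinstance(output, tuple) else output
        hidden = hidden + alpha * direction.to(hidden.dtype)
        return (hidden, *output[1:]) if isinstance(output, tuple) else hidden
    return layer.register_forward_hook(hook)
```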

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

Less is More: Data-Efficient Adaptation for Controllable Text-to-Video Generation

Researchers demonstrate a data-efficient fine-tuning method for text-to-video diffusion models that enables new generative controls using sparse, low-quality synthetic data rather than expensive, photorealistic datasets. Counterintuitively, models trained on simple synthetic data outperform those trained on high-fidelity real data, supported by both empirical results and theoretical justification.

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

Scientific Knowledge-driven Decoding Constraints Improving the Reliability of LLMs

Researchers propose SciDC, a method that constrains large language model outputs using subject-specific scientific rules to reduce hallucinations and improve reliability. The approach demonstrates 12% average accuracy improvements across domain tasks including drug formulation, clinical diagnosis, and chemical synthesis planning.
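
Mechanically, rule-constrained decoding comes down to masking disallowed tokens at each step; a generic sketch (the rule compiler that would produce `allowed_ids` is the paper's actual contribution and is not shown):

```python
import torch

def constrained_step(logits: torch.Tensor, allowed_ids: list[int]) -> int:
    """One decode step: forbid every token the active scientific rule
    disallows, then sample from the renormalized distribution."""
    mask = torch.full_like(logits, float("-inf"))
    mask[allowed_ids] = 0.0                      # only these tokens survive
    probs = torch.softmax(logits + mask, dim=-1)
    return torch.multinomial(probs, 1).item()

# Usage on a toy 6-token vocabulary where the rule permits tokens 1 and 4.
print(constrained_step(torch.randn(6), allowed_ids=[1, 4]))
```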

🧠 AI · Bearish · arXiv – CS AI · Apr 10 · 7/10

Beyond Functional Correctness: Design Issues in AI IDE-Generated Large-Scale Projects

Researchers evaluated Cursor, an AI-powered IDE, on its ability to generate large-scale software projects and found it achieves 91% functional correctness but produces significant design issues including code duplication, complexity violations, and framework best-practice breaches that threaten long-term maintainability.

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

AgentOpt v0.1 Technical Report: Client-Side Optimization for LLM-Based Agent

AgentOpt v0.1, a new Python framework, addresses client-side optimization for AI agents by intelligently allocating models, tools, and API budgets across pipeline stages. Using search algorithms like Arm Elimination and Bayesian Optimization, the tool reduces evaluation costs by 24-67% while achieving near-optimal accuracy, with cost differences between model combinations reaching up to 32x at matched performance levels.
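
A minimal successive-elimination sketch of the kind of arm-elimination search mentioned above (interface hypothetical): score every configuration cheaply, then drop the worse half each round so evaluation budget concentrates on promising arms:

```python
import statistics

def arm_elimination(configs, evaluate, rounds: int = 3,
                    samples_per_round: int = 5):
    """Keep sampling all surviving arms, then halve the field by mean
    score each round. `evaluate(config)` returns a quality score."""
    arms = {c: [] for c in configs}
    for _ in range(rounds):
        for c in arms:
            arms[c] += [evaluate(c) for _ in range(samples_per_round)]
        ranked = sorted(arms, key=lambda c: statistics.mean(arms[c]),
                        reverse=True)
        # Eliminate the worse half; always keep at least one arm.
        arms = {c: arms[c] for c in ranked[: max(1, len(ranked) // 2)]}
    return next(iter(arms))  # best surviving configuration
```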

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

Harnessing Hyperbolic Geometry for Harmful Prompt Detection and Sanitization

Researchers propose HyPE and HyPS, a two-part defense framework using hyperbolic geometry to detect and neutralize harmful prompts in Vision-Language Models. The approach offers a lightweight, interpretable alternative to blacklist filters and classifier-based systems that are vulnerable to adversarial attacks.
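
For reference, distance in the Poincaré ball model of hyperbolic space, the geometry the framework embeds prompts into; thresholding such distances to flag prompts is our illustration of the idea, not the paper's exact decision rule:

```python
import numpy as np

def poincare_distance(u: np.ndarray, v: np.ndarray) -> float:
    """Geodesic distance between two points inside the unit Poincaré ball:
    d(u, v) = arccosh(1 + 2||u - v||^2 / ((1 - ||u||^2)(1 - ||v||^2)))."""
    uu, vv = np.dot(u, u), np.dot(v, v)
    duv = np.dot(u - v, u - v)
    return float(np.arccosh(1 + 2 * duv / ((1 - uu) * (1 - vv))))

print(poincare_distance(np.array([0.1, 0.0]), np.array([0.0, 0.8])))
```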

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment

Researchers propose the Master Key Hypothesis, suggesting that AI model capabilities can be transferred across different model scales without retraining through linear subspace alignment. The UNLOCK framework demonstrates training-free capability transfer, achieving significant accuracy improvements such as 12.1% gains on mathematical reasoning tasks when transferring from larger to smaller models.
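
The generic ingredient is a least-squares linear map between two models' activation spaces, fit on paired activations for the same inputs; UNLOCK's full transfer pipeline is not reproduced here:

```python
import numpy as np

def fit_alignment(src: np.ndarray, dst: np.ndarray) -> np.ndarray:
    """Solve min_W ||src @ W - dst||_F for a linear map between
    representation spaces (rows are paired activations)."""
    W, *_ = np.linalg.lstsq(src, dst, rcond=None)
    return W

# Usage with toy activations: 256 examples, 64-dim source, 48-dim target.
rng = np.random.default_rng(0)
src = rng.normal(size=(256, 64))
dst = src @ rng.normal(size=(64, 48))        # pretend the true map is linear
W = fit_alignment(src, dst)
print(np.allclose(src @ W, dst, atol=1e-6))  # True on this toy data
```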

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

DosimeTron: Automating Personalized Monte Carlo Radiation Dosimetry in PET/CT with Agentic AI

DosimeTron, an agentic AI system powered by GPT-5.2, automates personalized Monte Carlo radiation dosimetry calculations for PET/CT medical imaging. Validated on 597 studies across 378 patients, the system achieved 99.6% correlation with reference dosimetry calculations while processing each case in approximately 32 minutes with zero execution failures.

🧠 GPT-5
🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

SALLIE: Safeguarding Against Latent Language & Image Exploits

Researchers introduce SALLIE, a lightweight runtime defense framework that detects and mitigates jailbreak attacks and prompt injections in large language and vision-language models simultaneously. Using mechanistic interpretability and internal model activations, SALLIE achieves robust protection across multiple architectures without degrading performance or requiring architectural changes.
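
A common baseline for this style of defense is a linear probe on internal activations; the toy sketch below (synthetic data, scikit-learn) shows only that mechanism, not SALLIE's mechanistic-interpretability pipeline:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Train a linear probe on hidden-state vectors from benign vs. jailbreak
# prompts, then score new requests at runtime. Data here is synthetic
# and separable purely for illustration.
rng = np.random.default_rng(1)
benign = rng.normal(0.0, 1.0, size=(200, 128))
attack = rng.normal(0.7, 1.0, size=(200, 128))
X = np.vstack([benign, attack])
y = np.array([0] * 200 + [1] * 200)
probe = LogisticRegression(max_iter=1000).fit(X, y)

# Probability that a new activation vector comes from an attack prompt.
print(probe.predict_proba(rng.normal(0.7, 1.0, size=(1, 128)))[0, 1])
```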

🧠 AI · Neutral · arXiv – CS AI · Apr 10 · 7/10

OmniTabBench: Mapping the Empirical Frontiers of GBDTs, Neural Networks, and Foundation Models for Tabular Data at Scale

OmniTabBench introduces the largest tabular data benchmark with 3,030 datasets to evaluate gradient-boosted decision trees, neural networks, and foundation models. The comprehensive analysis reveals no universally superior approach, but identifies specific conditions favoring different model categories through decoupled metafeature analysis.

🧠 AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

ClawLess: A Security Model of AI Agents

ClawLess introduces a formally verified security framework that enforces policies on AI agents operating with code execution and information retrieval capabilities, addressing risks that existing training-based approaches cannot adequately mitigate. The system uses BPF-based syscall interception and a user-space kernel to prevent adversarial AI agents from violating security boundaries, regardless of their internal design.
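
The policy semantics reduce to an allow-list check on every agent action; the sketch below stays in user-space Python, whereas ClawLess enforces the same idea beneath user code with BPF syscall interception and a user-space kernel:

```python
from fnmatch import fnmatch

# Hypothetical allow-list: (action, path pattern) pairs the agent may use.
POLICY = [("read", "/workspace/*"), ("write", "/workspace/out/*")]

def check(action: str, path: str) -> bool:
    """Return True only if some policy entry permits this action/path."""
    return any(action == a and fnmatch(path, pat) for a, pat in POLICY)

assert check("read", "/workspace/notes.txt")
assert not check("write", "/etc/passwd")   # denied regardless of agent intent
```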

🧠 AI · Neutral · arXiv – CS AI · Apr 10 · 7/10

Blending Human and LLM Expertise to Detect Hallucinations and Omissions in Mental Health Chatbot Responses

Researchers demonstrate that standard LLM-as-a-judge methods achieve only 52% accuracy in detecting hallucinations and omissions in mental health chatbots, failing in high-risk healthcare contexts. A hybrid framework combining human domain expertise with machine learning features achieves significantly higher performance (0.717-0.849 F1 scores), suggesting that transparent, interpretable approaches outperform black-box LLM evaluation in safety-critical applications.

🧠 AI · Bearish · arXiv – CS AI · Apr 10 · 7/10

Concentrated siting of AI data centers drives regional power-system stress under rising global compute demand

A new study reveals that AI data centers are becoming a critical driver of electricity demand, with projected consumption doubling to 239-295 TWh by 2030. The concentrated geographic clustering of these facilities in North America, Western Europe, and Asia-Pacific creates significant grid vulnerabilities in regions like Oregon, Virginia, and Ireland, requiring urgent infrastructure planning.

🧠 AI · Neutral · arXiv – CS AI · Apr 10 · 7/10

Invisible Influences: Investigating Implicit Intersectional Biases through Persona Engineering in Large Language Models

Researchers introduce BADx, a novel metric that measures how Large Language Models amplify implicit biases when adopting different social personas, revealing that popular LLMs like GPT-4o and DeepSeek-R1 exhibit significant context-dependent bias shifts. The study across five state-of-the-art models demonstrates that static bias testing methods fail to capture dynamic bias amplification, with implications for AI safety and responsible deployment.

🧠 GPT-4 · 🧠 Claude
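
A toy persona-shift score in the spirit of what the summary attributes to BADx (`bias_fn` is a hypothetical scorer in [0, 1]; the paper's metric definition is not reproduced):

```python
def persona_bias_shift(bias_fn, prompts, personas):
    """Compare a model's mean bias score on neutral prompts against the
    same prompts wrapped in each persona; report the per-persona shift."""
    base = sum(bias_fn(p) for p in prompts) / len(prompts)
    shifts = {}
    for persona in personas:
        scored = [bias_fn(f"You are {persona}. {p}") for p in prompts]
        shifts[persona] = sum(scored) / len(scored) - base
    return shifts
```
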
🧠 AI · Bearish · arXiv – CS AI · Apr 10 · 7/10

LLM Spirals of Delusion: A Benchmarking Audit Study of AI Chatbot Interfaces

A comprehensive audit study reveals significant differences between LLM API testing and real-world chat interface usage, finding that ChatGPT-5 shows fewer problematic behaviors than ChatGPT-4o but both models still display substantial levels of delusion reinforcement and conspiratorial thinking amplification. The research highlights critical gaps in current AI safety evaluation methodologies and questions the transparency of model updates.

🧠 GPT-5 · 🧠 ChatGPT
🧠 AI · Bearish · arXiv – CS AI · Apr 10 · 7/10

When to Call an Apple Red: Humans Follow Introspective Rules, VLMs Don't

Researchers introduce the Graded Color Attribution dataset to test whether Vision-Language Models faithfully follow their own stated reasoning rules. The study reveals that VLMs systematically violate their introspective rules in up to 60% of cases, while humans remain consistent, suggesting VLM self-knowledge is fundamentally miscalibrated with serious implications for high-stakes deployment.

🧠 GPT-5
🧠 AI · Neutral · arXiv – CS AI · Apr 10 · 7/10

Benchmarking LLM Tool-Use in the Wild

Researchers introduce WildToolBench, a new benchmark for evaluating large language models' ability to use tools in real-world scenarios. Testing 57 LLMs reveals that none exceed 15% accuracy, exposing significant gaps in current models' agentic capabilities when facing messy, multi-turn user interactions rather than simplified synthetic tasks.
