AI Pulse News

Models, papers, tools. 34,435 articles with AI-powered sentiment analysis and key takeaways.

34435 articles

AINeutralarXiv – CS AI · Jun 56/10

🧠

Operation-Guided Progressive Human-to-AI Text Transformation Benchmark for Multi-Granularity AI-Text Detection

Researchers introduce OpAI-Bench, a comprehensive benchmark for detecting AI-generated text in progressive human-AI co-edited documents across multiple granularities. The study reveals that AI-text detectability follows non-monotonic patterns, with mixed-authorship intermediate versions often harder to detect than purely human or heavily AI-edited documents, challenging assumptions in existing detection methods.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Regret Minimization with Adaptive Opponents in Repeated Games

Researchers introduce Repeated Policy Regret (RP-Regret), a new game-theoretic metric for analyzing regret minimization in repeated games with adaptive opponents who can respond to historical play. The paper proposes three algorithms to minimize RP-Regret despite its non-convex nature and demonstrates that when all players use these algorithms, certain subgame perfect equilibria can be learned, with experiments showing improved cooperation in games like Stag-Hunt.

AINeutralarXiv – CS AI · Jun 56/10

🧠

TempoVLA: Learning Speed-Controllable Vision-Language-Action Policies

TempoVLA introduces a controllable speed mechanism for Vision-Language-Action robot models, enabling flexible execution from fast transit to slow precision work. The approach uses trajectory augmentation during training and conditioning mechanisms during inference, allowing a single model to dynamically adjust operational speed based on task risk levels.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

Researchers introduce Code2LoRA, a hypernetwork framework that generates repository-specific LoRA adapters for code language models, eliminating the need for expensive fine-tuning or lengthy context injection. The approach achieves competitive performance with lower computational overhead and introduces RepoPeftBench, a 604-repository benchmark for evaluating code model adaptation techniques.

🏢 Hugging Face

AINeutralarXiv – CS AI · Jun 56/10

🧠

IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization

Researchers propose IDEAL, a novel framework for query-focused summarization that enhances large language models through two key innovations: Query-aware HyperExpert for fine-grained query alignment and Query-focused Infini-attention for processing lengthy documents. The approach demonstrates effectiveness across existing QFS benchmarks and expands LLM accessibility for personalized text summarization.

AINeutralarXiv – CS AI · Jun 56/10

🧠

CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning

Researchers introduce CoT-Space, a theoretical framework that explains how Large Language Models improve reasoning through multi-step Chain-of-Thought processes via reinforcement learning. The framework models reasoning as an optimization problem in continuous semantic space, demonstrating that optimal reasoning length emerges naturally from the underfitting-overfitting trade-off, providing a principled foundation for understanding test-time scaling in modern LLMs.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Detecting Perspective Shifts in Multi-agent Systems

Researchers introduce Temporal Data Kernel Perspective Space (TDKPS), a framework for detecting behavioral changes in multi-agent AI systems across time. The method enables monitoring of black-box agent dynamics at both individual and group levels, addressing a critical gap in evaluating evolving generative agent systems.

AINeutralarXiv – CS AI · Jun 56/10

🧠

CangLing-KnowFlow: A Unified Knowledge-and-Flow-fused Agent for Comprehensive Remote Sensing Applications

Researchers introduce CangLing-KnowFlow, an AI agent framework designed to automate complex remote sensing and Earth observation tasks across diverse applications. The system combines a knowledge base of 1,008 expert-validated workflows with dynamic error recovery and continuous learning capabilities, outperforming baseline models by 4% or more on standardized benchmarks.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Towards AI epidemiology: a measurement standardisation framework for prospective risk detection

Researchers propose a measurement standardization framework for detecting risks in deployed AI systems through structured expert-AI interaction analysis, without requiring access to model internals. The framework aims to establish reliable alignment scoring methodologies that could enable institutional monitoring of AI behavior and support epidemiological studies of AI-related outcomes in professional settings.

AINeutralarXiv – CS AI · Jun 56/10

🧠

CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

Researchers introduce NOVA, a security architecture for Computer Use Agents that prevents prompt injection attacks through upfront branching plans and architectural isolation. The system maintains up to 57% performance parity with frontier models while improving smaller models by 19%, though new vulnerabilities like Branch Steering attacks remain.

AIBullisharXiv – CS AI · Jun 56/10

🧠

Learning Adaptive Parallel Execution for Efficient Code Localization

Researchers introduce FuseSearch, an AI system that optimizes parallel code localization by reducing redundant tool invocations from 34.9% to near-zero through adaptive execution strategies. The approach combines supervised fine-tuning and reinforcement learning to dynamically adjust search breadth, achieving state-of-the-art performance on SWE-bench while using 68.9% fewer tokens and delivering 93.6% speedup.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Synapse: Federated Tool Routing via Typed Compendium Artifacts

Researchers introduce Synapse, a federated learning framework using typed artifacts that enables heterogeneous language models to collaborate without sharing weights or data. The system enables cross-architectural model transfer with minimal performance loss while maintaining formal privacy guarantees and schema-aware merging capabilities.

🧠 GPT-4

AINeutralarXiv – CS AI · Jun 56/10

🧠

DPBench: Structural Determinants of Multi-Agent LLM Coordination Under Simultaneous Resource Contention

DPBench introduces a benchmark for testing multi-agent LLM coordination using the Dining Philosophers problem, revealing that deadlock rates vary dramatically (25%-90%) across models under identical conditions. The research demonstrates that coordination success is primarily determined by protocol design—including communication structure and concurrency primitives—rather than model capability alone.

🧠 GPT-5🧠 Claude🧠 Opus

AIBullisharXiv – CS AI · Jun 56/10

🧠

No Need to Train Your RDB Foundation Model

Researchers present RDBLearn, a foundation model that enables in-context learning over relational databases without requiring model training or fine-tuning. By developing principled compression techniques that preserve semantic relationships within database columns rather than across heterogeneous data types, the approach allows existing single-table foundation models to operate effectively on multi-table database systems.

AINeutralarXiv – CS AI · Jun 56/10

🧠

2-Step Agent: A Framework for the Interaction of a Decision Maker with AI Decision Support

Researchers present the 2-Step Agent framework to model how decision makers learn from ML-based decision support systems. The study reveals that even when ML models are well-specified and agents behave rationally, misaligned prior beliefs can cause ML-DS to produce worse outcomes than no support at all, highlighting critical risks in deploying AI for high-stakes decisions.

$MKR

AINeutralarXiv – CS AI · Jun 56/10

🧠

Semantic Partial Grounding via LLMs

Researchers introduce SPG-LLM, a novel approach that leverages large language models to optimize the grounding process in classical planning by identifying irrelevant objects and actions before computation. The method achieves significantly faster grounding times—often by orders of magnitude—across seven challenging benchmarks while maintaining or improving plan quality.

AIBullisharXiv – CS AI · Jun 56/10

🧠

InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning

Researchers propose InfoDensity, a reinforcement learning reward framework that optimizes Large Language Models for efficient reasoning by measuring information density rather than just output length. The method tracks entropy trajectories to identify high-quality intermediate reasoning steps, achieving better accuracy-efficiency trade-offs on mathematical and general reasoning benchmarks.

AIBullisharXiv – CS AI · Jun 56/10

🧠

Evaluating the Utility of Personal Health Records in Personalized Health AI

A research study evaluates how large language models like Gemini 3.0 Flash can better answer patient health questions when provided with Personal Health Record (PHR) context. Testing 2,257 patient queries against de-identified PHRs showed significant improvements in helpfulness, safety, and accuracy, though the study identified specific gaps in LLM understanding of complex clinical data like temporal relationships.

🧠 Gemini

AINeutralarXiv – CS AI · Jun 56/10

🧠

Semi-Offline Reinforcement Learning for Optimized Text Generation

Researchers propose semi-offline reinforcement learning, a novel paradigm that bridges online and offline RL approaches to optimize text generation. The method balances exploration costs with training efficiency while providing theoretical frameworks for comparing different RL settings, demonstrating comparable or superior performance to existing state-of-the-art methods.

AIBullisharXiv – CS AI · Jun 56/10

🧠

Binary Gaussian Copula Synthesis: an LLM-powered data augmentation framework for early dialysis prediction in chronic kidney disease

Researchers developed Binary Gaussian Copula Synthesis (BGCS), an LLM-augmented data augmentation method that addresses severe class imbalance in chronic kidney disease datasets to improve early dialysis prediction. Tested on 15,169 CKD patients, BGCS outperformed existing methods like SMOTE and CTGAN, achieving 78-87% minority-class recall and enabling deployment in interpretable clinical decision-support systems.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Separation Power of Equivariant Neural Networks

Researchers characterize the separation power of equivariant neural networks, demonstrating that non-polynomial activations like ReLU and sigmoid achieve equivalent maximum expressivity, while depth and architectural choices significantly influence a model's ability to distinguish inputs. This theoretical analysis provides a framework for comparing model expressivity and understanding the design principles behind convolutional and permutation-invariant networks.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization

Researchers propose EBiEOT, a novel semi-supervised learning framework that leverages both paired and unpaired data through likelihood maximization and inverse entropic optimal transport. The method demonstrates universal approximation properties and provides an end-to-end algorithm for learning conditional distributions, with potential applications in domain translation and other data-scarce scenarios.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Comprehensive and Reliable Feature Attribution for Diverse Modalities and Models via Frequency-Domain Insights

Researchers introduce FreqX, a novel interpretability method for machine learning models that leverages signal processing and information theory to address challenges in personalized federated learning. The approach achieves 10x faster performance than existing methods while providing both attribution and concept information while maintaining privacy.

AINeutralarXiv – CS AI · Jun 56/10

🧠

PC-Talk: Precise Facial Animation Control for Audio-Driven Talking Face Generation

PC-Talk introduces a new framework for audio-driven talking face generation that enables precise control over facial animation through lip-audio alignment and emotion control via implicit keypoint deformations. The technology allows word-level editing of speaking styles, adjustment of lip movement scales, and realistic emotional expression generation with intensity modifications, achieving state-of-the-art results on benchmark datasets.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Reformulating Neural Operators in $d+1$ Dimensions for Embedding Evolution

Researchers introduce a reformulated Neural Operators framework that models embedding evolution in d+1 dimensions, using Fourier-based operators to improve function space mappings. The approach demonstrates superior performance across multiple benchmarks while reducing computational overhead compared to traditional embedding-scaling methods.

← PrevPage 481 of 1378Next →