AI Pulse News

Models, papers, tools. 34,413 articles with AI-powered sentiment analysis and key takeaways.

34413 articles

AINeutralarXiv – CS AI · Jun 55/10

🧠

EEGDancer: Dynamic Emotion Latent Space Masked Modeling with Reinforcement Learning for EEG Continuous Emotion Prediction

Researchers propose EEGDancer, a machine learning framework that combines vector-quantized representation learning, masked temporal modeling, and reinforcement learning to predict continuous emotional states from EEG brain signals. The approach outperforms existing methods on standard emotion prediction datasets by modeling long-range temporal dependencies rather than treating emotion prediction as frame-by-frame regression.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Deciphering Two Training Clocks in Grokking via Deep Linear Network Theory with Conditional ReLU Reduction

Researchers formalize the grokking phenomenon—where neural networks fit training data quickly but learn generalizable rules slowly—by analyzing deep linear networks and ReLU MLPs. The study identifies two distinct training timescales: fast classification loss decay and slower representation simplification, with implications for understanding how neural networks generalize.

AINeutralarXiv – CS AI · Jun 55/10

🧠

Compositional Boundaries for Density Fusion

This theoretical computer science paper addresses the mathematical foundations of distributed uncertainty management by establishing compositional boundaries for probabilistic density fusion. The research determines when local fusion rules can be executed hierarchically while maintaining order-invariance, a critical requirement for distributed systems where intermediate nodes combine data regardless of sequence.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Staying with the Uncertainty: Uncertainty-Scaffolding Strategies for Artificial Moral Advisors in LLM-to-LLM Simulated Conversations

Researchers study how Large Language Models deployed as Artificial Moral Advisors should communicate with users discussing ethical dilemmas, proposing three uncertainty-focused conversation strategies and finding that different approaches sustain distinct quality levels of engagement rather than producing uniform belief revision.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Better Literary Translation: A Multi-Aspect Data Generation and LLM Training Approach

Researchers have developed a multi-aspect iterative framework for improving literary translation using specialized LLMs and reinforcement learning. Their resulting models achieve competitive performance with Claude Sonnet 4.5 on English-to-Chinese literary translation benchmarks while demonstrating strong generalization to out-of-domain works.

🧠 Claude🧠 Sonnet

AINeutralarXiv – CS AI · Jun 55/10

🧠

To Be Multimodal or Not to Be: Query-Adaptive Audio-Visual Person Retrieval via Active Modality Detection

Researchers propose a query-adaptive audio-visual person retrieval system that intelligently detects which modalities (voice or face) are actually present in broadcast video archives, avoiding noise from absent modalities. By analyzing cross-modal score consistency, the system achieves 94.2% precision on BBC Rewind's 12,000+ videos, significantly outperforming both unimodal and fixed fusion approaches.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Learning of Robot Safety Policies via Adversarial Synthetic Scenarios

Researchers propose an adversarial framework for developing safer robot systems by simulating hazardous scenarios through competing AI agents—one creating dangerous situations and another refining safety policies to prevent them. This approach aims to efficiently identify edge cases and high-risk failures that traditional random testing misses, advancing safety standards for physical AI systems in real-world environments.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Causal Scaffolding for Physical Reasoning: A Benchmark for Causally-Informed Physical World Understanding in VLMs

Researchers introduce CausalPhys, a benchmark with over 3,000 curated video and image questions designed to evaluate how well vision-language models understand causal physical reasoning. The work includes expert-annotated causal graphs and proposes Causal Rationale-informed Fine-Tuning (CRFT) to improve VLM performance on physical world reasoning tasks.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Measuring the sensitivity of LLM-based structured extraction to prompt, model, and schema choices in clinical discharge summaries

Researchers evaluated how large language models performing structured data extraction from clinical notes respond to variations in prompts, model sizes, and data schemas. The study found that schema design—particularly the distinction between absent versus undocumented information—drives disagreement more than prompt phrasing, while model choice significantly impacts multi-class categorization tasks.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Deep Learning-based 3D Oral Cavity Reconstruction Using 2D Intraoral Images

Researchers propose a deep learning method that reconstructs 3D oral cavity models from just ten 2D intraoral images, eliminating the need for expensive scanning equipment or uncomfortable impression-taking procedures. Achieving 77.49% accuracy using MobileNetV2 and multi-head attention mechanisms, the approach offers a cost-effective alternative for dental modeling, though it currently exhibits uneven point distribution in reconstructed models.

AINeutralarXiv – CS AI · Jun 56/10

🧠

ATT-CR: Adaptive Triangular Transformer for Cloud Removal

Researchers introduce ATT-CR, a Transformer-based model that improves cloud removal in remote sensing images by reducing computational complexity and filtering cloudy pixel interference. The innovation combines Triangular Attention with lower computational costs (O(N)) and a Feature Selected Gating Module to distinguish between valid and invalid features, addressing scalability limitations in existing Transformer approaches.

AINeutralarXiv – CS AI · Jun 56/10

🧠

EGTR-Review: Efficient Evidence-Grounded Scientific Peer Review Generation via Multi-Agent Teacher Distillation

EGTR-Review presents a novel framework for automating scientific peer review using a multi-agent teacher model that distills its reasoning into a lightweight student model, achieving superior performance with significantly lower computational costs while maintaining evidence traceability and factual grounding.

AINeutralarXiv – CS AI · Jun 56/10

🧠

When Good Enough Is Optimal: Multiplication-Only Matrix Inversion Approximation for Quantized Gated DeltaNet

Researchers propose a fast matrix multiplication-based algorithm for matrix inversion in linear attention mechanisms, achieving up to 5x speedup on neural processing units while maintaining model accuracy under both standard and low-precision inference. The method addresses a critical computational bottleneck in long-context language modeling by using truncated Neumann expansion and parallel residual correction.

AIBullisharXiv – CS AI · Jun 56/10

🧠

Sample-efficient Low-level Motion Planning for Robotic Manipulation Tasks via Zero-shot Transfer Learning

Researchers propose iCEM+TL, a framework combining the Cross-Entropy Method with transfer learning to improve robotic manipulation planning efficiency. The approach achieves up to 23% success rate improvements in complex tasks like stacking and shelf placement, with validation demonstrated on a real Franka Emika robot.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Metamorphic Testing with the Rashomon Set: Explanation Faithfulness in Machine Learning

Researchers propose a metamorphic testing framework to evaluate the trustworthiness of machine learning model explanations by identifying inconsistencies between model predictions and feature attributions, addressing the Rashomon effect where multiple models achieve similar performance but yield conflicting explanations.

AINeutralarXiv – CS AI · Jun 56/10

🧠

MDP-GRPO: Stabilized Group Relative Policy Optimization for Multi-Constraint Instruction Following

Researchers propose MDP-GRPO, an improved reinforcement learning method that stabilizes group relative policy optimization for instruction-following tasks by addressing three fundamental instabilities in reward normalization. The technique achieves up to 5% improvement in constraint satisfaction on language models while maintaining general performance capabilities.

🧠 Llama

AINeutralarXiv – CS AI · Jun 56/10

🧠

On Advantage Estimates for Max@K Policy Gradients

Researchers introduce MaxPO, a new policy-gradient method that improves advantage estimation for max@K objectives in reinforcement learning, addressing challenges in LLM post-training by reducing gradient variance through a Leave-Two-Out baseline that ensures centered advantages.

AINeutralarXiv – CS AI · Jun 56/10

🧠

TLA-Prover: Verifiable TLA+ Specification Synthesis via Preference-Optimized Low-Rank Adaptation

Researchers have developed TLA-Prover, a 20-billion-parameter AI model that significantly improves the synthesis of TLA+ formal specifications for distributed systems, achieving 30% correctness on verified benchmarks—roughly 3.5x better than previous baselines. The model combines supervised fine-tuning with repair-based policy optimization and uses TLC model checker feedback directly as a reward signal, eliminating the need for learned reward models.

AINeutralarXiv – CS AI · Jun 56/10

🧠

A Finite Certificate for the Positive $n=9$ Vasc Inequality

Researchers have proven the positive n=9 case of the Vasc cyclic inequality using a hybrid human-AI approach with the MechMath Agent Team, generating a finite certificate covering 40,320 sorted cones. The proof demonstrates the practical application of AI agents in mathematical verification, combining human mathematical reasoning with machine-generated computational verification.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Learning to Route LLMs from Implicit Cost-Performance Preferences via Meta-Learning

Researchers introduce MetaRouter, a meta-learning framework that optimizes Large Language Model routing by learning individual users' implicit cost-performance preferences through minimal interaction. The system enables personalized query routing across multiple models, balancing expense reduction with performance maintenance more effectively than existing methods.

AINeutralarXiv – CS AI · Jun 55/10

🧠

Improving Answer Extraction in Context-based Question Answering Systems Using LLMs

Researchers propose an improved question answering system using fine-tuned large language models on the SQuAD dataset, achieving strong performance metrics (ROUGE-L: 86.84%, BERTScore: 95.38%). The work addresses limitations in current LLM-based QA systems' ability to extract accurate answers from given contexts, demonstrating that targeted fine-tuning substantially enhances reliability and precision.

AIBullisharXiv – CS AI · Jun 56/10

🧠

Towards the Readability of LLM-Generated Codes through Multitask Representation Engineering

Researchers propose a multitask representation engineering framework to improve the readability of code generated by large language models while maintaining correctness. The approach uses low-cost targeted control mechanisms to address the previously under-researched problem of code readability, balancing it against functional accuracy.

AINeutralarXiv – CS AI · Jun 56/10

🧠

DisasterBench: A Multimodal Benchmark for UAV-Based Disaster Response in Complex Environments

Researchers introduced DisasterBench, a multimodal AI benchmark designed to improve UAV-based disaster response by testing reasoning across 14 disaster types and 9 response-critical tasks. They also developed DisasterVL, a lightweight 2B-parameter model that achieves GPT-4o-level reasoning accuracy while operating efficiently on edge devices with limited computational resources.

🧠 GPT-4

AINeutralarXiv – CS AI · Jun 56/10

🧠

Bridging the Semantic-Collaborative Gap: An Asymmetric Graph Architecture for Cold-Start Item Recommendation

Researchers at Tubi have developed Shallow-RHS, a graph-based recommendation system that addresses the cold-start problem for new content by using asymmetric neural architectures. The model separates user-interaction modeling from content feature encoding, enabling immediate embeddings for newly ingested items while maintaining collaborative filtering capabilities in production environments.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Design a Reliable LLM-Integrated Interface for Mortality Forecasting

Researchers propose an LLM-integrated interface for mortality forecasting that translates natural language inputs into structured actuarial predictions while maintaining statistical rigor. The system uses a constrained orchestration layer to enhance accessibility for non-expert users without compromising reproducibility or analytical validity in high-stakes forecasting workflows.

← PrevPage 479 of 1377Next →