AI Pulse News

Models, papers, tools. 34,383 articles with AI-powered sentiment analysis and key takeaways.

34383 articles

AINeutralarXiv – CS AI · Jun 56/10

🧠

Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration

Researchers introduce ALMANAC, a dataset of 2,987 annotated human collaboration actions designed to teach AI agents how to maintain mental models during teamwork. The dataset, built from the Map Task routing exercise, includes theory-informed annotations tracking participants' reasoning, partner intent perception, and shared goals—addressing a critical gap in training collaborative AI systems beyond task completion.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Risk Assessment of Autonomous Driving: Integrating Technical Failures, Ethical Dilemmas, and Policy Frameworks

Researchers analyzing autonomous vehicle safety data from NHTSA, California DMV, and MIT datasets identify perception and classification errors as primary technical failure modes, while highlighting divergent ethical frameworks and inconsistent regulatory approaches across jurisdictions as critical barriers to safe, widespread deployment.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Agent Memory: Characterization and System Implications of Stateful Long-Horizon Workloads

Researchers present the first comprehensive systems characterization of LLM agent memory architectures, introducing a taxonomy and profiling framework to analyze how different design choices impact performance across write and read paths. The study benchmarks ten representative systems and derives actionable recommendations for optimizing agent memory at scale.

AIBullisharXiv – CS AI · Jun 56/10

🧠

Goedel-Architect: Streamlining Formal Theorem Proving with Blueprint Generation and Refinement

Goedel-Architect is a new AI framework for formal theorem proving that uses blueprint generation and refinement to achieve state-of-the-art results on mathematical benchmarks. Built on DeepSeek-V4-Flash, it demonstrates significant improvements in solving complex mathematical problems while maintaining cost efficiency up to 500x lower than comparable solutions.

AINeutralarXiv – CS AI · Jun 56/10

🧠

RAINO: Anchoring Agents in Reality, A Systematic Review and Conceptual Framework for Realism in Agent-Based Modelling

Researchers present RAINO, a systematic framework addressing how realism is poorly defined and inconsistently operationalized in Agent-Based Models. The framework identifies Reality Anchors (empirical data, theory, expert knowledge) and their application as inputs or outputs, providing a conceptual foundation for evaluating and developing more realistic computational models.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Predict and Reconstruct: Joint Objectives for Self-Supervised Language Representation Learning

Researchers propose a hybrid pre-training approach for language models that combines masked language modeling with a JEPA-style latent-space prediction objective, creating more semantically-aligned embeddings with better geometric properties than traditional MLM-only approaches despite achieving similar downstream accuracy.

🏢 Nvidia

AINeutralarXiv – CS AI · Jun 56/10

🧠

PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis

A research paper demonstrates that parameter-efficient fine-tuning of small language models (3B parameters) using LoRA achieves competitive performance for telecommunications customer support while consuming significantly less energy than larger models. Critically, the study reveals that traditional validation loss metrics poorly predict real-world conversational quality, with the lowest-loss model ranking 6th-7th in human-aligned evaluation while the worst-loss model ranked first.

🧠 GPT-5🧠 Claude🧠 Gemini

AINeutralarXiv – CS AI · Jun 56/10

🧠

The Virtual Roundtable: Multi-Agent Personas Simulating the Dynamics of Human Brainstorming

Researchers present a multi-agent AI system that simulates human brainstorming through diverse AI personas engaging in structured roundtable discussions. The architecture uses divergent and convergent thinking phases to generate and evaluate ideas while minimizing groupthink, demonstrated through a case study on AI smart glasses product concepts.

AINeutralarXiv – CS AI · Jun 56/10

🧠

From Scoring to Explanations: Evaluating SHAP and LLM Rationales for Rubric-based Teaching Quality Assessment

Researchers propose a framework combining SHAP explainability with LLM-generated rationales to improve transparency in automated rubric-based scoring systems for educational assessment. Testing on classroom transcripts reveals fine-tuned language models outperform LLMs in accuracy, but SHAP attributions provide more faithful and transferable explanations than LLM rationales across different model architectures.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Multi-Granularity Reasoning for Natural Language Inference

Researchers propose Multi-Granularity Reasoning Network (MGRN), a novel approach to Natural Language Inference that processes semantic information across multiple hierarchical levels rather than relying solely on final-layer transformer representations. The framework demonstrates improved performance on NLI benchmarks by explicitly separating lexical, phrasal, and contextual semantic features.

AIBearisharXiv – CS AI · Jun 56/10

🧠

Geographic Bias and Diversity in AI Evaluation

A comprehensive literature review examines geographic bias in AI systems, revealing that foundation models encode structural imbalances in training data that disproportionately favor certain regions while underrepresenting others. The research identifies representation gaps, regional factual recall disparities, and the tendency of generative AI to default to prototypical Western places, establishing measurable benchmarks for evaluating geographic diversity across different model parameters and output types.

AIBearisharXiv – CS AI · Jun 56/10

🧠

Assessing the Geographic Diversity of AI's Platial Representations in Image Generation

Researchers evaluated geographic diversity in AI image generation models (GPT and DALL-E), finding that these systems produce stereotypical representations of places due to underlying model homogeneity. The study reveals counterintuitive results: older models sometimes show greater geographic diversity despite lower image quality, and the systems consistently depict identical prototypical features for specific locations.

🧠 DALL E

AINeutralarXiv – CS AI · Jun 56/10

🧠

Temporal Preference Concepts and their Functions in a Large Language Model

Researchers have identified how Large Language Models internally represent and process temporal preferences—the tradeoff between immediate gains and long-term consequences. The study reveals that LLMs discount future outcomes less steeply than humans but exhibit unstable preferences across contexts, suggesting that explicit control mechanisms rather than implicit training are necessary for reliable decision-making.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Finite Element-Based Material Learning via Automatic Differentiation: Learning constitutive neural network models from full-field deformation data

Researchers have developed FE-MAD, a differentiable machine learning framework that integrates neural networks into finite element solvers to identify material properties from experimental deformation data. The method combines the flexibility of neural networks with the physical rigor of finite element analysis, demonstrated on hyperelastic material characterization across multiple experimental datasets without requiring manual surrogate models or analytic adjoints.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Ontology-constrained multi-LLM scoring of hypothesis support in the predictive processing literature

Researchers developed a multi-LLM pipeline that uses ontology-constrained scoring to synthesize fragmented predictive coding neuroscience literature into quantifiable evidence spaces. The system scored 31 studies across ten language models using a 36-concept glossary, revealing structured disagreement patterns between experimental contexts and introducing 'hypothesis-space temperature' as a novel metric for measuring research dispersion.

AINeutralarXiv – CS AI · Jun 56/10

🧠

The Score Hamiltonian: Mapping Diffusion Models to Adiabatic Transport

Researchers establish a mathematical correspondence between score-based diffusion models and quantum adiabatic transport, revealing that sampling performance is fundamentally limited by the ratio of score-matching error to spectral gap. This theoretical breakthrough provides new bounds for density reconstruction and principled methods for designing annealing schedules in generative AI systems.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Gradient Descent with Large Step Size Restores Symmetry in Deep Linear Networks with Multi-Pathway

Researchers demonstrate that discrete Gradient Descent with large step sizes produces fundamentally different training dynamics in deep linear networks compared to continuous Gradient Flow. Their analysis reveals that multi-pathway networks redistribute signals across pathways during later training stages rather than concentrating them in single pathways, challenging prevailing theoretical predictions and suggesting that optimization step size significantly influences neural network representation learning.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Where's the Structure? A Systematic Literature Review of Empirical Research on Human-AI Collaboration and Hybrid Intelligence for Learning

A systematic literature review of 62 empirical studies examines human-AI collaboration in educational settings, finding that unstructured interaction between humans and AI produces suboptimal learning outcomes. The research identifies key design principles and structural frameworks that educational technologists can apply to create more effective AI-enhanced learning systems.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Differentiable Efficient Operator Search

Researchers propose Efficient Operator Search, a differentiable framework that automates the design of token-reduction operators for multimodal foundation models. The approach unifies previously distinct manual techniques like pruning and merging into a shared search space, discovering hybrid operators that achieve better accuracy-efficiency trade-offs than hand-designed baselines.

AINeutralarXiv – CS AI · Jun 56/10

🧠

From Attack Simulation to SIEM Rule: Deterministic Detection-as-Code Synthesis with Probe-Level Traceability

Researchers present a deterministic synthesis method that automatically converts findings from attack simulation tools into SIEM detection rules, eliminating manual translation work. The system uses a 23-template library indexed by OWASP categories to map security probe findings to Sigma rules with full traceability to originating attacks, achieving 100% parseability across multiple backends.

AINeutralarXiv – CS AI · Jun 56/10

🧠

NIV: Neural Axis Variations for Variable Font Generation

Researchers introduce NIV (Neural Axis Variations), an AI method that automatically converts static fonts into variable fonts by predicting per-point glyph displacements across design axes like weight and width. Trained on over one million font variations from Google Fonts, the model generalizes across unseen fonts, scripts, and even handwriting, with outputs compatible with standard rendering engines.

AINeutralarXiv – CS AI · Jun 56/10

🧠

X-Band UAV-enabled Integrated Sensing and Communications for Vehicular Networks

Researchers present an optimization framework for UAV-enabled integrated sensing and communication systems operating in the X-band for vehicular networks. The study analyzes time allocation trade-offs between sensing accuracy and communication performance, considering practical UAV constraints and fading channel effects, with results demonstrating adaptive strategies responsive to channel conditions.

AIBullisharXiv – CS AI · Jun 56/10

🧠

Personal AI Agent for Camera Roll VQA

Researchers introduce camroll, a dataset and AI agent system designed to answer questions about personal photo libraries by retrieving and analyzing relevant images from users' camera rolls. The camroll-agent uses hierarchical memory and specialized tools to handle long-context visual reasoning across thousands of personalized images, outperforming existing baselines in understanding user-specific visual content.

AINeutralarXiv – CS AI · Jun 56/10

🧠

LoRi: Low-Rank Distillation for Implicit Reasoning

Researchers propose LoRi, a low-rank distillation framework that improves implicit chain-of-thought reasoning in large language models by aligning teacher-student model trajectories in a shared low-rank tensor subspace. The method addresses the performance gap between implicit and explicit reasoning approaches, showing consistent improvements across LLaMA and Qwen model families on mathematical benchmarks.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network

Researchers propose a continuous-time mathematical model for analyzing gradient descent dynamics in the Edge of Stability regime, where large learning rates cause oscillations in neural network training. The model introduces an effective free energy framework that combines risk with a curvature-related term, enabling better prediction of training dynamics in wide two-layer networks and validated on matrix factorization and CIFAR-10 tasks.

← PrevPage 476 of 1376Next →