🧠

AI

21,049 AI articles curated from 50+ sources with AI-powered sentiment analysis, importance scoring, and key takeaways.

21049 articles

AINeutralarXiv – CS AI · Mar 36/104

🧠

How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks

Researchers evaluated compact AI language models for 6G networks, finding that mid-scale models (1.5-3B parameters) offer the best balance of performance and computational efficiency for edge deployment. The study shows diminishing returns beyond 3B parameters, with accuracy improving from 22% at 135M to 70% at 7B parameters.

AINeutralarXiv – CS AI · Mar 36/103

🧠

Scaling Retrieval Augmented Generation with RAG Fusion: Lessons from an Industry Deployment

Research on production RAG systems reveals that retrieval fusion techniques like multi-query retrieval and reciprocal rank fusion increase raw document recall but fail to improve end-to-end performance due to re-ranking limits and context constraints. The study found fusion variants actually decreased accuracy from 0.51 to 0.48 while adding latency overhead without corresponding benefits.

AIBullisharXiv – CS AI · Mar 36/104

🧠

Cognitive Prosthetic: An AI-Enabled Multimodal System for Episodic Recall in Knowledge Work

Researchers have developed the Cognitive Prosthetic Multimodal System (CPMS), an AI-enabled proof-of-concept that helps knowledge workers recall workplace experiences by capturing speech, physiological signals, and gaze behavior into queryable episodic memories. The system processes data locally for privacy and allows natural language queries to retrieve past workplace interactions based on semantic content, time, attention, or physiological state.

AIBullisharXiv – CS AI · Mar 36/103

🧠

Detection-Gated Glottal Segmentation with Zero-Shot Cross-Dataset Transfer and Clinical Feature Extraction

Researchers developed a detection-gated AI pipeline combining YOLOv8 and U-Net for accurate glottal segmentation in medical videoendoscopy. The system achieved state-of-the-art performance with zero-shot transfer capabilities across different clinical datasets, enabling real-time extraction of vocal function biomarkers at 35 frames per second.

AIBullisharXiv – CS AI · Mar 36/104

🧠

"When to Hand Off, When to Work Together": Expanding Human-Agent Co-Creative Collaboration through Concurrent Interaction

Researchers developed CLEO, an AI system that enables real-time collaborative context awareness between humans and AI agents by interpreting concurrent user actions on shared artifacts. A study with professional designers identified key interaction patterns and decision factors for when to delegate work to AI versus collaborate directly.

AIBullisharXiv – CS AI · Mar 35/104

🧠

EstLLM: Enhancing Estonian Capabilities in Multilingual LLMs via Continued Pretraining and Post-Training

Researchers developed EstLLM, enhancing Estonian language capabilities in multilingual LLMs through continued pretraining of Llama 3.1 8B with balanced data mixtures. The approach improved Estonian linguistic performance while maintaining English capabilities, demonstrating that targeted continued pretraining can substantially improve single-language performance in multilingual models.

AIBullisharXiv – CS AI · Mar 36/104

🧠

MAP-Diff: Multi-Anchor Guided Diffusion for Progressive 3D Whole-Body Low-Dose PET Denoising

Researchers developed MAP-Diff, a multi-anchor guided diffusion framework that improves 3D whole-body PET scan denoising by using intermediate-dose scans as trajectory anchors. The method achieves significant improvements in image quality metrics, increasing PSNR from 42.48 dB to 43.71 dB while reducing radiation exposure for patients.

AINeutralarXiv – CS AI · Mar 35/104

🧠

Mitigating topology biases in Graph Diffusion via Counterfactual Intervention

Researchers have developed FairGDiff, a new AI model that addresses bias issues in graph diffusion models used for generating synthetic network data. The model uses counterfactual intervention to eliminate topology biases related to sensitive attributes like gender and age while maintaining data utility.

$LINK

AIBullisharXiv – CS AI · Mar 36/103

🧠

MatRIS: Toward Reliable and Efficient Pretrained Machine Learning Interaction Potentials

Researchers introduce MatRIS, a new machine learning interaction potential model for materials science that achieves comparable accuracy to leading equivariant models while being significantly more computationally efficient. The model uses attention-based three-body interactions with linear O(N) complexity, demonstrating strong performance on benchmarks like Matbench-Discovery with an F1 score of 0.847.

AINeutralarXiv – CS AI · Mar 36/104

🧠

AMemGym: Interactive Memory Benchmarking for Assistants in Long-Horizon Conversations

Researchers introduce AMemGym, an interactive benchmarking environment for evaluating and optimizing memory management in long-horizon conversations with AI assistants. The framework addresses limitations in current memory evaluation methods by enabling on-policy testing with LLM-simulated users and revealing performance gaps in existing memory systems like RAG and long-context LLMs.

AIBullisharXiv – CS AI · Mar 36/103

🧠

TiledAttention: a CUDA Tile SDPA Kernel for PyTorch

TiledAttention is a new CUDA-based scaled dot-product attention kernel for PyTorch that enables easier modification of attention mechanisms for AI research. It provides a balance between performance and customizability, delivering significant speedups over standard attention implementations while remaining directly editable from Python.

$DOT

AINeutralarXiv – CS AI · Mar 36/104

🧠

Ignore All Previous Instructions: Jailbreaking as a de-escalatory peace building practise to resist LLM social media bots

Researchers propose 'jailbreaking' as a user-driven method to counter LLM-powered social media manipulation by exposing automated bot behavior. The study suggests users can deliberately trigger AI safeguards to reveal misleading political narratives and reduce online conflict escalation.

AIBullisharXiv – CS AI · Mar 36/102

🧠

Probabilistic Retrofitting of Learned Simulators

Researchers developed a training-efficient method to convert pre-trained deterministic AI models for solving Partial Differential Equations into probabilistic ones using Continuous Ranked Probability Score (CRPS) retrofitting. The approach achieves 20-54% improvements in accuracy metrics while requiring minimal additional training costs compared to retraining models from scratch.

AINeutralarXiv – CS AI · Mar 35/104

🧠

physfusion: A Transformer-based Dual-Stream Radar and Vision Fusion Framework for Open Water Surface Object Detection

Researchers have developed PhysFusion, a new AI framework that combines radar and camera data to improve object detection on water surfaces for unmanned vessels. The system achieves up to 94.8% accuracy by using physics-informed processing to handle challenging maritime conditions like wave clutter and poor visibility.

AIBullisharXiv – CS AI · Mar 36/103

🧠

Explanation-Guided Adversarial Training for Robust and Interpretable Models

Researchers propose Explanation-Guided Adversarial Training (EGAT), a framework that combines adversarial training with explainable AI to create more robust and interpretable deep neural networks. The method achieves 37% improvement in adversarial accuracy while producing semantically meaningful explanations with only 16% increase in training time.

AIBearisharXiv – CS AI · Mar 37/105

🧠

Real Money, Fake Models: Deceptive Model Claims in Shadow APIs

A systematic audit of 17 shadow APIs used in 187 academic papers reveals widespread deception, with performance divergence up to 47.21% and identity verification failures in 45.83% of tests. These third-party services claim to provide access to frontier LLMs like GPT-5 and Gemini-2.5 but deliver inconsistent outputs, undermining research validity and reproducibility.

AIBullisharXiv – CS AI · Mar 36/105

🧠

Agentic Code Reasoning

Researchers introduce 'semi-formal reasoning' for LLM agents to analyze code semantics without execution, showing significant accuracy improvements across multiple tasks. The methodology achieves 88-93% accuracy on patch verification and 87% on code question answering, potentially enabling practical applications in automated code review and static analysis.

AIBullisharXiv – CS AI · Mar 36/103

🧠

FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding

FluxMem is a new training-free framework for streaming video understanding that uses hierarchical memory compression to reduce computational costs. The system achieves state-of-the-art performance on video benchmarks while reducing latency by 69.9% and GPU memory usage by 34.5%.

AIBullisharXiv – CS AI · Mar 36/104

🧠

Closed-Loop Action Chunks with Dynamic Corrections for Training-Free Diffusion Policy

Researchers have developed DCDP, a Dynamic Closed-Loop Diffusion Policy framework that significantly improves robotic manipulation in dynamic environments. The system achieves 19% better adaptability without retraining while requiring only 5% additional computational overhead through real-time action correction and environmental dynamics integration.

AINeutralarXiv – CS AI · Mar 37/108

🧠

Diagnosing Generalization Failures from Representational Geometry Markers

Researchers propose a new approach to predict AI model failures by analyzing geometric properties of data representations rather than reverse-engineering internal mechanisms. They found that reduced manifold dimensionality and utility in training data consistently predict poor performance on out-of-distribution tasks across different architectures and datasets.

AIBullisharXiv – CS AI · Mar 36/105

🧠

Co-Evolutionary Multi-Modal Alignment via Structured Adversarial Evolution

Researchers introduce CEMMA, a co-evolutionary framework for improving AI safety alignment in multimodal large language models. The system uses evolving adversarial attacks and adaptive defenses to create more robust AI systems that better resist jailbreak attempts while maintaining functionality.

AIBullisharXiv – CS AI · Mar 37/104

🧠

FreeAct: Freeing Activations for LLM Quantization

Researchers propose FreeAct, a new quantization framework for Large Language Models that improves efficiency by using dynamic transformation matrices for different token types. The method achieves up to 5.3% performance improvement over existing approaches by addressing the memory and computational overhead challenges in LLMs.

AIBullisharXiv – CS AI · Mar 37/105

🧠

CHLU: The Causal Hamiltonian Learning Unit as a Symplectic Primitive for Deep Learning

Researchers propose the Causal Hamiltonian Learning Unit (CHLU), a physics-based deep learning primitive that addresses stability issues in temporal dynamics models. The CHLU uses symplectic integration and Hamiltonian structure to maintain infinite-horizon stability while preserving information, potentially solving the memory-stability trade-off in neural networks.

AIBullisharXiv – CS AI · Mar 37/104

🧠

Modular Memory is the Key to Continual Learning Agents

Researchers propose combining In-Weight Learning (IWL) and In-Context Learning (ICL) through modular memory architectures to solve continual learning challenges in AI. The framework aims to enable AI agents to continuously adapt and accumulate knowledge without catastrophic forgetting, addressing key limitations of current foundation models.

AIBullisharXiv – CS AI · Mar 37/105

🧠

ALTER: Asymmetric LoRA for Token-Entropy-Guided Unlearning of LLMs

Researchers introduce ALTER, a new framework for efficiently "unlearning" specific knowledge from large language models while preserving their overall utility. The system uses asymmetric LoRA architecture to selectively forget targeted information with 95% effectiveness while maintaining over 90% model utility, significantly outperforming existing methods.

← PrevPage 533 of 842Next →