🧠

AI

21,448 AI articles curated from 50+ sources with AI-powered sentiment analysis, importance scoring, and key takeaways.

21448 articles

AIBearisharXiv – CS AI · Mar 36/104

🧠

Are LLMs Ready to Replace Bangla Annotators?

A comprehensive study of 17 Large Language Models as automated annotators for Bangla hate speech detection reveals significant bias and instability issues. The research found that larger models don't necessarily perform better than smaller, task-specific ones, raising concerns about LLM reliability for sensitive annotation tasks in low-resource languages.

AIBullisharXiv – CS AI · Mar 36/103

🧠

HIMM: Human-Inspired Long-Term Memory Modeling for Embodied Exploration and Question Answering

Researchers propose HIMM, a new memory framework for AI embodied agents that separates episodic and semantic memory to improve long-term performance. The system achieves significant gains on benchmarks, with 7.3% improvement in LLM-Match and 11.4% in LLM MatchXSPL, addressing key challenges in deploying multimodal language models as embodied agent brains.

AIBullisharXiv – CS AI · Mar 36/103

🧠

A Graph Meta-Network for Learning on Kolmogorov-Arnold Networks

Researchers developed WS-KAN, the first weight-space architecture designed specifically for Kolmogorov-Arnold Networks (KANs), which learns directly from neural network parameters. The study shows KANs share permutation symmetries with MLPs and introduces a graph representation to better understand their computation structure.

AIBullisharXiv – CS AI · Mar 36/102

🧠

Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training

Researchers propose a new inference technique called "inner loop inference" that improves pretrained transformer models' performance by repeatedly applying selected layers during inference without additional training. The method yields consistent but modest accuracy improvements across benchmarks by allowing more refinement of internal representations.

AIBullisharXiv – CS AI · Mar 36/104

🧠

Unleashing Low-Bit Inference on Ascend NPUs: A Comprehensive Evaluation of HiFloat Formats

Researchers evaluated HiFloat (HiF8 and HiF4) formats for low-bit inference on Ascend NPUs, finding them superior to integer formats for high-variance data and preventing accuracy collapse in 4-bit regimes. The study demonstrates HiFloat's compatibility with existing quantization frameworks and its potential for efficient large language model inference.

AIBullisharXiv – CS AI · Mar 36/102

🧠

SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents

Researchers introduced SWE-MiniSandbox, a container-free method for training software engineering AI agents using reinforcement learning that reduces disk usage to 5% and environment setup time to 25% of traditional container-based approaches. The system uses kernel-level isolation and lightweight pre-caching instead of bulky container images while maintaining comparable performance.

AIBullisharXiv – CS AI · Mar 36/103

🧠

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Researchers propose Online Causal Kalman Filtering for Policy Optimization (KPO) to address high-variance instability in reinforcement learning for large language models. The method uses Kalman filtering to smooth token-level importance sampling ratios, preventing training collapse and achieving superior results on math reasoning tasks.

AINeutralarXiv – CS AI · Mar 35/103

🧠

FIRE: Frobenius-Isometry Reinitialization for Balancing the Stability-Plasticity Tradeoff

Researchers propose FIRE, a new reinitialization method for deep neural networks that balances stability and plasticity when learning from nonstationary data. The method uses mathematical optimization to maintain prior knowledge while adapting to new tasks, showing superior performance across visual learning, language modeling, and reinforcement learning domains.

AIBullisharXiv – CS AI · Mar 36/103

🧠

Structure-Informed Estimation for Pilot-Limited MIMO Channels via Tensor Decomposition

Researchers developed a hybrid AI approach combining tensor decomposition with neural networks to improve MIMO channel estimation for 6G wireless systems under pilot signal limitations. The method achieves significant performance improvements over traditional approaches, with up to 13.11 dB better accuracy in specific scenarios.

AIBullisharXiv – CS AI · Mar 36/103

🧠

Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding

Researchers have developed EDT-Former, an Entropy-guided Dynamic Token Transformer that improves how Large Language Models understand molecular graphs. The system achieves state-of-the-art results on molecular understanding benchmarks while being computationally efficient by avoiding costly LLM backbone fine-tuning.

AIBullisharXiv – CS AI · Mar 36/104

🧠

Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards

Researchers introduce PSN-RLVR, a new reinforcement learning method that uses parameter-space noise to improve AI exploration and reasoning capabilities. The technique addresses limitations in existing approaches by enabling better discovery of new problem-solving strategies rather than just reweighting existing solutions.

AINeutralarXiv – CS AI · Mar 36/104

🧠

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Researchers introduce Vision-DeepResearch Benchmark (VDR-Bench) with 2,000 VQA instances to better evaluate multimodal AI systems' visual and textual search capabilities. The benchmark addresses limitations in existing evaluations where answers could be inferred without proper visual search, and proposes a multi-round cropped-search workflow to improve model performance.

$NEAR

AIBullisharXiv – CS AI · Mar 36/104

🧠

Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning

Researchers developed CaCoVID, a reinforcement learning-based algorithm that compresses video tokens for large language models by selecting tokens based on their actual contribution to correct predictions rather than attention scores. The method uses combinatorial policy optimization to reduce computational overhead while maintaining video understanding performance.

AINeutralarXiv – CS AI · Mar 36/103

🧠

Toward Youth-Centered Privacy-by-Design in Smart Devices: A Systematic Review

A systematic review of 122 academic papers reveals significant gaps in privacy protection for youth using AI-enabled smart devices, with technical solutions dominating research (67%) while policy enforcement and educational integration remain underdeveloped. The study recommends a multi-stakeholder approach involving policymakers, manufacturers, and educators to create comprehensive privacy ecosystems for young users.

AIBullisharXiv – CS AI · Mar 36/104

🧠

TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models

Researchers introduced TP-Blend, a training-free framework for diffusion models that enables simultaneous object and style blending using two separate text prompts. The system uses Cross-Attention Object Fusion and Self-Attention Style Fusion to produce high-resolution, photo-realistic edits with precise control over both content and appearance.

AINeutralarXiv – CS AI · Mar 35/105

🧠

Balancing Usability and Compliance in AI Smart Devices: A Privacy-by-Design Audit of Google Home, Alexa, and Siri

A research study analyzed privacy and usability trade-offs in AI smart devices (Google Home, Alexa, Siri) used by youth, finding that Google Home scored highest for usability while Siri led in regulatory compliance. The study revealed that while youth feel capable of managing their data, technical complexity and unclear policies limit their privacy control.

AINeutralarXiv – CS AI · Mar 35/104

🧠

Convenience vs. Control: A Qualitative Study of Youth Privacy with Smart Voice Assistants

A study of 26 young Canadians reveals that smart voice assistants' complex privacy controls and lack of transparency discourage privacy-protective behaviors among youth. Researchers propose design improvements including unified privacy hubs, plain-language data labels, and clearer retention policies to empower young users while maintaining convenience.

AINeutralarXiv – CS AI · Mar 35/104

🧠

Mental Models of Autonomy and Sentience Shape Reactions to AI

Research study with 2,702 participants found that people react differently to AI based on whether they perceive it as sentient (able to feel) versus autonomous (self-governing). Sentience increased moral consideration and mind perception more than autonomy, while autonomy increased perceived threat levels.

AIBullisharXiv – CS AI · Mar 36/104

🧠

When AI Gives Advice: Evaluating AI and Human Responses to Online Advice-Seeking for Well-Being

A research study comparing AI-generated advice to human Reddit responses found that large language models like GPT-4o significantly outperformed crowd-sourced advice on effectiveness, warmth, and user satisfaction metrics. The study suggests human advice can be enhanced through AI polishing, pointing toward hybrid systems combining AI, crowd input, and expert oversight.

AIBullisharXiv – CS AI · Mar 36/104

🧠

Robust Finetuning of Vision-Language-Action Robot Policies via Parameter Merging

Researchers developed a parameter merging technique that allows robot AI policies to learn new tasks while preserving their existing generalist capabilities. The method interpolates weights between finetuned and pretrained models, preventing overfitting and enabling lifelong learning in robotics applications.

AINeutralarXiv – CS AI · Mar 36/104

🧠

Near--Real-Time Conflict-Related Fire Detection in Sudan Using Unsupervised Deep Learning

Researchers developed a lightweight AI model using unsupervised deep learning to detect conflict-related fires in Sudan within 24-30 hours using commercially available satellite imagery. The Variational Auto-Encoder (VAE) approach outperformed traditional methods in identifying burn signatures from 4-band Planet Labs satellite data at 3-meter resolution.

$CRV$NEAR

AINeutralarXiv – CS AI · Mar 35/104

🧠

German General Social Survey Personas: A Survey-Derived Persona Prompt Collection for Population-Aligned LLM Studies

Researchers have created GGSS Personas, a comprehensive collection of survey-derived persona prompts based on the German General Social Survey that helps Large Language Models simulate human perspectives more accurately. The collection enables LLMs to generate responses aligned with the German population and outperforms existing classifiers, particularly when training data is limited.

AIBullisharXiv – CS AI · Mar 36/103

🧠

WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning

Researchers introduce WavefrontDiffusion, a new dynamic decoding approach for Diffusion Language Models that improves text generation quality by expanding from finalized positions rather than using fixed blocks. The method achieves state-of-the-art performance on reasoning and code generation benchmarks while maintaining computational efficiency equivalent to existing block-based methods.

AIBullisharXiv – CS AI · Mar 36/103

🧠

Protein Structure Tokenization via Geometric Byte Pair Encoding

Researchers have developed GeoBPE, a new protein structure tokenization method that converts protein backbone structures into discrete geometric tokens, achieving over 10x compression and data efficiency improvements. The approach uses geometry-grounded byte-pair encoding to create hierarchical vocabularies of protein structural primitives that align with functional families and enable better multimodal protein modeling.

AIBullisharXiv – CS AI · Mar 36/104

🧠

When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets

Researchers conducted the first comprehensive analysis of open-source direct preference optimization (DPO) datasets used to align large language models, revealing significant quality variations. They created UltraMix, a curated dataset that's 30% smaller than existing options while delivering superior performance across benchmarks.

← PrevPage 547 of 858Next →