21,448 AI articles curated from 50+ sources with AI-powered sentiment analysis, importance scoring, and key takeaways.
AIBearisharXiv – CS AI · Mar 36/104
🧠A comprehensive study of 17 Large Language Models as automated annotators for Bangla hate speech detection reveals significant bias and instability issues. The research found that larger models don't necessarily perform better than smaller, task-specific ones, raising concerns about LLM reliability for sensitive annotation tasks in low-resource languages.
AIBullisharXiv – CS AI · Mar 36/103
🧠Researchers propose HIMM, a new memory framework for AI embodied agents that separates episodic and semantic memory to improve long-term performance. The system achieves significant gains on benchmarks, with 7.3% improvement in LLM-Match and 11.4% in LLM MatchXSPL, addressing key challenges in deploying multimodal language models as embodied agent brains.
AIBullisharXiv – CS AI · Mar 36/103
🧠Researchers developed WS-KAN, the first weight-space architecture designed specifically for Kolmogorov-Arnold Networks (KANs), which learns directly from neural network parameters. The study shows KANs share permutation symmetries with MLPs and introduces a graph representation to better understand their computation structure.
AIBullisharXiv – CS AI · Mar 36/102
🧠Researchers propose a new inference technique called "inner loop inference" that improves pretrained transformer models' performance by repeatedly applying selected layers during inference without additional training. The method yields consistent but modest accuracy improvements across benchmarks by allowing more refinement of internal representations.
AIBullisharXiv – CS AI · Mar 36/104
🧠Researchers evaluated HiFloat (HiF8 and HiF4) formats for low-bit inference on Ascend NPUs, finding them superior to integer formats for high-variance data and preventing accuracy collapse in 4-bit regimes. The study demonstrates HiFloat's compatibility with existing quantization frameworks and its potential for efficient large language model inference.
AIBullisharXiv – CS AI · Mar 36/102
🧠Researchers introduced SWE-MiniSandbox, a container-free method for training software engineering AI agents using reinforcement learning that reduces disk usage to 5% and environment setup time to 25% of traditional container-based approaches. The system uses kernel-level isolation and lightweight pre-caching instead of bulky container images while maintaining comparable performance.
AIBullisharXiv – CS AI · Mar 36/103
🧠Researchers propose Online Causal Kalman Filtering for Policy Optimization (KPO) to address high-variance instability in reinforcement learning for large language models. The method uses Kalman filtering to smooth token-level importance sampling ratios, preventing training collapse and achieving superior results on math reasoning tasks.
AINeutralarXiv – CS AI · Mar 35/103
🧠Researchers propose FIRE, a new reinitialization method for deep neural networks that balances stability and plasticity when learning from nonstationary data. The method uses mathematical optimization to maintain prior knowledge while adapting to new tasks, showing superior performance across visual learning, language modeling, and reinforcement learning domains.
AIBullisharXiv – CS AI · Mar 36/103
🧠Researchers developed a hybrid AI approach combining tensor decomposition with neural networks to improve MIMO channel estimation for 6G wireless systems under pilot signal limitations. The method achieves significant performance improvements over traditional approaches, with up to 13.11 dB better accuracy in specific scenarios.
AIBullisharXiv – CS AI · Mar 36/103
🧠Researchers have developed EDT-Former, an Entropy-guided Dynamic Token Transformer that improves how Large Language Models understand molecular graphs. The system achieves state-of-the-art results on molecular understanding benchmarks while being computationally efficient by avoiding costly LLM backbone fine-tuning.
AIBullisharXiv – CS AI · Mar 36/104
🧠Researchers introduce PSN-RLVR, a new reinforcement learning method that uses parameter-space noise to improve AI exploration and reasoning capabilities. The technique addresses limitations in existing approaches by enabling better discovery of new problem-solving strategies rather than just reweighting existing solutions.
AINeutralarXiv – CS AI · Mar 36/104
🧠Researchers introduce Vision-DeepResearch Benchmark (VDR-Bench) with 2,000 VQA instances to better evaluate multimodal AI systems' visual and textual search capabilities. The benchmark addresses limitations in existing evaluations where answers could be inferred without proper visual search, and proposes a multi-round cropped-search workflow to improve model performance.
$NEAR
AIBullisharXiv – CS AI · Mar 36/104
🧠Researchers developed CaCoVID, a reinforcement learning-based algorithm that compresses video tokens for large language models by selecting tokens based on their actual contribution to correct predictions rather than attention scores. The method uses combinatorial policy optimization to reduce computational overhead while maintaining video understanding performance.
AINeutralarXiv – CS AI · Mar 36/103
🧠A systematic review of 122 academic papers reveals significant gaps in privacy protection for youth using AI-enabled smart devices, with technical solutions dominating research (67%) while policy enforcement and educational integration remain underdeveloped. The study recommends a multi-stakeholder approach involving policymakers, manufacturers, and educators to create comprehensive privacy ecosystems for young users.
AIBullisharXiv – CS AI · Mar 36/104
🧠Researchers introduced TP-Blend, a training-free framework for diffusion models that enables simultaneous object and style blending using two separate text prompts. The system uses Cross-Attention Object Fusion and Self-Attention Style Fusion to produce high-resolution, photo-realistic edits with precise control over both content and appearance.
AINeutralarXiv – CS AI · Mar 35/105
🧠A research study analyzed privacy and usability trade-offs in AI smart devices (Google Home, Alexa, Siri) used by youth, finding that Google Home scored highest for usability while Siri led in regulatory compliance. The study revealed that while youth feel capable of managing their data, technical complexity and unclear policies limit their privacy control.
AINeutralarXiv – CS AI · Mar 35/104
🧠A study of 26 young Canadians reveals that smart voice assistants' complex privacy controls and lack of transparency discourage privacy-protective behaviors among youth. Researchers propose design improvements including unified privacy hubs, plain-language data labels, and clearer retention policies to empower young users while maintaining convenience.
AINeutralarXiv – CS AI · Mar 35/104
🧠Research study with 2,702 participants found that people react differently to AI based on whether they perceive it as sentient (able to feel) versus autonomous (self-governing). Sentience increased moral consideration and mind perception more than autonomy, while autonomy increased perceived threat levels.
AIBullisharXiv – CS AI · Mar 36/104
🧠A research study comparing AI-generated advice to human Reddit responses found that large language models like GPT-4o significantly outperformed crowd-sourced advice on effectiveness, warmth, and user satisfaction metrics. The study suggests human advice can be enhanced through AI polishing, pointing toward hybrid systems combining AI, crowd input, and expert oversight.
AIBullisharXiv – CS AI · Mar 36/104
🧠Researchers developed a parameter merging technique that allows robot AI policies to learn new tasks while preserving their existing generalist capabilities. The method interpolates weights between finetuned and pretrained models, preventing overfitting and enabling lifelong learning in robotics applications.
AINeutralarXiv – CS AI · Mar 36/104
🧠Researchers developed a lightweight AI model using unsupervised deep learning to detect conflict-related fires in Sudan within 24-30 hours using commercially available satellite imagery. The Variational Auto-Encoder (VAE) approach outperformed traditional methods in identifying burn signatures from 4-band Planet Labs satellite data at 3-meter resolution.
$CRV$NEAR
AINeutralarXiv – CS AI · Mar 35/104
🧠Researchers have created GGSS Personas, a comprehensive collection of survey-derived persona prompts based on the German General Social Survey that helps Large Language Models simulate human perspectives more accurately. The collection enables LLMs to generate responses aligned with the German population and outperforms existing classifiers, particularly when training data is limited.
AIBullisharXiv – CS AI · Mar 36/103
🧠Researchers introduce WavefrontDiffusion, a new dynamic decoding approach for Diffusion Language Models that improves text generation quality by expanding from finalized positions rather than using fixed blocks. The method achieves state-of-the-art performance on reasoning and code generation benchmarks while maintaining computational efficiency equivalent to existing block-based methods.
AIBullisharXiv – CS AI · Mar 36/103
🧠Researchers have developed GeoBPE, a new protein structure tokenization method that converts protein backbone structures into discrete geometric tokens, achieving over 10x compression and data efficiency improvements. The approach uses geometry-grounded byte-pair encoding to create hierarchical vocabularies of protein structural primitives that align with functional families and enable better multimodal protein modeling.
AIBullisharXiv – CS AI · Mar 36/104
🧠Researchers conducted the first comprehensive analysis of open-source direct preference optimization (DPO) datasets used to align large language models, revealing significant quality variations. They created UltraMix, a curated dataset that's 30% smaller than existing options while delivering superior performance across benchmarks.