y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto
🤖All29,831🧠AI12,757⛓️Crypto10,810💎DeFi1,117🤖AI × Crypto549📰General4,598
🧠

AI

9,317 AI articles curated from 50+ sources with AI-powered sentiment analysis, importance scoring, and key takeaways.

9317 articles
AIBullisharXiv – CS AI · Feb 276/107
🧠

Requesting Expert Reasoning: Augmenting LLM Agents with Learned Collaborative Intervention

Researchers introduce AHCE (Active Human-Augmented Challenge Engagement), a framework that enables AI agents to collaborate with human experts more effectively through learned policies. The system achieved 32% improvement on normal difficulty tasks and 70% on difficult tasks in Minecraft experiments by treating humans as interactive reasoning tools rather than simple help sources.

AIBullisharXiv – CS AI · Feb 276/104
🧠

Agentic AI for Intent-driven Optimization in Cell-free O-RAN

Researchers propose an agentic AI framework using multiple LLM-based agents to optimize cell-free Open RAN networks through intent-driven automation. The system reduces active radio units by 42% in energy-saving mode while cutting memory usage by 92% through parameter-efficient fine-tuning.

AINeutralarXiv – CS AI · Feb 276/104
🧠

Correcting Human Labels for Rater Effects in AI Evaluation: An Item Response Theory Approach

Researchers propose using psychometric modeling to correct systematic biases in human evaluations of AI systems, demonstrating how Item Response Theory can separate true AI output quality from rater behavior inconsistencies. The approach was tested on OpenAI's summarization dataset and showed improved reliability in measuring AI model performance.

AINeutralarXiv – CS AI · Feb 275/102
🧠

Cognitive Models and AI Algorithms Provide Templates for Designing Language Agents

Researchers propose using cognitive models and AI algorithms as templates for designing modular language agents that combine multiple large language models. The position paper formalizes agent templates that specify roles for individual LLMs and how their functionalities should be composed to solve complex problems beyond single model capabilities.

AIBullisharXiv – CS AI · Feb 276/107
🧠

Strategy Executability in Mathematical Reasoning: Leveraging Human-Model Differences for Effective Guidance

Researchers identified why AI mathematical reasoning guidance is inconsistent and developed Selective Strategy Retrieval (SSR), a framework that improves AI math performance by combining human and model strategies. The method showed significant improvements of up to 13 points on mathematical benchmarks by addressing the gap between strategy usage and executability.

AIBullisharXiv – CS AI · Feb 275/104
🧠

Mapping the Landscape of Artificial Intelligence in Life Cycle Assessment Using Large Language Models

Researchers conducted a comprehensive review of artificial intelligence applications in life cycle assessment (LCA) using large language models to analyze trends and patterns. The study found dramatic growth in AI adoption for environmental assessments, with a notable shift toward LLM-driven approaches and strong correlations between AI methods and LCA stages.

AINeutralarXiv – CS AI · Feb 275/105
🧠

CWM: Contrastive World Models for Action Feasibility Learning in Embodied Agent Pipelines

Researchers propose Contrastive World Models (CWM), a new approach for training AI agents to better distinguish between physically feasible and infeasible actions in embodied environments. The method uses contrastive learning with hard negative examples to outperform traditional supervised fine-tuning, achieving 6.76 percentage point improvement in precision and better safety margins under stress conditions.

AIBullisharXiv – CS AI · Feb 276/106
🧠

A Framework for Assessing AI Agent Decisions and Outcomes in AutoML Pipelines

Researchers propose an Evaluation Agent framework to assess AI agent decision-making in AutoML pipelines, moving beyond outcome-focused metrics to evaluate intermediate decisions. The system can detect faulty decisions with 91.9% F1 score and reveals impacts ranging from -4.9% to +8.3% in final performance metrics.

AIBearisharXiv – CS AI · Feb 276/106
🧠

ConstraintBench: Benchmarking LLM Constraint Reasoning on Direct Optimization

Researchers introduced ConstraintBench, a new benchmark testing whether large language models can directly solve constrained optimization problems without external solvers. The study found that even the best frontier models only achieve 65% constraint satisfaction, with feasibility being a bigger challenge than optimality.

AINeutralarXiv – CS AI · Feb 276/105
🧠

How Do Latent Reasoning Methods Perform Under Weak and Strong Supervision?

Researchers analyzed latent reasoning methods in AI, which perform multi-step reasoning in continuous latent spaces rather than textual spaces. The study reveals two key issues: pervasive shortcut behavior where models achieve high accuracy without actual latent reasoning, and a failure to implement structured search despite encoding multiple possibilities.

AIBullisharXiv – CS AI · Feb 276/106
🧠

SideQuest: Model-Driven KV Cache Management for Long-Horizon Agentic Reasoning

Researchers introduce SideQuest, a novel KV cache management system that uses Large Reasoning Models to compress memory usage during long-horizon AI tasks. The system reduces peak token usage by up to 65% while maintaining accuracy by having the model itself determine which tokens are useful to keep in memory.

AIBullisharXiv – CS AI · Feb 276/104
🧠

HARU-Net: Hybrid Attention Residual U-Net for Edge-Preserving Denoising in Cone-Beam Computed Tomography

Researchers developed HARU-Net, a novel AI architecture for denoising cone-beam computed tomography (CBCT) medical images that outperforms existing state-of-the-art methods while using less computational resources. The system addresses critical noise issues in low-dose dental and maxillofacial imaging by combining hybrid attention mechanisms with residual U-Net architecture.

AIBullisharXiv – CS AI · Feb 276/106
🧠

RLHFless: Serverless Computing for Efficient RLHF

Researchers introduce RLHFless, a serverless computing framework for Reinforcement Learning from Human Feedback (RLHF) that addresses resource inefficiencies in training large language models. The system achieves up to 1.35x speedup and 44.8% cost reduction compared to existing solutions by dynamically adapting to resource demands and optimizing workload distribution.

AINeutralOpenAI News · Feb 276/105
🧠

An update on our mental health-related work

OpenAI provides updates on its mental health safety initiatives, including new parental controls, trusted contact features, and enhanced distress detection capabilities. The company also addresses recent litigation developments related to its mental health work.

AIBullishThe Verge – AI · Feb 266/104
🧠

Microsoft’s Copilot Tasks AI uses its own computer to get things done

Microsoft announced Copilot Tasks, a new AI system that handles background tasks using cloud-based computers and browsers. The feature can schedule appointments, generate study plans, and complete various jobs on recurring, scheduled, or one-time basis using natural language commands.

AIBullishWired – AI · Feb 266/105
🧠

This AI Agent Is Designed to Not Go Rogue

IronCurtain is a new open source project that implements a unique security method to constrain AI assistant agents and prevent them from going rogue. The project aims to provide safeguards for AI systems before they can cause disruption to users' digital environments.

AIBearishWired – AI · Feb 266/106
🧠

How Chinese AI Chatbots Censor Themselves

Stanford and Princeton researchers discovered that Chinese AI chatbots exhibit significantly more censorship behaviors than Western models, frequently avoiding political topics or providing inaccurate responses. This highlights the growing divide in AI development approaches between China and Western countries, with implications for AI transparency and reliability.

AINeutralDecrypt – AI · Feb 266/106
🧠

Google Drops Nano Banana 2: The New King of AI Image Generation?

Google has released Nano Banana 2, an AI image generation model that combines professional-level world knowledge with high-speed performance and improved text handling. The launch faces immediate competition from ByteDance's newly announced Seedream 5 model in the AI image generation space.

Google Drops Nano Banana 2: The New King of AI Image Generation?
AIBullishTechCrunch – AI · Feb 266/106
🧠

Mistral AI inks a deal with global consulting giant Accenture

Mistral AI has secured a partnership with global consulting giant Accenture, joining competitors OpenAI and Anthropic who have also recently formed partnerships with the same firm. This partnership positions Mistral AI alongside major AI players in Accenture's enterprise AI consulting portfolio.

AIBullishMicrosoft Research Blog · Feb 266/102
🧠

CORPGEN advances AI agents for real work

Microsoft Research introduces CORPGEN, a new approach to advance AI agents for real-world workplace scenarios. The system aims to help AI agents handle multiple interdependent tasks simultaneously, similar to how knowledge workers juggle various responsibilities throughout their workday.

← PrevPage 187 of 373Next →
Filters
Sentiment
Importance
Sort
Stay Updated
Everything combined