y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#artificial-intelligence News & Analysis

750 articles tagged with #artificial-intelligence. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

750 articles
AIBullisharXiv – CS AI · Mar 166/10
🧠

When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling

Researchers have developed SAFE, a new framework for ensembling Large Language Models that selectively combines models at specific token positions rather than every token. The method improves both accuracy and efficiency in long-form text generation by considering tokenization mismatches and consensus in probability distributions.

AINeutralDecrypt – AI · Mar 157/10
🧠

What Is AGI? The AI Goal Everyone Talks About But No One Can Clearly Define

Artificial General Intelligence (AGI) remains poorly defined despite widespread discussion in Silicon Valley and the tech industry. Experts highlight the lack of clear metrics or arrival points for determining when AGI has been achieved, creating ambiguity around this widely-promoted AI milestone.

What Is AGI? The AI Goal Everyone Talks About But No One Can Clearly Define
AINeutralFortune Crypto · Mar 147/10
🧠

We need a new Turing test — and Moltbook just proved it

Moltbook, an AI platform, has demonstrated capabilities that suggest current AI evaluation methods like the Turing test may be inadequate. The platform's feed contained content that appeared to showcase advanced AI reasoning beyond typical chatbot interactions.

We need a new Turing test — and Moltbook just proved it
AIBullisharXiv – CS AI · Mar 126/10
🧠

Resource-constrained Amazons chess decision framework integrating large language models and graph attention

Researchers developed a lightweight AI framework for the Game of the Amazons that combines graph attention networks with large language models, achieving 15-56% improvement in decision accuracy while using minimal computational resources. The hybrid approach demonstrates weak-to-strong generalization by leveraging GPT-4o-mini for synthetic training data and graph-based learning for structural reasoning.

🧠 GPT-4
AIBullisharXiv – CS AI · Mar 126/10
🧠

Designing Service Systems from Textual Evidence

Researchers developed PP-LUCB, an algorithm that efficiently identifies optimal service system configurations by combining biased AI evaluation with selective human audits. The method reduces human audit costs by 90% while maintaining accuracy in selecting the best performing systems from textual evidence like customer support transcripts.

AIBullisharXiv – CS AI · Mar 126/10
🧠

Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents

Researchers propose a novel self-finetuning framework for AI agents that enables continuous learning without handcrafted rewards, demonstrating superior performance in dynamic Radio Access Network slicing tasks. The approach uses bi-perspective reflection to generate autonomous feedback and distill long-term experiences into model parameters, outperforming traditional reinforcement learning methods.

AINeutralarXiv – CS AI · Mar 126/10
🧠

Prompts and Prayers: the Rise of GPTheology

A research paper introduces the concept of 'GPTheology' - the phenomenon of AI being perceived and treated as divine entities in modern culture. The study examines how AI interactions are developing ritualistic qualities and new belief systems through analysis of online communities and real-world projects like AI-powered religious statues.

🧠 ChatGPT
AIBullisharXiv – CS AI · Mar 126/10
🧠

Emulating Clinician Cognition via Self-Evolving Deep Clinical Research

Researchers developed DxEvolve, a self-evolving AI diagnostic system that mimics clinical reasoning through interactive workflows and continuous learning. The system achieved 90.4% diagnostic accuracy on benchmarks, comparable to human clinicians at 88.8%, and showed significant improvements over traditional AI models.

AIBearishThe Verge – AI · Mar 116/10
🧠

Grammarly says it will stop using AI to clone experts without permission

Grammarly has disabled its AI 'Expert Review' feature that generated writing suggestions claiming to be 'inspired by' real writers without their permission, including journalists from The Verge. The company acknowledged they 'missed the mark' and plans to redesign the feature to give experts control over their representation.

Grammarly says it will stop using AI to clone experts without permission
AIBullishFortune Crypto · Mar 116/10
🧠

How AI is about to transform the C-suite for small businesses

Mastercard is launching an AI-powered virtual CFO solution, marking a significant step in how artificial intelligence will transform executive-level financial management for small businesses. This development represents the growing integration of AI tools into core business operations and decision-making processes.

How AI is about to transform the C-suite for small businesses
AIBullisharXiv – CS AI · Mar 116/10
🧠

Evaluate-as-Action: Self-Evaluated Process Rewards for Retrieval-Augmented Agents

Researchers propose EvalAct, a new method that improves retrieval-augmented AI agents by converting retrieval quality assessment into explicit actions and using Process-Calibrated Advantage Rescaling (PCAR) for optimization. The approach shows superior performance on multi-step reasoning tasks across seven open-domain QA benchmarks by providing better process-level feedback signals.

AIBullisharXiv – CS AI · Mar 116/10
🧠

Automating Forecasting Question Generation and Resolution for AI Evaluation

Researchers developed an automated system using LLM-powered web research agents to generate and resolve forecasting questions at scale, creating 1,499 diverse real-world questions with 96% quality rate. The system demonstrates that more advanced AI models perform significantly better at forecasting tasks, with potential applications for improving AI evaluation benchmarks.

🧠 GPT-5🧠 Gemini
AIBearisharXiv – CS AI · Mar 116/10
🧠

Common Sense vs. Morality: The Curious Case of Narrative Focus Bias in LLMs

Researchers have identified a critical flaw in Large Language Models (LLMs) where they prioritize moral reasoning over commonsense understanding, struggling to detect logical contradictions within moral dilemmas. The study introduces the CoMoral benchmark and reveals a 'narrative focus bias' where LLMs better identify contradictions attributed to secondary characters rather than primary narrators.

AIBearishThe Register – AI · Mar 106/10
🧠

Amazon insists AI coding isn't source of outages

The article title suggests Amazon is defending its AI coding systems against claims that they are causing service outages. Without the full article content, the specific details of Amazon's response and the nature of the outages cannot be analyzed.

AINeutralMicrosoft Research Blog · Mar 106/10
🧠

From raw interaction to reusable knowledge: Rethinking memory for AI agents

Microsoft Research highlights a counterintuitive problem where giving AI agents more memory actually reduces their effectiveness. As interaction logs accumulate, they become large, filled with irrelevant content, and difficult to search through, making it harder for agents to find relevant information for current tasks.

AIBullishFortune Crypto · Mar 106/10
🧠

Something big is changing in auditing

According to Steve Soter, VP at Workiva, the auditing industry is experiencing a significant transformation with the emergence of 'AI Auditors.' This represents a major shift in how auditing processes are being conducted and automated.

Something big is changing in auditing
AIBullishAI News · Mar 96/10
🧠

City Union Bank launches AI centre to support banking operations

City Union Bank in India has established a Centre of Excellence for Artificial Intelligence through a four-party agreement to test AI solutions on real banking problems. This represents a shift from banks simply purchasing analytics tools to building internal AI testing environments for direct application to banking operations.

AINeutralWired – AI · Mar 96/10
🧠

Can AI Kill the Venture Capitalist?

The article explores whether artificial intelligence could disrupt the venture capital industry itself, even as VCs heavily invest in AI technologies across other sectors. It raises questions about VCs' preparedness for AI to potentially transform their own business model and investment processes.

Can AI Kill the Venture Capitalist?
AIBullisharXiv – CS AI · Mar 96/10
🧠

Artificial Intelligence for Detecting Fetal Orofacial Clefts and Advancing Medical Education

Researchers developed an AI system that can detect fetal orofacial clefts in ultrasound images with over 93% sensitivity and 95% specificity, matching senior radiologist performance. The system was trained on 45,139 ultrasound images from 9,215 fetuses across 22 hospitals and can also improve junior radiologist diagnostic accuracy by 6%.

🏢 Microsoft
← PrevPage 13 of 30Next →