y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#llm-applications News & Analysis

23 articles tagged with #llm-applications. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

23 articles
AIBullishOpenAI News · 4d ago7/10
🧠

How Endava builds an agentic organization with Codex

Endava leverages Codex to transform into an agentic organization, enabling AI-driven automation of software development workflows. The approach dramatically accelerates delivery timelines and compresses requirements analysis from weeks to mere hours, signaling a shift toward AI-augmented enterprise operations.

AIBullisharXiv – CS AI · 5d ago7/10
🧠

GENESIS: Harnessing AI Agents for Autonomous 6G RAN Synthesis, Research, and Testing

GENESIS is an AI framework that automates the research and development of 6G cellular networks by converting specifications and research into validated production code through over-the-air testing. The system addresses critical limitations of LLMs in radio access networks by combining AI agents with persistent knowledge management and real-world hardware validation rather than relying solely on simulations.

AIBullisharXiv – CS AI · May 97/10
🧠

Agentic Retrieval-Augmented Generation for Financial Document Question Answering

Researchers introduce FinAgent-RAG, an advanced AI framework designed to answer complex financial questions by combining iterative retrieval, reasoning, and self-verification. The system achieves 76-78% accuracy on financial benchmarks while reducing computational costs by 41%, demonstrating practical viability for institutional financial analysis.

AINeutralarXiv – CS AI · Mar 56/10
🧠

Belief-Sim: Towards Belief-Driven Simulation of Demographic Misinformation Susceptibility

Researchers introduce BeliefSim, a framework that uses Large Language Models to simulate how different demographic groups are susceptible to misinformation based on their underlying beliefs. The system achieves up to 92% accuracy in predicting misinformation susceptibility by incorporating psychology-informed belief profiles.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

From GPS Points to Travel Patterns: Flexible and Semantic Trajectory Generation with LLMs

Researchers propose HTP, an LLM-based framework that generates realistic urban trajectories by first synthesizing travel patterns and then GPS points, addressing privacy concerns in smart city applications. The method outperforms existing approaches by 29.78% and can generate variable-length trajectories under multiple conditions, advancing synthetic data generation for urban analytics.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

Automating Low-Risk Code Review at Meta: RADAR, Risk Calibration, and Review Efficiency

Meta's RADAR system automates low-risk code review at scale, processing 535K+ diffs and landing 331K+ changes while maintaining safety metrics significantly better than human review. The system addresses a critical bottleneck where AI-driven code generation has outpaced reviewer capacity, reducing review time by 330% while keeping revert and incident rates substantially lower than non-automated diffs.

AIBullisharXiv – CS AI · 4d ago6/10
🧠

Fine-Tuned LLM as a Complementary Predictor Improving Ads System

Researchers demonstrate a novel approach to advertising systems by using fine-tuned large language models as complementary predictors for advertiser forecasting rather than traditional ranking roles. Deployed in production-scale environments, this method improves candidate generation and downstream ranking by leveraging LLM knowledge to predict likely advertisers from user data, delivering measurable offline and online business improvements.

AINeutralarXiv – CS AI · 5d ago6/10
🧠

CitePrism: Human-in-the-Loop AI for Citation Auditing and Editorial Integrity

CitePrism introduces a human-in-the-loop AI framework designed to assist editors and reviewers in auditing manuscript citations for relevance, accuracy, and ethical appropriateness. The system combines large language models, semantic similarity analysis, and metadata verification to flag potentially problematic citations, achieving moderate agreement with human reviewers in preliminary testing on a pavement engineering manuscript.

AINeutralMIT Technology Review · May 226/10
🧠

The Download: coding’s future, the ‘Steroid Olympics,’ and AI-driven science

Anthropic showcased Code with Claude at its London developer event, demonstrating AI-driven coding capabilities that represent a significant evolution in how developers will write and ship software. The event highlighted practical applications of large language models in software development workflows, raising questions about the future role of traditional coding practices.

🏢 Anthropic🧠 Claude
AIBullishGoogle DeepMind Blog · May 126/10
🧠

Co-Scientist: A multi-agent AI partner to accelerate research

Google has introduced Co-Scientist, a multi-agent AI system built on Gemini designed to assist researchers in accelerating scientific discovery. The tool represents a significant step in applying large language models to collaborative research workflows, potentially transforming how scientists approach complex problems.

Co-Scientist: A multi-agent AI partner to accelerate research
🧠 Gemini
AINeutralarXiv – CS AI · May 126/10
🧠

Teacher-Aware Evolution of Heuristic Programs from Learned Optimization Policies

Researchers propose a teacher-aware evolutionary framework that leverages pre-trained learned optimization policies to guide the automatic design of heuristic programs for combinatorial optimization problems. The method uses behavioral feedback from teacher policies during evolution rather than relying solely on endpoint performance, achieving better results than baseline LLM-driven approaches without requiring neural inference at deployment.

AINeutralarXiv – CS AI · May 126/10
🧠

LASSA Architecture-Based Autonomous Fault-Tolerant Control of Unmanned Underwater Vehicles

Researchers propose LASSA, an LLM-based autonomous control architecture for unmanned underwater vehicles that combines large language models with physical constraint verification to enable fault-tolerant operation in communication-limited environments. Lake experiments demonstrate the system successfully detects faults, replans missions, and maintains operational safety without false alarms.

AINeutralarXiv – CS AI · May 116/10
🧠

ARMOR: An Agentic Framework for Reaction Feasibility Prediction via Adaptive Utility-aware Multi-tool Reasoning

Researchers introduce ARMOR, an agentic framework that improves chemical reaction feasibility prediction by intelligently combining multiple AI tools rather than relying on single models. The system uses hierarchical tool organization and memory-augmented reasoning to resolve conflicting predictions, demonstrating significant performance gains especially when different tools disagree on outcomes.

AIBullisharXiv – CS AI · May 96/10
🧠

Knowledge Graphs, the Missing Link in Agentic AI-based Formal Verification

Researchers propose a Knowledge Graph-based approach to improve AI-assisted formal verification of hardware designs, addressing the challenge of generating accurate SystemVerilog Assertions from natural-language specifications. By structuring design information from RTL code, specifications, and tool feedback into a queryable knowledge graph, the method achieves higher compilation success rates and formal coverage (78.5%-99.4%) while reducing syntax errors, though complex temporal reasoning remains challenging.

AINeutralarXiv – CS AI · May 16/10
🧠

LLM as Clinical Graph Structure Refiner: Enhancing Representation Learning in EEG Seizure Diagnosis

Researchers propose using large language models as graph structure refiners to improve EEG-based seizure detection by identifying and removing redundant connections in noisy neural signal data. A two-stage framework combining Transformer-based edge prediction with LLM validation demonstrates improved accuracy and more interpretable graph representations on the TUSZ dataset.

AIBullisharXiv – CS AI · Apr 156/10
🧠

INFORM-CT: INtegrating LLMs and VLMs FOR Incidental Findings Management in Abdominal CT

Researchers propose INFORM-CT, an AI framework combining large language models and vision-language models to automate detection and reporting of incidental findings in abdominal CT scans. The system uses a planner-executor approach that outperforms traditional manual inspection and existing pure vision-based models in accuracy and efficiency.

AIBullisharXiv – CS AI · Apr 146/10
🧠

Learning Preference-Based Objectives from Clinical Narratives for Sequential Treatment Decision-Making

Researchers propose Clinical Narrative-informed Preference Rewards (CN-PR), a machine learning framework that extracts reward signals from patient discharge summaries to train reinforcement learning models for treatment decisions. The approach achieves strong alignment with clinical outcomes, including improved organ support-free days and faster shock resolution, offering a scalable alternative to traditional reward design in healthcare AI.

AIBullisharXiv – CS AI · Mar 266/10
🧠

PASTA: A Scalable Framework for Multi-Policy AI Compliance Evaluation

Researchers have developed PASTA, a scalable AI compliance evaluation framework that can assess multiple policies simultaneously using LLM-powered analysis. The system evaluates five major AI policies in under two minutes for approximately $3, with expert validation showing strong alignment with human judgment.

AINeutralarXiv – CS AI · Mar 44/102
🧠

A Natural Language Agentic Approach to Study Affective Polarization

Researchers developed a multi-agent platform using large language models to study affective polarization in social media through virtual communities. The framework addresses limitations of real-world studies by creating simulated environments where AI agents engage in discussions to analyze political and social divisions.

AIBullisharXiv – CS AI · Mar 35/1011
🧠

Demonstrating ViviDoc: Generating Interactive Documents through Human-Agent Collaboration

ViviDoc is a new human-agent collaborative system that generates interactive educational documents using a multi-agent pipeline and Document Specification framework. The system allows educators to review and refine AI-generated content plans before code production, significantly outperforming naive AI generation methods.

$RNDR