Spread Your Wings: Falcon 180B is here
The title announces Falcon 180B, a 180-billion-parameter large language model from the Technology Innovation Institute (TII); however, the article body is empty or unavailable, so no further analysis is possible.
The article discusses Hugging Face's open-source text generation and large language model ecosystem. However, no article body content was provided for detailed analysis.
The article discusses how to run a ChatGPT-like chatbot on a single GPU using ROCm (Radeon Open Compute). This approach makes large language model deployment more accessible by reducing hardware requirements.
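The summary does not name a specific software stack, so the following is only a minimal sketch of the general recipe: an open chat model loaded in half precision through Hugging Face transformers on a ROCm build of PyTorch, which exposes AMD GPUs through the usual torch.cuda interface. The model name is an illustrative placeholder, not necessarily the one the article uses.

```python
# Minimal sketch: chat loop on a single AMD GPU via a ROCm build of PyTorch.
# Assumes transformers and a ROCm-enabled torch are installed; the model
# name below is a placeholder, not necessarily the one the article uses.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"  # illustrative choice

# ROCm builds of PyTorch report AMD GPUs through the torch.cuda API.
device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16  # half precision to fit one GPU
).to(device)

history = []
while True:
    user = input("you> ")
    if user.strip().lower() in {"quit", "exit"}:
        break
    history.append({"role": "user", "content": user})
    # Chat-template the running history into a single prompt string.
    prompt = tokenizer.apply_chat_template(
        history, tokenize=False, add_generation_prompt=True
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    output = model.generate(**inputs, max_new_tokens=256, do_sample=True)
    reply = tokenizer.decode(
        output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
    history.append({"role": "assistant", "content": reply})
    print("bot>", reply)
```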
The article title references StarCoder, the BigCode project's open large language model specialized for code generation and programming tasks. However, the article body is empty, preventing detailed analysis of the model's capabilities, features, or market implications.
Researchers introduce CARO (Confusion-Aware Rubric Optimization), a new framework that improves LLM-based automated grading by using confusion matrices to separate and fix specific error patterns instead of aggregating all errors together. This approach avoids conflicting constraints and significantly outperforms existing methods on teacher-education and STEM datasets.
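CARO's actual algorithm is not reproduced in the summary; the toy sketch below only illustrates the core idea it describes, treating each off-diagonal cell of a grade confusion matrix as a distinct error pattern to fix on its own rather than minimizing one aggregate error. All data and rubric levels here are hypothetical.

```python
# Toy illustration of the idea the summary describes (not CARO's actual
# algorithm): use a confusion matrix between LLM grades and reference
# grades to isolate specific error patterns instead of one aggregate score.
import numpy as np
from sklearn.metrics import confusion_matrix

labels = [0, 1, 2]                       # hypothetical rubric levels
ref  = [2, 1, 0, 2, 1, 1, 0, 2, 2, 1]    # reference (human) grades
pred = [2, 0, 0, 1, 1, 0, 0, 2, 1, 1]    # LLM-assigned grades

cm = confusion_matrix(ref, pred, labels=labels)

# Rank the off-diagonal cells: each one is a distinct confusion pattern
# (e.g. "level 2 graded as level 1") that can get its own rubric fix.
errors = [
    (cm[i, j], labels[i], labels[j])
    for i in range(len(labels))
    for j in range(len(labels))
    if i != j and cm[i, j] > 0
]
for count, true_lvl, pred_lvl in sorted(errors, reverse=True):
    print(f"{count}x: level {true_lvl} misgraded as level {pred_lvl}"
          " -> draft a targeted rubric clarification for this pattern")
```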
Researchers introduce GUIDE, a new framework for improving automated grading of student responses using large language models. The system addresses key limitations in current LLM-based grading by optimizing the selection of training examples and generating better explanations for scoring decisions.
Researchers introduce EMPA, a new framework for evaluating persona-aligned empathy in LLM-based dialogue agents by treating empathetic responses as sustained processes rather than isolated interactions. The system uses controllable scenarios and multi-agent testing to assess long-term empathetic behavior in AI systems.
Researchers analyzed how Large Language Models access semantic memory using the Semantic Fluency Task, finding that LLMs exhibit similar memory foraging patterns to humans. The study reveals convergent and divergent search strategies in LLMs that mirror human cognitive behavior, potentially enabling better human-AI alignment or productive cognitive disalignment.
Researchers present a multi-agent Large Language Model framework for interactive AI planning systems that provides context-dependent explanations to human planners. The system aims to facilitate collaborative decision-making between humans and AI rather than replacing human planners entirely.
Researchers introduce AMPLIFY, an LLM-augmented framework for optimizing shared micromobility vehicle rebalancing in urban transportation systems. The system combines baseline rebalancing algorithms with real-time AI adaptation to handle emergent events like demand surges and regulatory changes, showing improved performance in Chicago e-scooter data testing.
Researchers introduce JutulGPT, an AI agent system for physics-based simulation that addresses the problem of underspecified natural language descriptions in scientific modeling. The system uses an execution-grounded approach where the simulator validates physical accuracy, but reveals limitations in tracking tacit assumptions made through simulator defaults.
Researchers introduce Texterial, a new interaction paradigm that reimagines text as a malleable material that can be sculpted like clay or cultivated like plants in AI-assisted writing tools. The study presents two technical probes demonstrating gestural text refinement and serendipitous idea growth, expanding the design space for LLM-mediated writing interfaces.
Researchers propose RapTB, a new training objective for Generative Flow Networks (GFlowNets) that addresses mode collapse issues in fine-tuning large language models. The method includes a submodular replay strategy (SubM) and demonstrates improved performance in molecule generation tasks while maintaining diversity and validity.
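The summary does not give RapTB's exact formulation; as background, the sketch below shows the standard trajectory-balance (TB) loss that GFlowNet training objectives of this kind start from, with made-up values for a three-step trajectory.

```python
# Background sketch: the standard trajectory-balance (TB) loss that
# GFlowNet objectives like RapTB build on. The summary does not give
# RapTB's exact formulation, so this is only the baseline objective.
import torch

def trajectory_balance_loss(log_z, log_pf, log_pb, log_reward):
    """TB loss for one trajectory tau = s0 -> ... -> x.

    log_z:      learned scalar, log of the partition function Z
    log_pf:     tensor of forward log-probs log P_F(s_{t+1} | s_t)
    log_pb:     tensor of backward log-probs log P_B(s_t | s_{t+1})
    log_reward: scalar log R(x) for the terminal object x
    """
    lhs = log_z + log_pf.sum()
    rhs = log_reward + log_pb.sum()
    return (lhs - rhs) ** 2

# Tiny usage example with made-up values for a three-step trajectory.
log_z = torch.tensor(1.0, requires_grad=True)
loss = trajectory_balance_loss(
    log_z,
    log_pf=torch.log(torch.tensor([0.5, 0.4, 0.9])),
    log_pb=torch.log(torch.tensor([1.0, 0.5, 0.5])),
    log_reward=torch.log(torch.tensor(2.0)),
)
loss.backward()  # gradients flow into log_z (and policy params in practice)
```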
Researchers have developed SSKG Hub, an AI-powered platform that transforms complex sustainability disclosure standards into structured knowledge graphs using large language models and expert validation. The system features automated extraction, expert review processes, and role-based governance to create auditable, provenance-linked knowledge graphs for sustainability standards analysis.
Researchers developed LexChronos, an AI framework that extracts structured event timelines from Indian Supreme Court judgments using a dual-agent architecture. The system achieved 0.8751 F1 score on synthetic data and showed 75% preference over unstructured approaches in legal text summarization tasks.
Researchers developed FLANS, a system using retrieval-augmented generation with open-source smaller language models for the SemEval-2025 multilingual knowledge task. The system creates culturally aware knowledge bases from Wikipedia content and integrates live search capabilities, focusing on privacy and sustainability through smaller LLMs deployed on the Ollama platform.
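The paper's exact pipeline is not detailed here, so the following is only a minimal retrieval-augmented generation sketch against a locally served Ollama model; naive keyword retrieval stands in for the system's Wikipedia-derived knowledge base, and the model name and documents are placeholders.

```python
# Minimal RAG sketch against a local Ollama model (pip install ollama;
# assumes an Ollama server is running). Naive keyword retrieval stands in
# for FLANS's Wikipedia-derived knowledge base; model and docs are
# placeholders, not the system's actual components.
import ollama

knowledge_base = [
    "Diwali is a festival of lights celebrated across India.",
    "Hanami is the Japanese custom of viewing cherry blossoms in spring.",
    "Midsummer is widely celebrated in Sweden with maypole dancing.",
]

def retrieve(question: str, k: int = 2) -> list[str]:
    # Score each document by crude keyword overlap with the question.
    q_words = set(question.lower().split())
    scored = sorted(
        knowledge_base,
        key=lambda d: len(q_words & set(d.lower().split())),
        reverse=True,
    )
    return scored[:k]

def answer(question: str) -> str:
    context = "\n".join(retrieve(question))
    response = ollama.chat(
        model="llama3.2",  # any small local model pulled into Ollama
        messages=[{
            "role": "user",
            "content": f"Context:\n{context}\n\nQuestion: {question}",
        }],
    )
    return response["message"]["content"]

print(answer("What happens during Hanami in Japan?"))
```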
Researchers introduce ARGUS, a framework for studying how narrative features influence persuasion in online arguments. The study analyzes a ChangeMyView corpus using both traditional classifiers and large language models to identify which storytelling elements make arguments more convincing.
Researchers have developed ArgLLM-App, a web-based system that uses Large Language Models for argumentative reasoning in decision-making tasks. The system allows human users to visualize explanations and contest reasoning mistakes, making AI decisions more transparent and contestable.
Researchers propose LLM-hRIC, a new framework that combines large language models with hierarchical radio access network intelligent controllers to improve O-RAN networks. The system uses LLM-powered non-real-time controllers for strategic guidance and reinforcement learning for near-real-time decision making in network management.
Researchers developed a cost-effective method to adapt large language models to minority dialects using continual pre-training and LoRA techniques, successfully improving Quebec French dialect performance with minimal computational resources. The study demonstrates that parameter-efficient fine-tuning can expand quality LLM access to underserved linguistic communities while updating only 1% of model parameters.
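The paper's exact configuration is not given in the summary; the sketch below shows the general LoRA recipe with Hugging Face peft, where only small adapter matrices train while the base model stays frozen. The base model and hyperparameters are illustrative, not the paper's actual Quebec-French setup.

```python
# Minimal sketch of LoRA-based adaptation with Hugging Face peft.
# The base model and hyperparameters here are illustrative, not the
# paper's actual Quebec-French setup.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder base model

lora_cfg = LoraConfig(
    r=8,                        # low-rank adapter dimension
    lora_alpha=16,              # scaling factor
    target_modules=["c_attn"],  # GPT-2's fused attention projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, lora_cfg)

# Only the adapter weights train; the base model stays frozen, which is
# how such methods keep the trainable share near 1% of parameters.
model.print_trainable_parameters()
# -> e.g. "trainable params: 294,912 || all params: 124,734,720 || ~0.24%"
```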
Researchers studied how personality-trait-infused LLM messaging affects user perceptions in behavior change systems. The study found that personality-based personalization works through aggregate exposure patterns rather than individual message optimization, with users rating personality-informed messages as more personalized and appropriate.
The article title suggests a demonstration of using Claude to fine-tune an open-source large language model, but the article body appears to be empty or incomplete; without it, the specific methodology, results, and implications cannot be analyzed.
The article title references AraGen, a new benchmark and leaderboard for evaluating Large Language Models using a 3C3H framework, but the article body is empty. Without content, no meaningful analysis of this LLM evaluation methodology can be provided.
The article discusses the implementation of open-source Large Language Models (LLMs) as agents within the LangChain framework. However, the article body appears to be empty or unavailable, preventing detailed analysis of the specific content and implications.
The article appears to discuss deploying Large Language Models (LLMs) using Hugging Face Inference Endpoints. However, the article body is empty, preventing a complete analysis of the content and specific implementation details.