y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#large-language-models News & Analysis

Over the past month, coverage of #large-language-models has grown significantly, with 100 articles published in the last 30 days out of 273 total indexed pieces. The discussion landscape shows predominantly neutral sentiment at 59%, though bullish perspectives account for 37% of coverage. Notably, sentiment has softened compared to the prior quarter, declining 14.2 percentage points in bullish tone. ArXiv's computer science and AI section dominates source coverage, with Llama, Gemini, and GPT-4 emerging as the most frequently discussed models. Scan the articles below for recent developments and perspectives on the topic.

sentiment · last 30d (100 articles) · -14.2pp bullish vs prior 90d
Top sources:arXiv – CS AI · 254Crypto Briefing · 2TechCrunch – AI · 2IEEE Spectrum – AI · 1Decrypt · 1
Most-discussed entities:Llama · 7Gemini · 6GPT-4 · 6Claude · 4Anthropic · 4
416 articles
AIBullisharXiv – CS AI · Mar 26/1013
🧠

FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA

Researchers propose FedRot-LoRA, a new framework that solves rotational misalignment issues in federated learning for large language models. The solution uses orthogonal transformations to align client updates before aggregation, improving training stability and performance without increasing communication costs.

AIBullisharXiv – CS AI · Feb 275/104
🧠

Mapping the Landscape of Artificial Intelligence in Life Cycle Assessment Using Large Language Models

Researchers conducted a comprehensive review of artificial intelligence applications in life cycle assessment (LCA) using large language models to analyze trends and patterns. The study found dramatic growth in AI adoption for environmental assessments, with a notable shift toward LLM-driven approaches and strong correlations between AI methods and LCA stages.

AINeutralarXiv – CS AI · Feb 275/102
🧠

Cognitive Models and AI Algorithms Provide Templates for Designing Language Agents

Researchers propose using cognitive models and AI algorithms as templates for designing modular language agents that combine multiple large language models. The position paper formalizes agent templates that specify roles for individual LLMs and how their functionalities should be composed to solve complex problems beyond single model capabilities.

AIBullisharXiv – CS AI · Feb 276/106
🧠

RLHFless: Serverless Computing for Efficient RLHF

Researchers introduce RLHFless, a serverless computing framework for Reinforcement Learning from Human Feedback (RLHF) that addresses resource inefficiencies in training large language models. The system achieves up to 1.35x speedup and 44.8% cost reduction compared to existing solutions by dynamically adapting to resource demands and optimizing workload distribution.

AIBullisharXiv – CS AI · Feb 276/105
🧠

SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs

Researchers introduce SoPE (Spherical Coordinate-based Positional Embedding), a new method that enhances 3D Large Vision-Language Models by mapping point-cloud data into spherical coordinate space. This approach overcomes limitations of existing Rotary Position Embedding (RoPE) by better preserving spatial structures and directional variations in 3D multimodal understanding.

AIBullisharXiv – CS AI · Feb 276/108
🧠

G-reasoner: Foundation Models for Unified Reasoning over Graph-structured Knowledge

Researchers introduce G-reasoner, a unified framework combining graph and language foundation models to enable better reasoning over structured knowledge. The system uses a 34M-parameter graph foundation model with QuadGraph abstraction to outperform existing retrieval-augmented generation methods across six benchmarks.

AINeutralIEEE Spectrum – AI · Feb 116/104
🧠

How Can AI Companions Be Helpful, not Harmful?

AI companions are becoming increasingly popular due to advances in large language models, but research from UT Austin highlights potential harms including reduced well-being, disconnection from the physical world, and commitment burden on users. While AI companions may offer benefits like addressing loneliness and building social skills, researchers emphasize the need to establish harm pathways early to guide better design and prevent negative outcomes.

AIBullishHugging Face Blog · Sep 136/104
🧠

Fine-tuning Llama 2 70B using PyTorch FSDP

The article discusses fine-tuning Meta's Llama 2 70B large language model using PyTorch's Fully Sharded Data Parallel (FSDP) technique. This approach enables efficient training of large AI models by distributing parameters across multiple GPUs, making advanced AI model customization more accessible.

AINeutralarXiv – CS AI · Apr 205/10
🧠

Analyzing Chain of Thought (CoT) Approaches in Control Flow Code Deobfuscation Tasks

Researchers demonstrate that Chain-of-Thought prompting significantly improves large language models' ability to deobfuscate control flow code, with GPT-5 achieving 16-20% performance gains over zero-shot prompting. The approach offers a potential alternative to expensive manual reverse engineering, though practical deployment remains limited to research benchmarks.

🧠 GPT-5
AINeutralarXiv – CS AI · Apr 75/10
🧠

Discrete Prototypical Memories for Federated Time Series Foundation Models

Researchers propose FeDPM, a federated learning framework that addresses semantic misalignment issues when using Large Language Models for time series analysis. The system uses discrete prototypical memories to better handle cross-domain time-series data while preserving privacy in distributed settings.

AINeutralarXiv – CS AI · Mar 265/10
🧠

Cluster-R1: Large Reasoning Models Are Instruction-following Clustering Agents

Researchers have developed Cluster-R1, a new approach that trains large reasoning models (LRMs) as autonomous clustering agents capable of following instructions and inferring optimal cluster structures. The method reframes instruction-following clustering as a generative task and demonstrates superior performance over traditional embedding-based methods across 28 diverse tasks in the ReasonCluster benchmark.

AINeutralarXiv – CS AI · Mar 264/10
🧠

From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs

Researchers developed a new training framework to address contextual exposure bias in Speech-LLMs, where models trained on perfect conversation history perform poorly with error-prone real-world context. Their approach combines teacher error knowledge, context dropout, and direct preference optimization to improve robustness, achieving WER reductions from 5.59% to 5.17% on TED-LIUM 3.

AINeutralarXiv – CS AI · Mar 264/10
🧠

A Comprehensive Survey on Enterprise Financial Risk Analysis from Big Data and LLMs Perspective

A comprehensive survey paper examines enterprise financial risk analysis from Big Data and large language models perspectives, systematizing existing research methods and identifying future investigation directions. The paper addresses gaps in current surveys by providing a holistic synthesis of AI-driven approaches to financial risk prediction.

AIBullisharXiv – CS AI · Mar 174/10
🧠

LAMB: LLM-based Audio Captioning with Modality Gap Bridging via Cauchy-Schwarz Divergence

Researchers have developed LAMB, a new AI framework that improves automated audio captioning by better aligning audio features with large language models through Cauchy-Schwarz divergence optimization. The system achieved state-of-the-art performance on AudioCaps dataset by bridging the modality gap between audio and text embeddings.

AIBullisharXiv – CS AI · Mar 115/10
🧠

GenePlan: Evolving Better Generalized PDDL Plans using Large Language Models

Researchers present GenePlan, a framework that uses large language models with evolutionary algorithms to generate domain-specific planners for classical planning tasks in PDDL. The system achieved a 0.91 SAT score across eight benchmark domains, nearly matching state-of-the-art performance while significantly outperforming other LLM-based approaches.

🧠 GPT-4
AINeutralarXiv – CS AI · Mar 114/10
🧠

RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation

Researchers propose RbtAct, a novel approach that uses peer review rebuttals as supervision to train AI models for generating more actionable scientific review feedback. The system leverages a new dataset RMR-75K and fine-tuned Llama-3.1-8B model to produce focused, implementable guidance rather than superficial comments.

🧠 Llama
AINeutralarXiv – CS AI · Mar 64/10
🧠

A unified foundational framework for knowledge injection and evaluation of Large Language Models in Combustion Science

Researchers developed the first comprehensive framework for creating domain-specialized Large Language Models for combustion science, using 3.5 billion tokens from scientific literature and code. The study found that standard RAG approaches hit a performance ceiling at 60% accuracy, highlighting the need for more advanced knowledge injection methods including knowledge graphs and continued pretraining.

AINeutralarXiv – CS AI · Mar 54/10
🧠

Rethinking Role-Playing Evaluation: Anonymous Benchmarking and a Systematic Study of Personality Effects

Researchers propose an anonymous evaluation method for Role-Playing Agents (RPAs) built on large language models, revealing that current benchmarks are biased by character name recognition. The study shows that incorporating personality traits, whether human-annotated or self-generated by AI models, significantly improves role-playing performance under anonymous conditions.

AINeutralarXiv – CS AI · Mar 54/10
🧠

Causality Elicitation from Large Language Models

Researchers propose a new pipeline to extract causal relationships from large language models by sampling documents, identifying events, and using causal discovery methods. The approach aims to reveal the causal hypotheses that LLMs assume rather than establishing real-world causality.

AINeutralarXiv – CS AI · Mar 54/10
🧠

HAMLET: A Hierarchical and Adaptive Multi-Agent Framework for Live Embodied Theatrics

Researchers have developed HAMLET, a hierarchical multi-agent AI framework that creates immersive, interactive theatrical experiences using large language models. The system generates narrative blueprints from simple topics and enables AI actors to perform with adaptive reasoning, emotional states, and physical interactions with scene props.

AINeutralarXiv – CS AI · Mar 44/102
🧠

A Natural Language Agentic Approach to Study Affective Polarization

Researchers developed a multi-agent platform using large language models to study affective polarization in social media through virtual communities. The framework addresses limitations of real-world studies by creating simulated environments where AI agents engage in discussions to analyze political and social divisions.

AIBullisharXiv – CS AI · Mar 35/105
🧠

Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Industrial Sales Agents

Researchers propose Dual-Horizon Credit Assignment (DuCA), a new framework for optimizing large language models in industrial sales applications. The method addresses training instability by separately normalizing short-term linguistic rewards and long-term commercial rewards, achieving 6.82% improvement in conversion rates while reducing repetition and detection issues.

← PrevPage 16 of 17Next →