#emotion-recognition News & Analysis

12 articles tagged with #emotion-recognition. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

12 articles

AINeutralarXiv – CS AI · 3d ago6/10

🧠

A Shared Valence Axis Across Modern LLMs and Human EEG: The Saturation Regularity

Researchers demonstrate that Large Language Models and human brain activity share a common valence (emotional) axis, with LLMs trained on emotion-evocative sentences producing representations that align with EEG patterns across 123 subjects. However, directly supervising neural networks to match this axis paradoxically degrades performance, leading to a discovery called the 'saturation regularity' that suggests optimal brain decoding requires ensemble methods leveraging residual diversity rather than additional constraint-based training.

AINeutralarXiv – CS AI · 3d ago6/10

🧠

UF-AMA: A unified framework for cross-domain emotion recognition via adaptive multimodal alignment

Researchers introduce UF-AMA, a unified framework for cross-domain emotion recognition using multimodal physiological signals like EEG and eye-tracking data. The model employs adaptive alignment mechanisms and multi-level domain adaptation to achieve state-of-the-art performance in cross-subject and cross-session emotion recognition tasks.

AINeutralarXiv – CS AI · 3d ago5/10

🧠

Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty

A new study demonstrates that upper-face affective cues significantly enhance audiovisual speech recognition systems when audio quality degrades, particularly in noisy environments. Rather than encoding linguistic content directly, emotional facial expressions improve model calibration and robustness, suggesting that human communication relies on socially expressive signals beyond traditional mouth-region visual cues.

AIBullisharXiv – CS AI · 3d ago6/10

🧠

Dive into Waves: Morlet Spectral Transformer for Cross-Subject Emotion Decoding from EEG

Researchers propose Morlet Spectral Transformer (MST), a novel neural network architecture for detecting emotions from EEG brain signals across different subjects. The method outperforms larger pretrained models by using specialized wavelet-based signal processing and frequency-specific spatial analysis, demonstrating that intelligent representation design can replace computationally expensive pretraining approaches.

AIBullisharXiv – CS AI · May 296/10

🧠

E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving

Researchers introduce E3AD, an emotion-aware vision-language-action model that enhances autonomous driving systems by interpreting passenger emotional states alongside driving commands. The framework combines semantic understanding with emotion detection (Valence-Arousal-Dominance model) and dual-pathway spatial reasoning to improve both trajectory planning and human-vehicle comfort alignment.

AINeutralarXiv – CS AI · May 286/10

🧠

Resource-Constrained Affect Modelling via Variance Regularisation Pruning

Researchers introduce Variance-Regularised Pruning (VR), a neural network pruning technique that reduces model size while maintaining robust performance across diverse users. The method balances computational efficiency with cross-participant stability in affective computing systems, achieving 80% sparsity without sacrificing reliability on the AGAIN emotion recognition dataset.

AINeutralarXiv – CS AI · May 286/10

🧠

SMILE-Next: Teaching Large Language Models to Detect, Classify, and Reason about Laughter

Researchers introduce SMILE-Next, a comprehensive dataset and specialized large language model framework for understanding laughter in real-world contexts. The work combines laughter detection, classification, and reasoning tasks with novel training techniques including laughter-specific self-instruction and a mixture-of-experts architecture to improve multimodal language model performance on this underexplored domain.

AINeutralarXiv – CS AI · Apr 146/10

🧠

HumanVBench: Probing Human-Centric Video Understanding in MLLMs with Automatically Synthesized Benchmarks

Researchers introduced HumanVBench, a comprehensive benchmark for evaluating how well multimodal AI models understand human-centric video content across 16 tasks including emotion recognition and speech-visual alignment. The study evaluated 30 leading MLLMs and found significant performance gaps, even among top proprietary models, while introducing automated synthesis pipelines to enable scalable benchmark creation with minimal human effort.

AINeutralarXiv – CS AI · Apr 106/10

🧠

A-MBER: Affective Memory Benchmark for Emotion Recognition

Researchers introduce A-MBER, a benchmark dataset designed to evaluate AI assistants' ability to recognize emotions based on long-term interaction history rather than immediate context. The benchmark tests whether models can retrieve relevant past interactions, infer current emotional states, and provide grounded explanations—revealing that memory's value lies in selective, context-aware interpretation rather than simple historical volume.

AINeutralarXiv – CS AI · Mar 124/10

🧠

AMB-DSGDN: Adaptive Modality-Balanced Dynamic Semantic Graph Differential Network for Multimodal Emotion Recognition

Researchers propose AMB-DSGDN, a new AI system for multimodal emotion recognition that uses adaptive modality balancing and differential graph attention mechanisms. The system addresses limitations in existing approaches by filtering noise and preventing dominant modalities from overwhelming the fusion process in text, speech, and visual data.

AINeutralarXiv – CS AI · Mar 114/10

🧠

VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs

Researchers introduce VoxEmo, a comprehensive benchmark for evaluating Speech Large Language Models on emotion recognition tasks across 35 emotion corpora and 15 languages. The benchmark addresses evaluation challenges in open text generation and introduces novel protocols that better align with human subjective emotion perception.

AINeutralarXiv – CS AI · Mar 54/10

🧠

A benchmark for joint dialogue satisfaction, emotion recognition, and emotion state transition prediction

Researchers have created a new multi-task Chinese dialogue dataset that enables prediction of user satisfaction, emotion recognition, and emotional state transitions across multiple conversation turns. The dataset addresses limitations in existing Chinese resources and aims to improve understanding of how user emotions evolve during interactions to better predict satisfaction.