y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#research News & Analysis

907 articles tagged with #research. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

907 articles
AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

Social Norm Reasoning in Multimodal Language Models: An Evaluation

Researchers evaluated five Multimodal Large Language Models (MLLMs) on their ability to reason about social norms in both text and image scenarios. GPT-4o performed best overall, while all models showed superior performance with text-based norm reasoning compared to image-based scenarios.

๐Ÿง  GPT-4
AINeutralarXiv โ€“ CS AI ยท Mar 53/10
๐Ÿง 

A novel network for classification of cuneiform tablet metadata

Researchers developed a novel neural network architecture for classifying cuneiform tablet metadata using point-cloud representations. The convolution-inspired approach outperformed existing transformer-based methods like Point-BERT by gradually down-scaling point clouds while integrating local and global information.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

TFWaveFormer: Temporal-Frequency Collaborative Multi-level Wavelet Transformer for Dynamic Link Prediction

Researchers propose TFWaveFormer, a novel Transformer architecture that combines temporal-frequency analysis with multi-resolution wavelet decomposition for dynamic link prediction. The framework achieves state-of-the-art performance on benchmark datasets by better capturing complex multi-scale temporal dynamics in applications like social networks and financial modeling.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

DQE-CIR: Distinctive Query Embeddings through Learnable Attribute Weights and Target Relative Negative Sampling in Composed Image Retrieval

Researchers propose DQE-CIR, a new method for composed image retrieval that improves AI's ability to find images based on reference images and text modifications. The approach addresses limitations in current contrastive learning frameworks by using learnable attribute weights and target relative negative sampling to create more distinctive query embeddings.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

MOO: A Multi-view Oriented Observations Dataset for Viewpoint Analysis in Cattle Re-Identification

Researchers introduced MOO, a large-scale synthetic dataset of 1,000 cattle individuals captured from 128 viewpoints to improve animal re-identification across different viewing angles. The dataset addresses critical challenges in aerial-ground re-identification by providing precise angular annotations and demonstrates effective transfer to real-world applications.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

MuRAL: A Multi-Resident Ambient Sensor Dataset Annotated with Natural Language for Activities of Daily Living

Researchers have released MuRAL, a new dataset containing over 21 hours of multi-resident smart home sensor data with natural language annotations for training AI models. The dataset aims to improve Large Language Models' ability to understand human activities in complex smart home environments, though current LLMs still struggle with key tasks like resident identification and activity prediction.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

RLJP: Legal Judgment Prediction via First-Order Logic Rule-enhanced with Large Language Models

Researchers propose RLJP, a new framework for Legal Judgment Prediction that combines first-order logic rules with large language models to improve AI-based legal decision making. The system uses a three-stage approach including Confusion-aware Contrastive Learning to dynamically optimize judgment rules and showed superior performance on public datasets.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

HAMLET: A Hierarchical and Adaptive Multi-Agent Framework for Live Embodied Theatrics

Researchers have developed HAMLET, a hierarchical multi-agent AI framework that creates immersive, interactive theatrical experiences using large language models. The system generates narrative blueprints from simple topics and enables AI actors to perform with adaptive reasoning, emotional states, and physical interactions with scene props.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

Self-Supervised Inductive Logic Programming

Researchers developed a new self-supervised Inductive Logic Programming approach called Poker that can learn recursive logic programs without requiring expert-crafted negative examples or problem-specific background theories. The system automatically generates and labels new training examples during learning, showing improved performance over existing methods when negative examples are unavailable.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization

Researchers present AutoQD, a new AI method that automatically discovers diverse behavioral policies without requiring hand-crafted descriptors. The approach uses mathematical embeddings of policy occupancy measures to enable Quality-Diversity optimization algorithms to find varied high-performing solutions in reinforcement learning tasks.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

Q-Guided Stein Variational Model Predictive Control via RL-informed Policy Prior

Researchers have developed Q-SVMPC, a new Model Predictive Control method that combines reinforcement learning with Stein variational inference to improve trajectory optimization. The approach addresses limitations in existing MPC methods that often converge to single solutions, instead maintaining diverse solution paths for better performance in robotics applications.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

When Relevance Meets Novelty: Dual-Stable Periodic Optimization for Serendipitous Recommendation

Researchers propose Co-Evolutionary Alignment (CoEA), a new recommendation system method that uses dual large language models to balance relevant and novel content suggestions. The system addresses traditional recommendation bias through dynamic optimization that considers both long-term group identity and short-term individual preferences.

AIBullisharXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

LadderSym: A Multimodal Interleaved Transformer for Music Practice Error Detection

Researchers introduced LadderSym, a new Transformer-based AI method for detecting music practice errors that significantly outperforms existing approaches. The system uses multimodal processing of audio and symbolic music scores, more than doubling accuracy for detecting missed notes and improving extra note detection by 14.4 points.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

MuSaG: A Multimodal German Sarcasm Dataset with Full-Modal Annotations

Researchers have released MuSaG, the first German multimodal sarcasm detection dataset featuring 33 minutes of annotated television content with text, audio, and video data. The study reveals a significant gap between human sarcasm detection (which relies heavily on audio cues) and current AI models (which perform best with text).

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

CareMedEval dataset: Evaluating Critical Appraisal and Reasoning in the Biomedical Field

Researchers introduce CareMedEval, a new dataset with 534 questions based on 37 scientific articles to evaluate large language models' ability to perform critical appraisal in biomedical contexts. Testing reveals current AI models struggle with this specialized reasoning task, achieving only 0.5 exact match rates even with advanced prompting techniques.

AINeutralarXiv โ€“ CS AI ยท Mar 54/10
๐Ÿง 

Improving Multi-View Reconstruction via Texture-Guided Gaussian-Mesh Joint Optimization

Researchers propose a novel framework for 3D object reconstruction from multi-view images that simultaneously optimizes mesh geometry and appearance through Gaussian-guided rendering. The unified approach addresses limitations of existing methods that separate geometry and appearance optimization, enabling better downstream editing tasks like relighting and shape deformation.

AINeutralarXiv โ€“ CS AI ยท Mar 44/103
๐Ÿง 

Revealing Positive and Negative Role Models to Help People Make Good Decisions

Researchers present a framework for social planners to strategically reveal positive and negative role models to influence agent behavior in social networks. The study addresses optimization challenges when disclosure budgets are limited and proposes algorithms to maximize social welfare while maintaining fairness across different groups.

AINeutralarXiv โ€“ CS AI ยท Mar 44/103
๐Ÿง 

AnchorDrive: LLM Scenario Rollout with Anchor-Guided Diffusion Regeneration for Safety-Critical Scenario Generation

Researchers have developed AnchorDrive, a two-stage AI framework that combines large language models with diffusion models to generate realistic safety-critical scenarios for autonomous driving systems. The system uses LLMs for controllable scenario generation based on natural language instructions, then employs diffusion models to create realistic driving trajectories.

AIBullisharXiv โ€“ CS AI ยท Mar 44/102
๐Ÿง 

FEAST: Retrieval-Augmented Multi-Hierarchical Food Classification for the FoodEx2 System

Researchers developed FEAST, a new AI framework that improves food classification accuracy for Europe's FoodEx2 system by 12-38% on rare food categories. The system uses retrieval-augmented learning to better classify complex food descriptions into standardized codes used for food safety monitoring across Europe.

AINeutralarXiv โ€“ CS AI ยท Mar 44/102
๐Ÿง 

No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

Researchers developed CDD (Contamination Detection via output Distribution) to identify data contamination in small language models by measuring output peakedness. The study found that CDD only works when fine-tuning produces verbatim memorization, failing at chance level with parameter-efficient methods like low-rank adaptation that avoid memorization.

AINeutralarXiv โ€“ CS AI ยท Mar 44/103
๐Ÿง 

On the Parameter Estimation of Sinusoidal Models for Speech and Audio Signals

Research paper compares three sinusoidal models for speech and audio signal processing: standard Sinusoidal Model (SM), Exponentially Damped Sinusoidal Model (EDSM), and extended adaptive Quasi-Harmonic Model (eaQHM). The study finds eaQHM performs better for medium-to-large window analysis while EDSM excels with smaller analysis windows, suggesting future research should combine both approaches.

AINeutralarXiv โ€“ CS AI ยท Mar 44/103
๐Ÿง 

GLEAN: Grounded Lightweight Evaluation Anchors for Contamination-Aware Tabular Reasoning

Researchers propose GLEAN, a new evaluation protocol for testing small AI models on tabular reasoning tasks while addressing contamination and hardware constraints. The framework reveals distinct error patterns between different models and provides diagnostic tools for more reliable evaluation under limited computational resources.

AINeutralarXiv โ€“ CS AI ยท Mar 44/102
๐Ÿง 

A Benchmark Analysis of Graph and Non-Graph Methods for Caenorhabditis Elegans Neuron Classification

Researchers conducted a benchmark study comparing graph neural networks (GNNs) against traditional methods for classifying neurons in C. elegans worms. The study found that attention-based GNNs significantly outperformed baseline methods when using spatial and connection features, validating the effectiveness of graph-based approaches for biological neural network analysis.