y0news

#dataset News & Analysis

48 articles tagged with #dataset. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · arXiv – CS AI · Mar 2 · 6/10 · 15

DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation

Researchers introduce DesignSense-10k, a dataset of 10,235 human-annotated preference pairs for evaluating graphic layout generation, along with DesignSense, a specialized AI model that outperforms existing models by 54.6% in layout quality assessment. The framework addresses the gap between AI-generated layouts and human aesthetic preferences, showing practical improvements in layout generation through reinforcement learning.

AI · Neutral · arXiv – CS AI · Mar 2 · 6/10 · 15

LFQA-HP-1M: A Large-Scale Human Preference Dataset for Long-Form Question Answering

Researchers released LFQA-HP-1M, a dataset with 1.3 million human preference annotations for evaluating long-form question answering systems. The study introduces nine quality rubrics and shows that simple linear models can match advanced LLM evaluators while exposing vulnerabilities in current evaluation methods.

AI · Bullish · arXiv – CS AI · Feb 27 · 5/10 · 3

Make It Hard to Hear, Easy to Learn: Long-Form Bengali ASR and Speaker Diarization via Extreme Augmentation and Perfect Alignment

Researchers developed Lipi-Ghor-882, an 882-hour Bengali speech dataset, and demonstrated that targeted fine-tuning with synthetic acoustic degradation significantly improves automatic speech recognition for long-form Bengali audio. Their dual pipeline achieved a 0.019 Real-Time Factor, establishing new benchmarks for low-resource speech processing.

AI · Bullish · arXiv – CS AI · Feb 27 · 6/10 · 7

Understanding Usage and Engagement in AI-Powered Scientific Research Tools: The Asta Interaction Dataset

Researchers released the Asta Interaction Dataset containing over 200,000 user queries from AI-powered scientific research tools, revealing how scientists interact with LLM-based research assistants. The study shows users treat these systems as collaborative research partners, submitting longer queries and using outputs as persistent artifacts for non-linear exploration.

AI · Neutral · arXiv – CS AI · Mar 17 · 5/10

AgrI Challenge: A Data-Centric AI Competition for Cross-Team Validation in Agricultural Vision

Researchers introduced the AgrI Challenge, a data-centric AI competition focused on agricultural vision that revealed significant generalization gaps in machine learning models when deployed across different field conditions. The study found that models trained on single datasets showed validation-test gaps of up to 16.20%, but collaborative multi-source training reduced these gaps to under 3%.

AI · Neutral · arXiv – CS AI · Mar 12 · 5/10

CEI: A Benchmark for Evaluating Pragmatic Reasoning in Language Models

Researchers introduced the Contextual Emotional Inference (CEI) Benchmark, a dataset of 300 human-validated scenarios designed to evaluate how well large language models understand pragmatic reasoning in complex communication. The benchmark tests LLMs' ability to interpret ambiguous utterances across five pragmatic subtypes including sarcasm, mixed signals, and passive aggression in various social contexts.

AI · Neutral · arXiv – CS AI · Mar 9 · 4/10

Conditioning LLMs to Generate Code-Switched Text

Researchers developed a methodology to fine-tune large language models (LLMs) for generating code-switched text between English and Spanish by back-translating natural code-switched sentences into monolingual English. The study found that fine-tuning significantly improves LLMs' ability to generate fluent code-switched text, and that LLM-based evaluation methods align better with human preferences than traditional metrics.

AI · Neutral · arXiv – CS AI · Mar 5 · 4/10

A benchmark for joint dialogue satisfaction, emotion recognition, and emotion state transition prediction

Researchers have created a new multi-task Chinese dialogue dataset that enables prediction of user satisfaction, emotion recognition, and emotional state transitions across multiple conversation turns. The dataset addresses limitations in existing Chinese resources and aims to improve understanding of how user emotions evolve during interactions to better predict satisfaction.

AI · Neutral · arXiv – CS AI · Mar 5 · 4/10

MuRAL: A Multi-Resident Ambient Sensor Dataset Annotated with Natural Language for Activities of Daily Living

Researchers have released MuRAL, a new dataset containing over 21 hours of multi-resident smart home sensor data with natural language annotations for training AI models. The dataset aims to improve Large Language Models' ability to understand human activities in complex smart home environments, though current LLMs still struggle with key tasks like resident identification and activity prediction.

AI · Neutral · arXiv – CS AI · Mar 5 · 4/10

CareMedEval dataset: Evaluating Critical Appraisal and Reasoning in the Biomedical Field

Researchers introduce CareMedEval, a new dataset of 534 questions based on 37 scientific articles, built to evaluate large language models' ability to perform critical appraisal in biomedical contexts. Testing reveals that current AI models struggle with this specialized reasoning task, reaching an exact match rate of only 0.5 even with advanced prompting techniques.

AI · Neutral · arXiv – CS AI · Mar 4 · 4/10 · 3

The Vienna 4G/5G Drive-Test Dataset

Researchers have released the Vienna 4G/5G Drive-Test Dataset, a comprehensive open dataset of georeferenced mobile network measurements collected across Vienna, Austria. The dataset combines passive scanner observations with active handset logs and includes building/terrain models to support machine learning applications in mobile network analysis and optimization.

AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 3

MAC: A Conversion Rate Prediction Benchmark Featuring Labels Under Multiple Attribution Mechanisms

Researchers have created MAC, the first public conversion rate prediction dataset featuring labels from multiple attribution mechanisms, along with PyMAL, an open-source library for multi-attribution learning approaches. The study introduces a new method called Mixture of Asymmetric Experts (MoAE) that significantly outperforms existing state-of-the-art multi-attribution learning methods.

AI · Neutral · Hugging Face Blog · Dec 9 · 4/10 · 4

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

The article concerns an open preference dataset for text-to-image generation created by the Hugging Face community. The article body was not available, so the dataset's features, applications, and significance cannot be summarized.

AI · Neutral · Hugging Face Blog · Oct 25 · 4/10 · 8

Interactively explore your Huggingface dataset with one line of code

The article concerns a tool or method for interactively exploring Hugging Face datasets with a single line of code. The article body was not available, so the implementation and its capabilities cannot be summarized.

AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 4

Seeing Beyond 8bits: Subjective and Objective Quality Assessment of HDR-UGC Videos

Researchers introduce Beyond8Bits, a large-scale dataset of 44K HDR user-generated videos with 1.5M crowd ratings, and HDR-Q, the first multimodal large language model designed for HDR video quality assessment. The work addresses limitations of current video quality systems that are optimized for standard dynamic range content.

AI · Neutral · arXiv – CS AI · Mar 2 · 4/10 · 5

TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving

Researchers have released TaCarla, a dataset of over 2.85 million frames from the CARLA simulation environment, designed for end-to-end autonomous driving research. It addresses limitations in existing autonomous driving datasets by providing both perception and planning data, with diverse behavioral scenarios for model training and evaluation.

AI · Neutral · Hugging Face Blog · Oct 23 · 1/10 · 5

CinePile 2.0 - making stronger datasets with adversarial refinement

The article title references CinePile 2.0 and the use of adversarial refinement to build stronger datasets. The article body was not available, so no further details about this dataset's development can be summarized.

General · Neutral · Hugging Face Blog · Jul 8 · 1/10 · 6

Announcing New Dataset Search Features

The article title announces new dataset search features. The article body was not available, so the specific features and their implications cannot be summarized.
