48 articles tagged with #dataset. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AIBullisharXiv – CS AI · Mar 26/1015
🧠Researchers introduce DesignSense-10k, a dataset of 10,235 human-annotated preference pairs for evaluating graphic layout generation, along with DesignSense, a specialized AI model that outperforms existing models by 54.6% in layout quality assessment. The framework addresses the gap between AI-generated layouts and human aesthetic preferences, showing practical improvements in layout generation through reinforcement learning.
AINeutralarXiv – CS AI · Mar 26/1015
🧠Researchers released LFQA-HP-1M, a dataset with 1.3 million human preference annotations for evaluating long-form question answering systems. The study introduces nine quality rubrics and shows that simple linear models can match advanced LLM evaluators while exposing vulnerabilities in current evaluation methods.
AIBullisharXiv – CS AI · Feb 275/103
🧠Researchers developed Lipi-Ghor-882, an 882-hour Bengali speech dataset, and demonstrated that targeted fine-tuning with synthetic acoustic degradation significantly improves automatic speech recognition for long-form Bengali audio. Their dual pipeline achieved a 0.019 Real-Time Factor, establishing new benchmarks for low-resource speech processing.
AIBullisharXiv – CS AI · Feb 276/107
🧠Researchers released the Asta Interaction Dataset containing over 200,000 user queries from AI-powered scientific research tools, revealing how scientists interact with LLM-based research assistants. The study shows users treat these systems as collaborative research partners, submitting longer queries and using outputs as persistent artifacts for non-linear exploration.
AIBullishHugging Face Blog · Mar 116/107
🧠The article title suggests LeRobot has released the world's largest open-source self-driving dataset, representing a significant contribution to autonomous vehicle development. However, no article body was provided for detailed analysis.
AINeutralarXiv – CS AI · Mar 175/10
🧠Researchers introduced the AgrI Challenge, a data-centric AI competition focused on agricultural vision that revealed significant generalization gaps in machine learning models when deployed across different field conditions. The study found that models trained on single datasets showed validation-test gaps of up to 16.20%, but collaborative multi-source training reduced these gaps to under 3%.
AINeutralarXiv – CS AI · Mar 125/10
🧠Researchers introduced the Contextual Emotional Inference (CEI) Benchmark, a dataset of 300 human-validated scenarios designed to evaluate how well large language models understand pragmatic reasoning in complex communication. The benchmark tests LLMs' ability to interpret ambiguous utterances across five pragmatic subtypes including sarcasm, mixed signals, and passive aggression in various social contexts.
AINeutralarXiv – CS AI · Mar 94/10
🧠Researchers developed a methodology to fine-tune large language models (LLMs) for generating code-switched text between English and Spanish by back-translating natural code-switched sentences into monolingual English. The study found that fine-tuning significantly improves LLMs' ability to generate fluent code-switched text, and that LLM-based evaluation methods align better with human preferences than traditional metrics.
AINeutralarXiv – CS AI · Mar 54/10
🧠Researchers have created a new multi-task Chinese dialogue dataset that enables prediction of user satisfaction, emotion recognition, and emotional state transitions across multiple conversation turns. The dataset addresses limitations in existing Chinese resources and aims to improve understanding of how user emotions evolve during interactions to better predict satisfaction.
AINeutralarXiv – CS AI · Mar 54/10
🧠Researchers have released MuRAL, a new dataset containing over 21 hours of multi-resident smart home sensor data with natural language annotations for training AI models. The dataset aims to improve Large Language Models' ability to understand human activities in complex smart home environments, though current LLMs still struggle with key tasks like resident identification and activity prediction.
AINeutralarXiv – CS AI · Mar 54/10
🧠Researchers introduce CareMedEval, a new dataset with 534 questions based on 37 scientific articles to evaluate large language models' ability to perform critical appraisal in biomedical contexts. Testing reveals current AI models struggle with this specialized reasoning task, achieving only 0.5 exact match rates even with advanced prompting techniques.
AINeutralarXiv – CS AI · Mar 44/103
🧠Researchers have released the Vienna 4G/5G Drive-Test Dataset, a comprehensive open dataset of georeferenced mobile network measurements collected across Vienna, Austria. The dataset combines passive scanner observations with active handset logs and includes building/terrain models to support machine learning applications in mobile network analysis and optimization.
AINeutralarXiv – CS AI · Mar 34/104
🧠Researchers have created CrimeNER, a specialized dataset of over 1,500 annotated crime-related documents for training named-entity recognition AI models. The study addresses the lack of quality training data in the crime domain by developing a database from terrorist attack reports and DOJ press notes, defining 22 types of crime-related entities.
AINeutralarXiv – CS AI · Mar 34/103
🧠Researchers have created MAC, the first public conversion rate prediction dataset featuring labels from multiple attribution mechanisms, along with PyMAL, an open-source library for multi-attribution learning approaches. The study introduces a new method called Mixture of Asymmetric Experts (MoAE) that significantly outperforms existing state-of-the-art multi-attribution learning methods.
AINeutralHugging Face Blog · Dec 94/104
🧠The article appears to be about an open preference dataset for text-to-image generation created by the Hugging Face community. However, the article body is empty, making it impossible to provide specific details about the dataset's features, applications, or significance.
AIBullishHugging Face Blog · Nov 44/107
🧠Argilla has released version 2.4 of their dataset building platform, which allows users to create fine-tuning and evaluation datasets without coding requirements. The update focuses on improving accessibility for non-technical users to build AI training datasets through their Hub platform.
AINeutralHugging Face Blog · Jul 184/106
🧠The article title references Docmatix, which appears to be a large-scale dataset designed for Document Visual Question Answering tasks. However, no article body content was provided for analysis.
AIBullishHugging Face Blog · Mar 155/106
🧠The WebSight Dataset represents a new AI development that enables automatic conversion of web screenshots into HTML code. This breakthrough could significantly streamline web development processes by using machine learning to interpret visual web layouts and generate corresponding code.
AINeutralHugging Face Blog · Oct 254/108
🧠The article appears to discuss a tool or method for interactively exploring Hugging Face datasets using a single line of code. However, the article body is empty, preventing detailed analysis of the specific implementation or capabilities.
AINeutralarXiv – CS AI · Mar 34/104
🧠Researchers introduce Beyond8Bits, a large-scale dataset of 44K HDR user-generated videos with 1.5M crowd ratings, and HDR-Q, the first multimodal large language model designed for HDR video quality assessment. The work addresses limitations of current video quality systems that are optimized for standard dynamic range content.
$NEAR
AINeutralarXiv – CS AI · Mar 24/105
🧠Researchers have released TaCarla, a comprehensive dataset containing over 2.85 million frames from CARLA simulation environment designed for end-to-end autonomous driving research. The dataset addresses limitations in existing autonomous driving datasets by providing both perception and planning data with diverse behavioral scenarios for comprehensive model training and evaluation.
$RNDR
AINeutralHugging Face Blog · Oct 231/105
🧠The article title references CinePile 2.0 and adversarial refinement for dataset improvement, but the article body appears to be empty or not provided. Without content to analyze, no meaningful insights about this AI/ML dataset development can be extracted.
GeneralNeutralHugging Face Blog · Jul 81/106
📰The article title suggests an announcement about new dataset search features, but no article body content was provided for analysis. Without the actual article content, specific details about the features, their implications, or market impact cannot be determined.