#dataset-design News & Analysis

2 articles tagged with #dataset-design. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles

AINeutralarXiv – CS AI · Jun 96/10

🧠

Emergence of Context Characteristics Sensitivity in Large Language Models

Researchers studied how large language models develop sensitivity to context characteristics during instruction fine-tuning across three stages: supervised fine-tuning, direct preference optimization, and reinforcement learning. The study found that models progressively learn to favor easily understandable contexts with high length and similarity to queries, with subsequent training stages either reinforcing or resolving these preferences based on dataset design.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Video Understanding by Design: How Datasets Shape Video Models

A comprehensive survey argues that dataset structure fundamentally shapes the evolution of video understanding models, connecting dataset characteristics to architectural innovations like transformers and multimodal foundation models. The research provides a unified framework explaining how different datasets drive specific inductive biases and architectural choices across video AI development.