y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#data-fidelity News & Analysis

2 articles tagged with #data-fidelity. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles
AINeutralarXiv – CS AI · May 276/10
🧠

When Does Synthetic Patent Data Help? Volume-Fidelity Trade-offs in Low-Resource Multi-Label Classification

Researchers demonstrate that synthetic data generated by LLMs for patent classification shows mixed results, with improvements primarily driven by increased sample volume rather than data quality. The optimal strategy combines 20-30% real data with 70-80% synthetic data, though synthetic corpora can paradoxically harm retrieval performance despite improving classification metrics.

AINeutralarXiv – CS AI · Apr 206/10
🧠

Evaluating LLMs as Human Surrogates in Controlled Experiments

Researchers compared large language models with human responses in a behavioral study on accuracy perception, finding that LLMs reproduce directional effects but with inconsistent effect magnitudes across different models. The study reveals that off-the-shelf LLMs can simulate some human belief-updating patterns in controlled experiments but lack reliable human-scale accuracy, establishing clearer boundaries for when synthetic LLM data is appropriate for behavioral research.