y0news

#pretraining News & Analysis

28 articles tagged with #pretraining. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Neutral · arXiv – CS AI · Mar 25/106

General vs Domain-Specific CNNs: Understanding Pretraining Effects on Brain MRI Tumor Classification

A study comparing CNN architectures for brain tumor classification found that general-purpose models like ConvNeXt-Tiny (93% accuracy) outperformed domain-specific medical pretrained models like RadImageNet DenseNet121 (68% accuracy). The study suggests that contemporary general-purpose CNNs with diverse pretraining may be more effective than domain-specific ones for medical imaging tasks in data-scarce scenarios.

AI · Neutral · Apple Machine Learning · Feb 245/103

Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

Researchers investigate whether using a single HTML-to-text extractor for web-scale LLM pretraining datasets leads to suboptimal data utilization. The study reveals that different extractors can result in substantially different pages surviving filtering pipelines, despite similar model performance on standard language tasks.
