y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#training-optimization News & Analysis

29 articles tagged with #training-optimization. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

29 articles
AIBullisharXiv – CS AI · Mar 26/109
🧠

Preference Packing: Efficient Preference Optimization for Large Language Models

Researchers propose 'preference packing,' a new optimization technique for training large language models that reduces training time by at least 37% through more efficient handling of duplicate input prompts. The method optimizes attention operations and KV cache memory usage in preference-based training methods like Direct Preference Optimization.

AIBullisharXiv – CS AI · Feb 276/106
🧠

RLHFless: Serverless Computing for Efficient RLHF

Researchers introduce RLHFless, a serverless computing framework for Reinforcement Learning from Human Feedback (RLHF) that addresses resource inefficiencies in training large language models. The system achieves up to 1.35x speedup and 44.8% cost reduction compared to existing solutions by dynamically adapting to resource demands and optimizing workload distribution.

AINeutralarXiv – CS AI · Mar 175/10
🧠

Align Forward, Adapt Backward: Closing the Discretization Gap in Logic Gate Networks

Researchers propose CAGE (Confidence-Adaptive Gradient Estimation) to solve the training-inference mismatch problem in neural networks that use soft mixtures during training but hard selection during inference. The method achieves over 98% accuracy on MNIST with zero selection gap, significantly outperforming existing approaches like Gumbel-ST which suffers accuracy collapse.

← PrevPage 2 of 2