y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#gradient-estimation News & Analysis

3 articles tagged with #gradient-estimation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles
AIBullisharXiv โ€“ CS AI ยท Mar 57/10
๐Ÿง 

Unbiased Dynamic Pruning for Efficient Group-Based Policy Optimization

Researchers introduce Dynamic Pruning Policy Optimization (DPPO), a new framework that accelerates AI language model training by 2.37x while maintaining accuracy. The method addresses computational bottlenecks in Group Relative Policy Optimization through unbiased gradient estimation and improved data efficiency.

AIBullisharXiv โ€“ CS AI ยท Apr 136/10
๐Ÿง 

On Divergence Measures for Training GFlowNets

Researchers propose improved divergence measures for training Generative Flow Networks (GFlowNets), comparing Renyi-ฮฑ, Tsallis-ฮฑ, and KL divergences to enhance statistical efficiency. The work introduces control variates that reduce gradient variance and achieve faster convergence than existing methods, bridging GFlowNets training with generalized variational inference frameworks.

AINeutralarXiv โ€“ CS AI ยท Mar 175/10
๐Ÿง 

Align Forward, Adapt Backward: Closing the Discretization Gap in Logic Gate Networks

Researchers propose CAGE (Confidence-Adaptive Gradient Estimation) to solve the training-inference mismatch problem in neural networks that use soft mixtures during training but hard selection during inference. The method achieves over 98% accuracy on MNIST with zero selection gap, significantly outperforming existing approaches like Gumbel-ST which suffers accuracy collapse.