y0news

#humaneval News & Analysis

2 articles tagged with #humaneval. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bearish · arXiv – CS AI · Apr 13 · 7/10
On the Limits of Layer Pruning for Generative Reasoning in Large Language Models

Research demonstrates that layer pruning, a compression technique for large language models, reduces model size while maintaining classification performance but fails to preserve generative reasoning capabilities such as arithmetic and code generation. Even after extensive post-training on 400B tokens, pruned models do not recover the lost reasoning abilities, which points to fundamental limitations in current compression approaches.

AI · Neutral · Hugging Face Blog · Jun 18 · 4/10 · 4
BigCodeBench: The Next Generation of HumanEval

The article appears to discuss BigCodeBench as a new evaluation benchmark for code generation, positioning it as an advancement over HumanEval. However, the article body is empty, preventing detailed analysis of its features, methodology, or potential impact on AI development.