AINeutralarXiv – CS AI · 8h ago6/10
🧠
CodegenBench: Can LLMs Write Efficient Code Across Architectures?
Researchers introduced CodegenBench, a benchmark suite evaluating large language models' ability to generate efficient code across diverse CPU architectures including x86_64, Sunway, and Kunpeng. The study reveals that while LLMs excel at generating optimized code for mainstream architectures, they significantly underperform on domain-specific platforms with limited public documentation, exposing critical gaps in cross-platform generalization.