#kernel-performance News & Analysis

2 articles tagged with #kernel-performance. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles

AIBullisharXiv – CS AI · Jun 17/10

🧠

GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization

Researchers demonstrate that large language models can effectively forecast GPU kernel performance, reducing expensive on-device evaluations during optimization searches. By acting as selective surrogates that know their confidence limits, LLMs enable kernel searches to evaluate multiple candidates under fixed GPU budgets, ultimately discovering faster kernels than baseline approaches.

AIBullisharXiv – CS AI · Mar 127/10

🧠

KernelSkill: A Multi-Agent Framework for GPU Kernel Optimization

Researchers developed KernelSkill, a multi-agent framework that optimizes GPU kernel performance using expert knowledge rather than trial-and-error approaches. The system achieved 100% success rates and significant speedups (1.92x to 5.44x) over existing methods, addressing a critical bottleneck in AI system efficiency.