Analytics Digests Sources Topics RSS AI Crypto

#code-llm-evaluation News & Analysis

1 article tagged with #code-llm-evaluation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

1 articles

AIBearisharXiv – CS AI · Jun 97/10

🧠

Beyond Pass Rate: A Multilingual, Execution-Grounded Evaluation of Open Code LLMs

A comprehensive evaluation of 9 open-source coding LLMs across 2,707 LeetCode problems in 12 programming languages reveals significant performance gaps compared to human developers. The best model achieves only 23.64% correctness versus a 57.2% human baseline, with performance varying substantially across languages and problem types, indicating that aggregate benchmarks mask critical weaknesses in code generation systems.

Tag Connections

110

107

105

93

#bitcoin↔#market

69

#geopolitical↔#iran

61

58

#ai↔#artificial-intelligence

52

#ai↔#microsoft

47

46

Tag Sentiment

#ai901 articles

#market551 articles

#iran472 articles

#bitcoin389 articles

#trump266 articles

#trading181 articles

#openai144 articles

#security133 articles

#nvidia117 articles

#exchange116 articles

BullishNeutralBearish

◆ AI Mentions

🏢OpenAI

150×

🏢Nvidia

116×

🏢Anthropic

91×

🏢Hugging Face

53×

🧠Claude

52×

🧠Gemini

30×

🧠ChatGPT

29×

🧠GPT-5

29×

🏢Meta

19×

🧠Opus

15×

🧠Llama

11×

🏢xAI

8×

🧠Grok

8×

🏢Perplexity

8×

🏢Google

7×

🧠Sonnet

6×

🏢Microsoft

5×

🧠GPT-4

3×

🧠Midjourney

2×

🧠o1

1×

Stay Updated

Everything combined

▲ Trending Tags

1#ai901 2#market551 3#iran472 4#bitcoin389 5#trump266 6#trading181 7#openai144 8#security133 9#nvidia117 10#exchange116 11#geopolitical114 12#google97 13#china95 14#fed94 15#ethereum78

Filters

Sentiment

Importance

Sort

📡 See all 70+ sources

y0.exchange

Your AI agent for DeFi

Connect Claude or GPT to your wallet. AI reads balances, proposes swaps and bridges — you approve. Your keys never leave your device.

8 MCP tools · 15 chains · $0 fees

Connect Wallet to AI →How it works →

Viewing: y0 Digest feed