AI × Crypto · Neutral · arXiv (CS AI) · 7h ago · 7/10
Intent2Tx: Benchmarking LLMs for Translating Natural Language Intents into Ethereum Transactions
Researchers introduce Intent2Tx, a benchmark dataset of nearly 32,000 real-world Ethereum transactions designed to evaluate how well large language models can translate natural language instructions into executable blockchain transactions. Testing 16 state-of-the-art LLMs reveals a critical gap: while models generate syntactically valid code, they frequently fail to achieve intended on-chain state transitions, exposing fundamental limitations in current AI's ability to reliably bridge user intent and blockchain execution.
$ETH
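
The gap the summary describes, between a transaction that is syntactically valid and one that produces the intended on-chain state transition, can be illustrated with a toy model. This is a minimal sketch and not the benchmark's actual evaluation harness: the `Tx` type, the `is_well_formed`, `apply_tx`, and `achieves_intent` helpers, and the in-memory balance ledger are all hypothetical names introduced here, and gas, nonces, signatures, and contract calls are deliberately ignored.

```python
import re
from dataclasses import dataclass

# Hypothetical toy model: a "chain state" is just a dict of address -> balance in wei.
ADDRESS_RE = re.compile(r"^0x[0-9a-fA-F]{40}$")
WEI_PER_ETH = 10**18


@dataclass(frozen=True)
class Tx:
    sender: str
    to: str
    value_wei: int


def is_well_formed(tx: Tx) -> bool:
    """Syntactic check only: address shape and a non-negative value."""
    return (
        bool(ADDRESS_RE.match(tx.sender))
        and bool(ADDRESS_RE.match(tx.to))
        and tx.value_wei >= 0
    )


def apply_tx(balances: dict, tx: Tx) -> dict:
    """Return the post-state of a plain transfer; an underfunded transfer leaves state unchanged."""
    post = dict(balances)
    if post.get(tx.sender, 0) < tx.value_wei:
        return post  # modeled as a revert (gas costs are ignored in this sketch)
    post[tx.sender] = post.get(tx.sender, 0) - tx.value_wei
    post[tx.to] = post.get(tx.to, 0) + tx.value_wei
    return post


def achieves_intent(balances: dict, tx: Tx, expected: dict) -> bool:
    """Semantic check: the transaction must be well formed AND reach the expected state."""
    return is_well_formed(tx) and apply_tx(balances, tx) == expected


# Intent: "send 1 ETH from Alice to Bob" (addresses are made up for illustration).
ALICE = "0x" + "a" * 40
BOB = "0x" + "b" * 40
start = {ALICE: 5 * WEI_PER_ETH, BOB: 0}
expected = {ALICE: 4 * WEI_PER_ETH, BOB: 1 * WEI_PER_ETH}

correct = Tx(ALICE, BOB, 1 * WEI_PER_ETH)
wrong_unit = Tx(ALICE, BOB, 1)  # 1 wei instead of 1 ETH: valid syntax, wrong outcome

print(is_well_formed(correct), achieves_intent(start, correct, expected))
print(is_well_formed(wrong_unit), achieves_intent(start, wrong_unit, expected))
```

Both transactions pass the syntactic check, but only the first reaches the expected balances. A unit mix-up like the second one is the kind of error that a format-only validator would miss and a state-based comparison would catch.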