AI · Neutral · arXiv – CS AI · Mar 12 · 6/10
SpreadsheetArena: Decomposing Preference in LLM Generation of Spreadsheet Workbooks
Researchers introduce SpreadsheetArena, a platform for evaluating large language models' ability to generate spreadsheet workbooks from natural language prompts. The study reveals that preferred spreadsheet features vary significantly across use cases, and even top-performing models struggle with domain-specific best practices in areas like finance.