AI · Neutral · arXiv – CS AI · 3d ago · 6/10
OPENXRD: A Comprehensive Benchmark Framework for LLM/MLLM XRD Question Answering
Researchers introduced OPENXRD, a benchmarking framework for evaluating large language models (LLMs) and multimodal LLMs (MLLMs) on X-ray diffraction (XRD) crystallography question answering. The study tested 74 state-of-the-art models and found that mid-sized models (7B-70B parameters) benefit most from contextual materials, while very large models often show saturation or interference.
GPT-4 · GPT-4.5 · GPT-5