arXiv, CS AI
LitBench: A Graph-Centric Large Language Model Benchmarking Tool For Literature Tasks
Researchers have introduced LitBench, a benchmarking tool for developing and evaluating domain-specific large language models on literature-related tasks. The tool uses graph-centric data curation to generate domain-specific literature sub-graphs and to create training datasets. In the reported results, small domain-specific LLMs perform competitively with state-of-the-art models such as GPT-4o.