AIBullisharXiv – CS AI · 14h ago6/10
🧠
Frontier LLM-based agents can overcome the ontology curation bottleneck for natural phenotypes
Frontier large language models from Anthropic and OpenAI have demonstrated competitive performance with human experts at annotating natural phenotypes to ontology terms, a previously labor-intensive bottleneck in biological research. When evaluated against the same Gold Standard benchmark used in a 2018 study, these AI agents performed within the range of trained human curators and substantially outperformed prior NLP tools, suggesting significant potential to scale phenotype annotation workflows.
🏢 OpenAI🏢 Anthropic