AINeutralarXiv – CS AI · 6h ago6/10
🧠
Planktonzilla: Multimodal dataset and models for understanding plankton ecosystems
Researchers introduce Planktonzilla-17M, the largest unified plankton image dataset with 17.4 million images across 602 taxonomic classes from thirteen imaging systems. The work demonstrates that supervised learning with taxonomic lineage outperforms CLIP-style training and reveals limitations in current biological foundation models like BioCLIP for marine imaging applications.