Rethinking Genomic Modeling Through Optical Character Recognition
Researchers introduce OpticalDNA, a vision-based genomic modeling framework that treats DNA sequences as visual documents rather than token sequences, achieving superior performance with 20× fewer effective tokens and 256k trainable parameters. This represents a fundamental architectural shift in how foundation models approach genomic data, improving computational efficiency and long-context understanding.