AI · Bullish · arXiv – CS AI · 7h ago · 6/10
Visual-ERM: Reward Modeling for Visual Equivalence
Researchers introduce Visual-ERM, a multimodal reward model that improves vision-to-code tasks by evaluating visual equivalence in rendered outputs rather than relying on text-based rules. The system achieves significant performance gains on chart-to-code tasks (+8.4) and shows consistent improvements across table and SVG parsing applications.
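
To make the contrast between text-based rules and visual equivalence concrete, here is a minimal sketch. It is not Visual-ERM itself, which is a learned multimodal reward model. Instead it uses a simple pixel-difference score as a stand-in for the visual judgment. The function names (`text_rule_reward`, `pixel_reward`) and the tiny grayscale "renders" are hypothetical and chosen only for illustration.

```python
from typing import List

# A grayscale image as rows of pixel intensities in 0..255 (illustrative type).
Image = List[List[int]]


def text_rule_reward(code: str, reference: str) -> float:
    """Text-based rule (assumed for illustration): exact match of source strings."""
    return 1.0 if code.strip() == reference.strip() else 0.0


def pixel_reward(rendered: Image, target: Image) -> float:
    """Stand-in visual score: 1 minus the normalized mean absolute pixel difference.

    This is NOT the learned reward model from the paper; it only shows that
    the comparison is made on rendered outputs instead of on source text.
    """
    # Images with different shapes are treated as not equivalent.
    if len(rendered) != len(target):
        return 0.0
    if any(len(r) != len(t) for r, t in zip(rendered, target)):
        return 0.0

    total_pixels = sum(len(row) for row in target)
    if total_pixels == 0:
        return 0.0

    abs_error = sum(
        abs(a - b)
        for r_row, t_row in zip(rendered, target)
        for a, b in zip(r_row, t_row)
    )
    return 1.0 - abs_error / (255.0 * total_pixels)


# Two different programs whose (hypothetical) renders are identical.
code_a = "plot(x, y, color='black')"
code_b = "plot(x, y, color='#000000')"
render_a: Image = [[0, 255], [255, 0]]
render_b: Image = [[0, 255], [255, 0]]

text_score = text_rule_reward(code_a, code_b)   # source strings differ
visual_score = pixel_reward(render_a, render_b)  # rendered outputs agree
```

In this toy case the text rule penalizes `code_b` even though both snippets would produce the same picture, while the render-based score treats them as equivalent. A learned reward model replaces the pixel comparison with a trained judgment of whether two renders match.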