AI · Neutral · arXiv – CS AI · 7h ago · 6/10
RelayFormer: A Unified Local-Global Attention Framework for Scalable Image and Video Manipulation Localization
RelayFormer is a new deep learning framework that unifies image and video manipulation localization in a single architecture. Its flexible attention mechanism is built on Global-Local Relay (GLR) tokens. The approach handles variable resolutions without distortion and processes both static and temporal data, addressing key limitations of current visual forensics methods.