AI · Neutral · arXiv – CS AI · 6h ago · 6/10
ESTANet: Efficient Online Error Detection in Procedural Videos via Prediction Inconsistency
ESTANet is a lightweight deep learning framework for real-time (online) error detection in procedural videos. It exploits inconsistencies among the predictions of multiple action detectors with different sensitivities. The system reports state-of-the-art performance on multiple datasets while remaining computationally efficient, suggesting that the inherent properties of existing detectors can be leveraged for complex vision tasks without added architectural complexity.
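
To make the idea of "prediction inconsistency" concrete, here is a minimal Python sketch. It is not ESTANet's actual method, which the summary does not describe in detail. It assumes each detector outputs one integer action label per frame, and it uses a simple disagreement rate as the score; the function names `inconsistency_scores` and `flag_errors` and the `threshold` parameter are hypothetical.

```python
from typing import Sequence


def inconsistency_scores(predictions: Sequence[Sequence[int]]) -> list[float]:
    """Per-frame disagreement among detectors (illustrative assumption).

    predictions[d][t] is the action label that detector d assigns to frame t.
    The score is 0.0 when all detectors agree and grows as they diverge.
    """
    if not predictions:
        return []
    num_frames = len(predictions[0])
    if any(len(p) != num_frames for p in predictions):
        raise ValueError("all detectors must cover the same number of frames")

    scores = []
    for t in range(num_frames):
        labels = [p[t] for p in predictions]
        # Size of the largest group of detectors that agree on this frame.
        majority = max(labels.count(label) for label in set(labels))
        scores.append(1.0 - majority / len(labels))
    return scores


def flag_errors(predictions: Sequence[Sequence[int]], threshold: float = 0.5) -> list[bool]:
    """Mark a frame as a possible error when disagreement reaches the threshold."""
    return [score >= threshold for score in inconsistency_scores(predictions)]


# Three hypothetical detectors labelling four frames.
preds = [
    [0, 0, 1, 1],
    [0, 0, 1, 2],
    [0, 0, 2, 3],
]
print(flag_errors(preds))
```

Each frame's score depends only on that frame's predictions, so the same computation could run frame by frame in an online setting. The threshold value here is arbitrary and would need tuning on real data.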