y0news

#video-analysis News & Analysis

6 articles tagged with #video-analysis. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Neutral · arXiv – CS AI · Mar 17 · 7/10

Human-AI Ensembles Improve Deepfake Detection in Low-to-Medium Quality Videos

Research comparing 200 humans and 95 AI detectors found humans significantly outperform AI at detecting deepfakes, especially in low-quality mobile phone videos where AI accuracy drops to near chance levels. The study reveals human-AI hybrid systems are most effective, as humans and AI make complementary errors in deepfake detection.
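
One way to picture why complementary errors help is a simple weighted blend of the two judgments. The sketch below is illustrative only and is not the method used in the study: the function names, the `human_weight` parameter, and the 0.5 threshold are all assumptions introduced here.

```python
from statistics import mean


def ensemble_score(human_ratings, ai_probability, human_weight=0.5):
    """Blend the mean human rating with an AI detector's probability.

    All inputs are probabilities in [0, 1] that the video is fake.
    human_weight is a hypothetical tuning knob, not a value from the study.
    """
    if not human_ratings:
        raise ValueError("need at least one human rating")
    if not 0.0 <= human_weight <= 1.0:
        raise ValueError("human_weight must be in [0, 1]")
    human_score = mean(human_ratings)
    return human_weight * human_score + (1.0 - human_weight) * ai_probability


def is_fake(human_ratings, ai_probability, threshold=0.5):
    """Flag the video as fake when the blended score exceeds the threshold."""
    return ensemble_score(human_ratings, ai_probability) > threshold
```

With equal weights, a confident human crowd can override a detector that is near chance on a low-quality clip, and a confident detector can do the same for an unsure crowd.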

AI · Bullish · arXiv – CS AI · Mar 26 · 6/10

LensWalk: Agentic Video Understanding by Planning How You See in Videos

Researchers introduced LensWalk, an agentic AI framework that enables Large Language Models to actively control their visual observation of videos through dynamic temporal sampling. The system uses a reason-plan-observe loop to progressively gather evidence, achieving 5% accuracy improvements on challenging video benchmarks without requiring model fine-tuning.
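
The reason-plan-observe loop can be pictured as repeatedly sampling a few frames, judging which one looks most relevant, and narrowing the window around it. The toy sketch below is not LensWalk's implementation: `score_frame` is a hypothetical stand-in for the language model's judgment, and the window-halving rule is an assumption chosen for simplicity.

```python
def sample_timestamps(start, end, count):
    """Observe: pick `count` evenly spaced timestamps in [start, end]."""
    if count < 2:
        return [(start + end) / 2]
    step = (end - start) / (count - 1)
    return [start + i * step for i in range(count)]


def reason_plan_observe(duration, score_frame, rounds=3, frames_per_round=5):
    """Iteratively narrow the observed window around the best-scoring frame.

    score_frame is a hypothetical callable that rates how relevant the frame
    at a given timestamp is to the question (higher is better).
    """
    start, end = 0.0, float(duration)
    best_time = None
    for _ in range(rounds):
        # Observe: sample a handful of frames from the current window.
        timestamps = sample_timestamps(start, end, frames_per_round)
        # Reason: pick the frame judged most relevant.
        best_time = max(timestamps, key=score_frame)
        # Plan: halve the window and center it on that frame.
        half_width = (end - start) / 4
        start = max(0.0, best_time - half_width)
        end = min(float(duration), best_time + half_width)
    return best_time, (start, end)
```

For example, `reason_plan_observe(100, lambda t: -abs(t - 62))` homes in on the region around second 62 of a 100-second clip after looking at 15 frames in total, rather than scanning the clip densely.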

AI · Neutral · arXiv – CS AI · Mar 17 · 6/10

Is Seeing Believing? Evaluating Human Sensitivity to Synthetic Video

Research finds that visual and audio distortions lead viewers to doubt the credibility of deepfake videos. Across three experiments, both technical artifacts and distortions in synthetic media reduced perceived credibility, though human perception of deepfakes remains poorly understood.

AI · Bullish · arXiv – CS AI · Mar 17 · 5/10

Learning Question-Aware Keyframe Selection with Synthetic Supervision for Video Question Answering

Researchers developed a question-aware keyframe selection framework for video question answering that uses large multimodal models to generate pseudo labels and adds coverage regularization. The method significantly improves accuracy on temporal and causal questions in the NExT-QA dataset, making video analysis more efficient by reducing inference costs.

AI · Bullish · arXiv – CS AI · Mar 17 · 5/10

Speech Recognition on TV Series with Video-guided Post-ASR Correction

Researchers have developed a Video-Guided Post-ASR Correction (VPC) framework that uses Video-Large Multimodal Models to improve speech recognition accuracy in complex environments like TV series. The system addresses challenges with multiple speakers, overlapping speech, and domain-specific terminology by leveraging video context to refine ASR outputs.

AI · Neutral · arXiv – CS AI · Mar 16 · 4/10

Team LEYA in 10th ABAW Competition: Multimodal Ambivalence/Hesitancy Recognition Approach

Team LEYA developed a multimodal AI approach for recognizing ambivalence and hesitancy in videos for the 10th ABAW Competition, combining scene, facial, audio, and text analysis. Their fusion model achieved 83.25% accuracy compared to 70.02% for single-modality approaches, demonstrating significant improvements in behavioral recognition technology.