AI · Neutral · arXiv – CS AI · 5h ago · 5/10
EgoPressDiff: Multimodal Video Diffusion for Egocentric UV-Domain Hand-Pressure Estimation
EgoPressDiff is a conditional video diffusion framework that estimates hand-surface contact pressure from egocentric video by generating UV-domain pressure maps from the visual input. The method combines pose and mesh-vertex features with a novel Distribution-Calibrated Spatial Layer and reports a 34% improvement in accuracy metrics, addressing limitations relevant to AR/VR, robotics, and ergonomics applications.