AINeutralarXiv – CS AI · 8h ago5/10
🧠
The Impact of VAE Design on Latent Pose Representations for Diffusion-based Sign Language Production
Researchers investigate how variational autoencoder (VAE) design choices affect latent space properties in sign language production systems using diffusion models. Testing on the Phoenix14T dataset reveals that downstream generative performance correlates more strongly with latent space structure than with traditional reconstruction metrics, suggesting current evaluation methods may miss critical factors influencing model quality.