Latte - A first open source Transformer-based Video Diffusion Generation Framework (TMLR 2025)

Xin Ma
Xin Ma
PhD Student

I’m a Ph.D canditate at Monash University. My research interests include video and image generation, multimodal models, low-level vision, and face recognition, among others.