Selected Projects

Here are a selection of projects that I have worked on over the years.

*
Cinemo - An image animation Framework (CVPR 2025)
This framework offers simpler, more precise user control and better image animation performance.
Latte - A first open source Transformer-based Video Diffusion Generation Framework (TMLR 2025)
A simple and general latent video diffusion model incorporating sptio-temporal Transformers for video generation.
LaVie - A High-Quality Video Generation Framework (IJCV 2024)
A large-scale text-to-video framework that produces high-quality and temporally coherent videos. This framework operates on cascaded video latent diffusion models, comprising a base T2V model, a temporal interpolation model, and a video super-resolution model.