Publications

(2024). InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation. In ICLR.

PDF Cite Dataset

(2024). SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction. In ICLR.

PDF Cite Code

(2024). Latte: Latent Diffusion Transformer for Video Generation.

Preprint Cite Code Project

(2023). LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models.

Preprint Cite Code Project

(2023). Uncertainty-Aware Image Inpainting with Adaptive Feedback Network. ESWA.

PDF Cite Code

(2023). LEO: Generative Latent Image Animator for Human Video Synthesis.

Preprint Cite Project

(2022). Style-Based Attentive Network for Real-World Face Hallucination. In PRCV.

PDF Cite

(2022). Compressing Models with Few Samples: Mimicking then Replacing. In CVPR.

PDF Cite

(2021). Contrastive attention network with dense field estimation for face completion. PR.

PDF Cite

(2021). Partial NIR-VIS heterogeneous face recognition with automatic saliency search. T-IFS.

PDF Cite

(2021). Unsupervised Contrastive Photo-to-Caricature Translation based on Auto-distortion. In ICPR.

PDF Cite

(2021). Free-form image inpainting via contrastive attention network. In ICPR.

PDF Cite

(2021). Inconsistency-aware wavelet dual-branch network for face forgery detection. IEEE Transactions on Biometrics, Behavior, and Identity Science.

PDF Cite

(2021). FA-GAN: face augmentation GAN for deformation-invariant face recognition. T-IFS.

PDF Cite