Recent Publications

Quickly discover relevant content by filtering publications.
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Published in International Conference on Learning Representations (ICLR), 2024, Stars
LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
Published in International Journal of Computer Vision (IJCV), 2024, JCR Q1 & CCF-A, Stars

Granted Patents

  • Model training method, map building method and device, CN114972909B

  • Human face image super-resolution method based on attention mechanism, CN111080513B

  • Image super-resolution method of adversarial generative network based on fusion mutual information, CN110660020B

  • Image super-resolution method of deep neural network fusing mutual information, CN110211035B

  • Attention-mechanism-based image completion method and device, CN112184582B

  • Cartoon style image conversion model training method, image generation method and device, CN112232485B

  • Image completion method based on uncertainty estimation, CN112686817B