ICLR

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Published in International Conference on Learning Representations (ICLR), 2024