VideoCLIP: Contrastive Pre-Training for Zero-Shot Video-Text Understanding

1 points by LuisMondragon 5 years ago · 2 comments

Reader

sharemywin 5 years ago

Not sure if this is the same thing?

https://github.com/openai/CLIP

LuisMondragonOP 5 years ago

Not the same. CLIP is trained with pairs of images and texts, whereas VideoCLIP uses pairs of videos and texts.