This video explains how CLIP from OpenAI transforms Image Classification into a TextImage similarity matching task. This is done with Contrastive Training and ZeroShot PatternExploiting Training. Thanks for watching Paper Links: Clip (Blog Post): VirTex: ConVIRT: PatternExploiting Training: Vision Transformer (Blog Post, Nice Animation):
0
0
Related videos
Preparing
To view the site materials you should be more than or equal to 18 years old