CLIP is a deep learning model developed by OpenAI that enables direct comparisons between images and text. It has applications in image classification, retrieval, and content moderation. CLIP establishes a multi-modal embedding space through joint training of image and text encoders.

10m read timeFrom towardsdatascience.com
Post cover image

Sort: