CLIP is a deep learning model developed by OpenAI that enables direct comparisons between images and text. It has applications in image classification, retrieval, and content moderation. CLIP establishes a multi-modal embedding space through joint training of image and text encoders.

Sort: