Mistral AI has introduced Pixtral 12B, a new open-source vision model with 12 billion parameters, designed for multimodal tasks. The model performs strongly at interpreting image and text data: it follows instructions well, recognizes the subjects of images such as a photo of Bill Gates, reads CAPTCHAs, and produces detailed image descriptions. Although