Llama 3.2, developed in collaboration with Meta and available on Hugging Face, includes both multimodal vision models and text-only models. The Vision models come in 11B and 90B sizes and feature strong visual reasoning capabilities. Text-only models are available in 1B and 3B sizes, optimized for on-device use. Llama 3.2 also

15m read timeFrom huggingface.co
Post cover image
Table of contents
Table of contentsWhat is Llama 3.2 Vision?Llama 3.2 license changes. Sorry, EU :(What is special about Llama 3.2 1B and 3B?DemoUsing Hugging Face TransformersLlama 3.2 1B & 3B Language ModelsLlama 3.2 VisionOn-deviceFine-tuning Llama 3.2Hugging Face Partner IntegrationsAdditional ResourcesAcknowledgements
10 Comments

Sort: