Meta

Meta: Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision is an 11-billion-parameter multimodal model built for tasks that combine visual and textual data, such as image captioning and visual question answering. Pre-trained on a large dataset of image-text pairs, it is suited to detailed image analysis. Because it pairs visual understanding with language generation, it fits applications such as content creation, AI-driven customer service, and research. See the [original model card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_2/MODEL_CARD_VISION.md) for details. Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).

Context Window: 131K tokens
Parameters: 11B
Input Price: $0.05 per 1M tokens
Output Price: $0.05 per 1M tokens
Price Tier: budget
Provider: Meta
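
As a rough illustration of what the listed prices mean in practice, the sketch below estimates the cost of a single request. It assumes that "$0.05/1M" means US dollars per one million tokens; the function name `estimate_cost` and the constants are hypothetical and not part of any CoreAI or Meta API. How images are converted to tokens is not stated on this page, so the example works with token counts only.

```python
# Prices copied from the listing above.
# Assumption: "$0.05/1M" means USD per 1,000,000 tokens.
INPUT_PRICE_PER_MILLION = 0.05
OUTPUT_PRICE_PER_MILLION = 0.05


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return an estimated request cost in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_PRICE_PER_MILLION
        + output_tokens * OUTPUT_PRICE_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost(100_000, 2_000):.4f}")
```

Under these assumptions, even a prompt that fills most of the context window costs well under one cent, which is consistent with the "budget" price tier.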

How to Use Meta: Llama 3.2 11B Vision Instruct

With CoreAI, you can start chatting with Meta: Llama 3.2 11B Vision Instruct instantly, with no separate subscription needed. CoreAI bundles access to Meta: Llama 3.2 11B Vision Instruct along with 300+ other AI models from Meta and other providers like OpenAI, Anthropic, Google, and more.

  1. Download the CoreAI app for iOS or Android, or use the Web App
  2. Select Meta: Llama 3.2 11B Vision Instruct from the model selector
  3. Start chatting, comparing, or creating with AI

More Meta Models

Meta: Llama Guard 4 12B

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it c
164K budget

Meta: Llama 4 Maverick

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128
1049K budget

Meta: Llama 4 Scout

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 10
328K budget

Llama Guard 3 8B

Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classi
131K budget

Meta: Llama 3.3 70B Instruct (free)

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama
66K budget

Meta: Llama 3.3 70B Instruct

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama
131K budget
