/Models/Meta: Llama 3.2 11B Vision Instruct
Meta

Meta: Llama 3.2 11B Vision Instruct

131k context
Lowest Price
$0.05
per 1M tokens
Providers
1
Available
Context
131k
tokens

Price Comparison

ProviderInput / OutputLatencyStatus
OpenRouterOpenRouterLowest
$0.05/$0.05
...
Verified

About This Model

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and visual question answering, bridging the gap between language generation and visual reasoning. Pre-trained on a massive da...

Quick Start