Meta: Llama 3.2 11B Vision Instruct, Meta — route, pricing & source
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
OpenRoutermediumLast checked: May 14, 2026
Input
$0.2450 / 1M
/ 1M tokens
Output
$0.2450 / 1M
/ 1M tokens
Cached input
Not available
Not available
Cache write
Not available
Not available
Context and limits
Context window
131.1K
Max output
16,384
Input modalities
text, image
Output modalities
text
Weight access
Closed
Availability
Unknown
Capability flags
VisionJSONLong context
From source metadata + verification fields — confirm provider docs before production.
Benchmarks
No safely merged benchmark result for this model yet.
Access and license
Official API
No
Open weights
No
Self-hostable
No
License
Closed · API onlyNo public weights: use through a vendor’s cloud API or product. “Proprietary” means the creator does not distribute the trained model weights for self-hosting.
Example costs
Scenario: 1K input + 300 output tokens / request
10,000 requests
$3.19
100,000 requests
$31.85
1,000,000 requests
$318.50
Cheaper alternatives
Lower-cost models for the same scenario
