Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
Best overall rank
-
Across all subsets · overall
Total arena votes
0
Across 0 (subset, category) entries
Providers on OR
0
None yet
First seen
-
Awaiting first snapshot
Trend
No snapshot history yet - trends appear after the first daily refresh.