GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...
text+image+video->text

| Provider | Region / Tag | Context | Input/M | Output/M | Uptime 1d | Latency | Quant | Cache |
|---|---|---|---|---|---|---|---|---|
| Z.AI | z-ai/fp8 | 203K | $1.20 | $4.00 | 100.00% | - | fp8 | - |
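
The per-million-token prices in the Z.AI row translate into a per-request cost with simple arithmetic. The following is a minimal sketch, assuming the listed $1.20 input and $4.00 output rates apply uniformly to all tokens; the function name and the example token counts are hypothetical, and the listing does not say how image or video inputs are converted to tokens or whether cached tokens are discounted.

```python
# Prices in USD per million tokens, copied from the Z.AI row above.
INPUT_USD_PER_M = 1.20
OUTPUT_USD_PER_M = 4.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD at the listed rates.

    Assumption: every token is billed at the flat listed rate
    (no cache discount, no separate image or video pricing).
    """
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Hypothetical request: 50,000 input tokens and 2,000 output tokens.
cost = estimate_cost_usd(50_000, 2_000)
print(f"${cost:.4f}")
```

At these rates, output tokens cost a little over three times as much as input tokens, so long generations tend to dominate the bill more than long prompts do.
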

| Rank | Category | Variant | Trend | Elo | Votes |
|---|---|---|---|---|---|
| #14 | Humor | - | | 1258 | 231 |
| #23 | Creative Writing Vision | - | | 1262 | 466 |
| #28 | Chinese | - | | 1289 | 488 |
| #32 | English | - | | 1234 | 3,259 |
| #33 | Homework | - | | 1264 | 1,172 |
| #37 | Creative Writing | - | | 1227 | 466 |
| #37 | Diagram | - | | 1253 | 1,938 |
| #37 | Overall | - | | 1227 | 7,480 |
| #38 | OCR | - | | 1241 | 5,220 |