
Gemma 4 31B
gemma4-31bMultimodal: Supports variable aspect ratios and configurable image token budgets for balancing speed and detail. See Image Processing Guide for usage examples.

Qwen3-VL 30B
qwen3-vl-30bMultimodal: Processes images with up to 256K context for long documents. See Image Processing Guide for usage examples.

Kimi K2.5
kimi-k2-5Vision + Language: Jointly trained on images and text. See Image Processing Guide for usage examples.


