Available models and capabilities are subject to change. If you require SLA guarantees, specific model availability, or long-term production usage, please contact us to discuss your needs. We’re also happy to work with you to add support for your desired model.

DeepSeek V4 Pro
deepseek-v4-proLong Context: The upstream model card describes one-million-token context support; this Tinfoil deployment is configured for an 800K-token context window.

GLM-5.1
glm-5-1
Kimi K2.6
kimi-k2-6Vision + Language: Supports text, image, and video inputs with native reasoning and tool calling for agentic workflows.

Gemma 4 31B
gemma4-31bVision + Language: Processes text and image inputs. Features step-by-step reasoning with configurable thinking mode.

GPT-OSS 120B
gpt-oss-120b
Llama 3.3 70B
llama3-3-70bStructured Outputs: All chat models support structured outputs for reliable data extraction and API integration. Full JSON schema validation available in Python, Node, and Go SDKs. See the Structured Outputs Guide for implementation examples.



