All models listed below are accessible through any of our SDKs. See the SDK pages for language-specific usage examples.

Available Models

Below is a list of all models currently supported on Tinfoil, including their model IDs and types.
Available models and capabilities are subject to change. If you require SLA guarantees, specific model availability, or long-term production usage, please contact us to discuss your needs. We’re also happy to work with you to add support for your desired model.

Chat Models

Description: Chat models support conversational AI capabilities through the standard chat completions API. All chat models follow the OpenAI chat completion format.
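
For illustration, here is a minimal chat completion sketch using the OpenAI Python SDK against a Tinfoil endpoint. The base URL and API key below are placeholders, and the model ID is just one of the chat models listed in the cards that follow; see the SDK pages for the exact client setup in each language.

```python
from openai import OpenAI

# Placeholder endpoint and key; substitute your Tinfoil inference URL and API key.
client = OpenAI(
    base_url="https://<your-tinfoil-endpoint>/v1",
    api_key="<TINFOIL_API_KEY>",
)

# Any chat model ID from the cards below works here, e.g. deepseek-r1-0528.
response = client.chat.completions.create(
    model="deepseek-r1-0528",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize the Pythagorean theorem in one sentence."},
    ],
)
print(response.choices[0].message.content)
```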

DeepSeek
DeepSeek R1
deepseek-r1-0528
Parameters: 671B
Context: 128K tokens
Strengths: State-of-the-art reasoning, advanced mathematical capabilities, enhanced function calling, reduced hallucination rate
Structured Outputs: Structured response formatting support
Best for: Complex reasoning tasks, mathematical problem-solving, advanced coding, and tasks requiring deep analytical thinking

Moonshot
Kimi K2 Thinking
kimi-k2-thinking
Context: 256K tokens
Strengths: Deep multi-step reasoning, stable long-horizon tool orchestration, advanced agentic coding, web browsing and research, native INT4 quantization for faster inference
Structured Outputs: Structured response formatting support
Best for: Complex agentic workflows, multi-step coding and debugging tasks, web research requiring multiple tool calls, long-form writing
🤖 Thinking Agent: End-to-end trained for interleaved reasoning and function calling. Maintains stable performance across extended tool orchestration sequences.

Moonshot
Kimi K2.5
kimi-k2-5
Parameters: 1T total (32B activated)
Context: 256K tokens
Strengths: Unified vision and text processing, image and video analysis, generates code from screenshots and mockups, parallel task execution across specialized sub-agents
Structured Outputs: Structured response formatting support
Best for: Building applications that process visual inputs, converting designs to code, video comprehension, orchestrating complex workflows with multiple parallel agents
🎨 Vision + Language: Jointly trained on images, video, and text. Handles visual reasoning tasks and can spawn coordinated sub-agents for complex problems.

OpenAI
GPT-OSS 120B
gpt-oss-120b
Parameters: 117B
Context: 128K tokens
Strengths: Powerful reasoning, configurable reasoning effort levels, full chain-of-thought access, native agentic abilities including function calling, web browsing, and Python code execution
Structured Outputs: Structured response formatting support
Best for: Production use cases requiring high reasoning capabilities, agentic operations, and specialized applications

OpenAI
GPT-OSS Safeguard 120B
gpt-oss-safeguard-120b
Parameters: 117B (5.1B active)
Context: 128K tokens
Strengths: Safety reasoning, bring-your-own-policy flexibility, full access to reasoning chains for debugging, configurable reasoning effort levels
Structured Outputs: Structured response formatting support
Best for: Content moderation, policy enforcement, LLM guardrails, and Trust & Safety labeling workflows
Safety Model: Classifies text content based on custom safety policies you provide.

Llama
Llama 3.3 70B
llama3-3-70b
Context: 128K tokens
Strengths: Multilingual understanding, dialogue optimization, strong reasoning
Structured Outputs: Structured response formatting support
Best for: Conversational AI applications and complex dialogue systems
Structured Outputs: All chat models support structured outputs for reliable data extraction and API integration. Full JSON schema validation available in Python, Node, and Go SDKs. See the Structured Outputs Guide for implementation examples.
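
As a sketch of a structured output request, assuming the endpoint accepts the OpenAI response_format JSON-schema parameter; the schema, model ID, and field names here are illustrative, and the Structured Outputs Guide is the authoritative reference.

```python
from openai import OpenAI

client = OpenAI(base_url="https://<your-tinfoil-endpoint>/v1", api_key="<TINFOIL_API_KEY>")

# Illustrative schema; see the Structured Outputs Guide for full validation options.
schema = {
    "name": "contact_card",
    "schema": {
        "type": "object",
        "properties": {
            "name": {"type": "string"},
            "email": {"type": "string"},
        },
        "required": ["name", "email"],
        "additionalProperties": False,
    },
}

response = client.chat.completions.create(
    model="llama3-3-70b",
    messages=[{"role": "user", "content": "Extract the contact: 'Reach Ada at ada@example.com.'"}],
    response_format={"type": "json_schema", "json_schema": schema},
)
print(response.choices[0].message.content)  # JSON string matching the schema
```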

Vision Models

Description: Vision models understand images and video for visual tasks including image analysis, video understanding, OCR, and screenshot-to-code generation.

Qwen
Qwen3-VL 30B
qwen3-vl-30b
Parameters: 30B (3B active)
Context: 256K tokens
Strengths: Advanced vision-language understanding, video analysis, GUI interaction, screenshot-to-code generation, spatial understanding, multilingual OCR
OCR Languages: Supports 32 languages
Best for: Image and video analysis, screenshot-to-code generation, OCR tasks, GUI automation, and vision-text understanding
📸 Multimodal: Processes both images and video. Supports long videos and documents with up to 256K context. See Image Processing Guide for usage examples.
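
A hedged sketch of an image request, assuming images are passed as OpenAI-style image_url content parts (a URL or a base64 data URI); the endpoint, key, and image URL are placeholders. See the Image Processing Guide for the supported input formats.

```python
from openai import OpenAI

client = OpenAI(base_url="https://<your-tinfoil-endpoint>/v1", api_key="<TINFOIL_API_KEY>")

response = client.chat.completions.create(
    model="qwen3-vl-30b",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe the UI shown in this screenshot."},
                # Placeholder URL; a base64 data URI also works with this content type.
                {"type": "image_url", "image_url": {"url": "https://example.com/screenshot.png"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```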

Audio Models

Description: Audio models provide speech-to-text transcription and text-to-speech synthesis. They support both audio file transcription and high-quality speech generation.

OpenAI
Whisper Large V3 Turbo
whisper-large-v3-turbo
Capabilities: Speech-to-text transcription
Strengths: Fast processing, high accuracy, multiple language support
Best for: Audio transcription, voice-to-text applications
Audio Format: Supports .mp3 and .wav files
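
A minimal transcription sketch, assuming the standard OpenAI audio transcription route; the endpoint, key, and file name are placeholders.

```python
from openai import OpenAI

client = OpenAI(base_url="https://<your-tinfoil-endpoint>/v1", api_key="<TINFOIL_API_KEY>")

# .mp3 and .wav files are supported, per the card above.
with open("meeting.mp3", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="whisper-large-v3-turbo",
        file=audio_file,
    )
print(transcript.text)
```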

Mistral
Voxtral Small 24B
voxtral-small-24b
Parameters: 24B
Capabilities: Speech-to-text transcription, audio Q&A, summarization, translation, voice-triggered function calling
Audio Duration: Up to 30 minutes (transcription) or 40 minutes (understanding)
Languages: English, Spanish, French, Portuguese, Hindi, German, Dutch, Italian
Best for: Speech transcription with automatic language detection, answering questions from spoken input, generating summaries from audio, and triggering functions from voice commands
Audio + Text: Built on Mistral Small 3.1 foundation, combining speech processing with strong text capabilities including function calling from voice commands.
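
A sketch of audio Q&A with Voxtral, assuming the endpoint accepts OpenAI-style input_audio content parts in chat completions; that shape is an assumption, so check the SDK documentation for the exact audio input format. The endpoint, key, and file name are placeholders.

```python
import base64

from openai import OpenAI

client = OpenAI(base_url="https://<your-tinfoil-endpoint>/v1", api_key="<TINFOIL_API_KEY>")

# Encode a local recording as base64 for the input_audio content part.
with open("question.wav", "rb") as f:
    audio_b64 = base64.b64encode(f.read()).decode()

response = client.chat.completions.create(
    model="voxtral-small-24b",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Answer the question asked in this recording."},
                {"type": "input_audio", "input_audio": {"data": audio_b64, "format": "wav"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```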

Embedding Models

Description: Embedding models convert text into high-dimensional vectors for semantic search, similarity comparisons, and other vector-based operations.

Nomic
Nomic Embed Text v1.5
nomic-embed-text
Dimensions: 768
Strengths: Multimodal embedding model
Best for: Semantic search, document similarity, clustering
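
A short embeddings sketch using the standard OpenAI embeddings route; the endpoint and key are placeholders, and the input strings are arbitrary examples.

```python
from openai import OpenAI

client = OpenAI(base_url="https://<your-tinfoil-endpoint>/v1", api_key="<TINFOIL_API_KEY>")

response = client.embeddings.create(
    model="nomic-embed-text",
    input=["How do I rotate an API key?", "Instructions for rotating API keys"],
)
vectors = [item.embedding for item in response.data]
print(len(vectors), len(vectors[0]))  # 2 vectors, 768 dimensions each
```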

Document Processing Models

Description: Document processing models handle file conversion, text extraction, and document parsing operations.

Docling
Docling Document Processing
docling
Capabilities: Document processing and conversion service
Strengths: PDF processing, Word document parsing, text extraction, format conversion with high accuracy
Best for: Document upload, processing, conversion, and text extraction workflows
📄 File Support: Supports PDF, Word documents, and other common document formats. See Document Processing Guide for usage examples.

Using Models

To use any of these models, you’ll need:
  1. API Key: Get your key from the Tinfoil dashboard
  2. SDK: Install the SDK for your preferred language
  3. Model ID: Use the model ID from the cards above in your API requests
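A minimal sketch tying these three pieces together, assuming an OpenAI-compatible client; the base URL is a placeholder and the environment variable name is only a convention.

```python
import os

from openai import OpenAI

# 1. API key from the Tinfoil dashboard (here read from an environment variable).
# 2. SDK: this sketch uses the OpenAI Python package; see the SDK pages for Tinfoil's own clients.
# 3. Model ID: any ID from the cards above.
client = OpenAI(
    base_url="https://<your-tinfoil-endpoint>/v1",
    api_key=os.environ["TINFOIL_API_KEY"],
)

response = client.chat.completions.create(
    model="llama3-3-70b",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
```
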
For detailed usage examples and code samples, see the SDK documentation: