Documentation Index
Fetch the complete documentation index at: https://docs.tinfoil.sh/llms.txt
Use this file to discover all available pages before exploring further.
Safety models classify and moderate content based on custom policies you define.
GPT-OSS Safeguard 120B
gpt-oss-safeguard-120b Parameters: 117B (5.1B active)Context: 131K tokensStrengths: Safety reasoning, bring-your-own-policy flexibility, full access to reasoning chains for debugging, configurable reasoning effort levelsStructured Outputs: Structured response formatting supportBest for: Content moderation, policy enforcement, LLM guardrails, and Trust & Safety labeling workflowsConfiguration repo: tinfoilsh/confidential-gpt-oss-safeguard-120bSafety Model: Classifies text content based on custom safety policies you provide.