Gpt Oss 120b

gpt-oss-120b#

Providers / Cerebras / gpt-oss-120b

📋 Overview#

  • ID: gpt-oss-120b
  • Provider: Cerebras
  • Authors: OpenAI
  • Release Date: 2025-08-05
  • Open Weights: true
  • Context Window: 131k tokens
  • Max Output: 32k tokens

🔬 Technical Specifications#

Sampling Controls: Temperature Top-P

🎯 Capabilities#

Feature Overview#

Supports text generation and processing Supported input modalities Supported output modalities Temperature sampling control Nucleus sampling (top-p) Maximum token limit Stop sequences Response streaming

Input/Output Modalities#

DirectionTextImageAudioVideoPDF
Input
Output

Core Features#

Tool CallingTool DefinitionsTool ChoiceWeb SearchFile Attachments

Response Delivery#

StreamingStructured OutputJSON ModeFunction CallText Format

🎛️ Generation Controls#

Sampling & Decoding#

TemperatureTop-P
0.0-2.00.0-1.0

Length & Termination#

Max TokensStop Sequences
1-32k

💰 Pricing#

Pricing shown for Cerebras

Token Pricing#

InputOutputReasoningCache ReadCache Write
$0.25/1M$0.69/1M---

💰 Cost Calculator#

Calculate costs for common usage patterns:

Use CaseInputOutputTotal Cost
Quick chat (1K in, 500 out)1k tokens500 tokens$0.000595
Document summary (10K in, 1K out)10k tokens1k tokens$0.003190
RAG query (50K in, 2K out)50k tokens2k tokens$0.0139
Code generation (5K in, 10K out)5k tokens10k tokens$0.008150

Pricing Formula:

1Cost = (Input Tokens / 1M × $0.25) + (Output Tokens / 1M × $0.69)

📊 Example Costs#

Real-world usage examples and their costs:

Usage TierDaily VolumeMonthly TokensMonthly Cost
Personal (10 chats/day)10 chats675k$0.2677
Small Team (100 chats/day)100 chats9.0M$3.57
Enterprise (1000 chats/day)1000 chats135.0M$53.55

📋 Metadata#

Created: 0001-01-01 00:00:00 UTC

Last Updated: 2025-10-19 18:13:08 UTC



Last Updated: 2025-10-21 23:55:56 UTC | Generated by ModelWiki