Groq#
Ultra-fast inference with custom LPU hardware, offering low-latency model serving.
Provider Information#
| Field | Value |
|---|---|
| Provider ID | groq |
| Total Models | 28 |
| Authentication | API Key Required |
| Environment Variable | GROQ_API_KEY |
| Status Page | https://status.groq.com |
🔗 API Endpoints#
Documentation: https://console.groq.com/docs/models
Models API: https://api.groq.com/openai/v1/models
Chat Completions: https://api.groq.com/openai/v1/chat/completions
Health API: https://groqstatus.com/api/v2/summary.json
🔒 Privacy & Data Handling#
Privacy Policy: https://groq.com/privacy-policy/
Terms of Service: https://groq.com/terms-of-use/
Retains User Data: No
Trains on User Data: No
⏱️ Data Retention Policy#
Policy Type: No Retention
Retention Duration: Immediate deletion
Details: Input prompts and context are not retained; data is processed for immediate response generation and then discarded
🛡️ Content Moderation#
Requires Moderation: No
Content Moderated: Yes
Moderated by: Groq
🏢 Headquarters#
Mountain View, CA, USA
Available Models#
GPT#
| Model | Context | Input | Output | Features |
|---|---|---|---|---|
| openai/gpt-oss-120b | 131.1k | $0.15 | $0.75 | 📝 ⚡ |
| openai/gpt-oss-20b | 131.1k | $0.10 | $0.50 | 📝 ⚡ |
Gemma#
| Model | Context | Input | Output | Features |
|---|---|---|---|---|
| gemma2-9b-it | 8.2k | $0.20 | $0.20 | 📝 🔧 ⚡ |
Llama#
| Model | Context | Input | Output | Features |
|---|---|---|---|---|
| Llama 3 70B | 8.2k | $0.59 | $0.79 | — |
| Llama 3 8B | 8.2k | $0.05 | $0.08 | — |
| Llama Guard 3 8B | 8.2k | $0.20 | $0.20 | — |
| deepseek-r1-distill-llama-70b | 131.1k | $0.75 | $0.99 | 📝 🔧 ⚡ |
| llama-3.1-8b-instant | 131.1k | $0.05 | $0.08 | 📝 ⚡ |
| llama-3.3-70b-versatile | 131.1k | $0.59 | $0.79 | 📝 ⚡ |
| meta-llama/llama-4-maverick-17b-128e-instruct | 131.1k | $0.20 | $0.60 | 📝 ⚡ |
| meta-llama/llama-4-scout-17b-16e-instruct | 131.1k | $0.11 | $0.34 | 📝 ⚡ |
| meta-llama/llama-guard-4-12b | 131.1k | $0.20 | $0.20 | 📝 ⚡ |
| meta-llama/llama-prompt-guard-2-22m | 512 | N/A | N/A | 📝 ⚡ |
| meta-llama/llama-prompt-guard-2-86m | 512 | N/A | N/A | 📝 ⚡ |
Mistral#
| Model | Context | Input | Output | Features |
|---|---|---|---|---|
| Mistral Saba 24B | 32.8k | $0.79 | $0.79 | — |
Other#
| Model | Context | Input | Output | Features |
|---|---|---|---|---|
| allam-2-7b | 4.1k | N/A | N/A | 📝 ⚡ |
| compound-beta | 131.1k | N/A | N/A | 📝 ⚡ |
| compound-beta-mini | 131.1k | N/A | N/A | 📝 ⚡ |
| groq/compound | 131.1k | N/A | N/A | 📝 ⚡ |
| groq/compound-mini | 131.1k | N/A | N/A | 📝 ⚡ |
| moonshotai/kimi-k2-instruct | 131.1k | $1.00 | $3.00 | 📝 ⚡ |
| moonshotai/kimi-k2-instruct-0905 | 262.1k | $1.00 | $3.00 | 📝 ⚡ |
| playai-tts | 8.2k | N/A | N/A | 📝 ⚡ |
| playai-tts-arabic | 8.2k | N/A | N/A | 📝 ⚡ |
Qwen#
| Model | Context | Input | Output | Features |
|---|---|---|---|---|
| Qwen QwQ 32B | 131.1k | $0.29 | $0.39 | — |
| qwen/qwen3-32b | 131.1k | $0.29 | $0.59 | 📝 ⚡ |
Whisper#
| Model | Context | Input | Output | Features |
|---|---|---|---|---|
| whisper-large-v3 | 448 | N/A | N/A | 📝 ⚡ |
| whisper-large-v3-turbo | 448 | $0.00 | $0.00 | 📝 ⚡ |
Configuration#
Authentication#
This provider requires an API key. Set it as an environment variable:
1export GROQ_API_KEY="your-api-key-here"Using with Model.Wiki#
1# List all models from this provider
2starmap list models --provider groq
3
4# Fetch latest models from provider API
5starmap fetch --provider groq
6
7# Sync provider data
8starmap sync --provider groqSee Also#
← Back to Providers | ← Back to Home | Generated by ModelWiki