deepseek-r1-distill-llama-70b

Authors / Meta / deepseek-r1-distill-llama-70b

deepseek-r1-distill-llama-70b#

Developed by

Meta

📋 Technical Specifications#

SpecificationValue
Model IDdeepseek-r1-distill-llama-70b
Context Window131k tokens
Max Output8k tokens
Release Date2025-01-20
Open Weightstrue

🎯 Capabilities#

Supports text generation and processing Supported input modalities Supported output modalities Accepts tool definitions in requests Supports tool choice strategies (auto/none/required) Temperature sampling control Nucleus sampling (top-p) Maximum token limit Stop sequences Frequency penalty Presence penalty Deterministic seeding JSON schema validation Response streaming

🌐 Provider Availability#

This model is available through the following providers with potential variations:

ProviderContextPricing (Input/Output)Notes
Groq131.1k$0.75 / $0.99

Other Models by This Author#

  • Codellama 7b Hf
  • compound-beta
  • compound-beta-mini
  • Faster R Cnn
  • groq/compound
  • …and 32 more

Last Updated: 2025-10-21 23:55:57 UTC | Generated by ModelWiki