DeepSeek R1 Distill Llama 8B
DeepSeek · 8.0B · Q4_K_M · ~4.9 GB
Released January 20, 2025
Reasons before answering. Expect a delay while it generates hidden thinking tokens before any text appears.
How good is it? — per-capability quality
Each number is the model’s score on its best public benchmarkfor that skill — named and linked below, never a blend. Scores compare cleanly when the benchmark matches; benchmarks are gameable, so we show the receipts rather than hide behind one number.
DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters…
About this model
- Developer
- DeepSeek
- Family
- DeepSeek-R1 (distill)
- Context window
- 32,768 tokens
Will DeepSeek R1 Distill Llama 8B actually run on YOUR graphics card?
This page shows fit for a generic VRAM tier. Tell us your GPU for a personalized speed estimate and fit verdict.
Enter your graphics card →