ModelDen

DeepSeek R1 Distill Qwen 14B

DeepSeek · 14.8B · Q4_K_M · ~9.0 GB
Released January 20, 2025
Reasons before answering. Expect a delay while it generates hidden thinking tokens before any text appears.
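
The ~9.0 GB figure above is roughly what the parameter count and quantization level imply. A minimal sketch of that arithmetic, assuming Q4_K_M averages about 4.85 bits per weight (a commonly quoted approximation, not a number from this page) and that GB means 10^9 bytes:

```python
def estimate_weight_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in decimal gigabytes."""
    total_bits = params_billion * 1e9 * bits_per_weight
    return total_bits / 8 / 1e9  # bits -> bytes -> GB


# Assumption: average bits per weight for Q4_K_M; real files vary by tensor mix.
ASSUMED_Q4_K_M_BITS = 4.85

size_gb = estimate_weight_size_gb(14.8, ASSUMED_Q4_K_M_BITS)
print(f"{size_gb:.1f} GB")
```

This covers the weights only; the memory needed at run time is higher once the context cache and runtime buffers are included.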

How good is it? — per-capability quality

Each number is the model’s score on its best public benchmark for that skill, named and linked below, never a blend. Scores compare cleanly only when the benchmark matches. Benchmarks are gameable, so we show the receipts rather than hide behind one number.

Coding: 53
LiveCodeBench · source · as of 2026-06
Math & logic: 80
AIME 2024 · source · as of 2026-06
General: 59
GPQA · source · as of 2026-06

DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters…

About this model

Developer: DeepSeek
Family: DeepSeek-R1 (distill)
Context window: 32,768 tokens

Will DeepSeek R1 Distill Qwen 14B actually run on YOUR graphics card?

This page shows fit for a generic VRAM tier. Tell us your GPU for a personalized speed estimate and fit verdict.
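
As a rough illustration of what a fit check involves, the sketch below adds the weight size to an estimated key/value cache and a fixed overhead, then compares the total to available VRAM. This is not the site's method. The layer count, KV-head count, and head dimension are illustrative assumptions (check the model's own configuration), and the 1 GB overhead is a placeholder.

```python
def kv_cache_gb(n_layers: int, n_kv_heads: int, head_dim: int,
                context_tokens: int, bytes_per_value: int = 2) -> float:
    """Estimate key/value cache size in decimal GB for a given context length."""
    # Factor of 2: one tensor for keys and one for values in every layer.
    per_token_bytes = 2 * n_layers * n_kv_heads * head_dim * bytes_per_value
    return per_token_bytes * context_tokens / 1e9


def fits_in_vram(vram_gb: float, weights_gb: float, kv_gb: float,
                 overhead_gb: float = 1.0) -> bool:
    """True if weights + KV cache + assumed runtime overhead fit in VRAM."""
    return weights_gb + kv_gb + overhead_gb <= vram_gb


# Assumed architecture values, for illustration only.
kv = kv_cache_gb(n_layers=48, n_kv_heads=8, head_dim=128, context_tokens=8192)

for vram in (8.0, 12.0, 16.0):
    verdict = "fits" if fits_in_vram(vram, weights_gb=9.0, kv_gb=kv) else "does not fit"
    print(f"{vram:.0f} GB card: {verdict}")
```

Longer contexts grow the cache linearly, so a card that fits the model at 8,192 tokens may not fit it at the full 32,768-token window.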
