DeepSeek R1 Distill Qwen 14B

DeepSeek · 14.8B · Q4_K_M · ~9.0 GB

Released January 20, 2025

Reasons before answering. Expect a delay while it generates hidden thinking tokens before any text appears.
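If you call the model directly and want to show only the final answer, the thinking portion can be separated in post-processing. The sketch below assumes the raw completion marks its reasoning with `<think>` and `</think>` tags, as DeepSeek-R1 family models commonly do. `split_reasoning` is a hypothetical helper name, and some runtimes strip these tags for you, so treat this as illustrative only.

```python
def split_reasoning(raw: str) -> tuple[str, str]:
    """Split a raw completion into (reasoning, answer).

    Assumption: reasoning ends at a literal "</think>" tag. The opening
    "<think>" tag is optional, because some chat templates place it in the
    prompt rather than in the generated text.
    """
    head, sep, tail = raw.partition("</think>")
    if not sep:
        # No closing tag found: treat the whole completion as the answer.
        return "", raw.strip()
    reasoning = head.replace("<think>", "", 1).strip()
    return reasoning, tail.strip()


if __name__ == "__main__":
    raw = "<think>2 + 2 is 4.</think>\nThe answer is 4."
    reasoning, answer = split_reasoning(raw)
    print(answer)
```

When streaming, the same idea applies: nothing user-visible arrives until the closing tag has been generated, which is the delay described above.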

How good is it? — per-capability quality

Each number is the model’s score on its best public benchmark for that skill, named and linked below, never a blend. Scores compare cleanly only when the benchmark matches. Benchmarks are gameable, so we show the receipts rather than hide behind one number.

Coding: 53

LiveCodeBench · source · as of 2026-06

Math & logic: 80

AIME 2024 · source · as of 2026-06

General: 59

GPQA · source · as of 2026-06

DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters…

About this model

Developer: DeepSeek
Family: DeepSeek-R1 (distill)
Context window: 32,768 tokens

Hugging Face ↗ · Developer site ↗

Will DeepSeek R1 Distill Qwen 14B actually run on YOUR graphics card?

This page shows fit for a generic VRAM tier. Tell us your GPU for a personalized speed estimate and fit verdict.
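For a rough do-it-yourself check, the ~9.0 GB figure can be reproduced from the parameter count and the quantization level. The sketch below assumes Q4_K_M averages about 4.85 bits per weight (an approximation, not a figure from this page) and adds a placeholder `overhead_gb` for the KV cache and runtime buffers, which is also an assumption and grows with context length. The function names are hypothetical.

```python
def weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GB (1 GB = 1e9 bytes)."""
    # params_billion * 1e9 weights * bits / 8 bits-per-byte / 1e9 bytes-per-GB
    return params_billion * bits_per_weight / 8


def fits(
    vram_gb: float,
    params_billion: float = 14.8,   # from this page
    bits_per_weight: float = 4.85,  # assumed average for Q4_K_M
    overhead_gb: float = 1.5,       # assumed KV cache + runtime buffers
) -> tuple[bool, float]:
    """Return (does_it_fit, estimated_gb_needed) for a given VRAM size."""
    needed = weights_gb(params_billion, bits_per_weight) + overhead_gb
    return needed <= vram_gb, needed


if __name__ == "__main__":
    print(f"weights: {weights_gb(14.8, 4.85):.2f} GB")
    for vram in (8.0, 12.0, 16.0):
        ok, needed = fits(vram)
        print(f"{vram:.0f} GB card: needs about {needed:.1f} GB, fits={ok}")
```

Under these assumptions the weights alone come to just under 9 GB, so an 8 GB card would need to offload part of the model while a 12 GB card has some headroom at short context lengths. Actual speed and fit depend on the runtime and on how much of the 32,768-token context is used.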
