Llama 3.1 Nemotron Nano 8B V1

Meta · 8B · Q4_K_M · ~4.8 GB

Released March 18, 2025

How good is it? — per-capability quality

Each number is the model’s score on its best public benchmarkfor that skill — named and linked below, never a blend. Scores compare cleanly when the benchmark matches; benchmarks are gameable, so we show the receipts rather than hide behind one number.

Coding85

MBPP · source · as of 2026-06

Math & logic47

AIME 2025 · source · as of 2026-06

General54

GPQA · source · as of 2026-06

Llama-3.1-Nemotron-Nano-8B-v1 is a large language model (LLM) which is a derivative of Meta Llama…

About this model

Developer: Meta
Family: Llama 3.1
License: Llama Community License
Context window: 32,768 tokens

Developer site ↗

Will Llama 3.1 Nemotron Nano 8B V1 actually run on YOUR graphics card?

This page shows fit for a generic VRAM tier. Tell us your GPU for a personalized speed estimate and fit verdict.

Enter your graphics card →