ModelDen

Llama 3.1 Nemotron Nano 8B V1

Meta · 8B · Q4_K_M · ~4.8 GB
Released March 18, 2025

How good is it? — per-capability quality

Each number is the model’s score on its best public benchmarkfor that skill — named and linked below, never a blend. Scores compare cleanly when the benchmark matches; benchmarks are gameable, so we show the receipts rather than hide behind one number.

Coding85
MBPP · source · as of 2026-06
Math & logic47
AIME 2025 · source · as of 2026-06
General54
GPQA · source · as of 2026-06

Llama-3.1-Nemotron-Nano-8B-v1 is a large language model (LLM) which is a derivative of Meta Llama…

About this model

Developer
Meta
Family
Llama 3.1
License
Llama Community License
Context window
32,768 tokens
Will Llama 3.1 Nemotron Nano 8B V1 actually run on YOUR graphics card?

This page shows fit for a generic VRAM tier. Tell us your GPU for a personalized speed estimate and fit verdict.

Enter your graphics card →