ai/llama3.2

Verified Publisher

By Docker

Updated 9 months ago

Solid LLaMA 3 update, reliable for coding, chat, and Q&A tasks

Model
23

100K+

ai/llama3.2 repository overview

Llama 3.2 Instruct

logo

Llama 3.2 introduced lightweight 1B and 3B models at bfloat16 (BF16) precision, later adding quantized versions. The quantized models are significantly faster, with a much lower memory footprint and reduced power consumption, while maintaining nearly the same accuracy as their BF16 counterparts.

Intended uses

Llama 3.2 instruct models are designed for:

  • AI assistance on edge devices, Running chatbots and virtual assistants with minimal latency on low-power * hardware.
  • Code assistance , Writing, debugging, and optimizing code on mobile or embedded systems.
  • Content generation ,Drafting emails, summaries, and creative content on lightweight devices.
  • Low-power AI for smart gadgets, Enhancing voice assistants on wearables and IoT devices.
  • Edge-based data processing, Summarizing and analyzing data locally for security and efficiency.

Characteristics

AttributeDetails
ProviderMeta
ArchitectureLlama
Cutoff dateDecember 2023
LanguagesEnglish, German, French, Italian, Portuguese, Hindi, Spanish, and Thai
Tool calling
Input modalitiesText
Output modalitiesText, Code
LicenseLlama 3.2 Community License

Available model variants

Model variantParametersQuantizationContext windowVRAM¹Size
ai/llama3.2:latest

ai/llama3.2:3B-Q4_K_M
3BIQ2_XXS/Q4_K_M131K tokens2.77 GiB1.87 GB
ai/llama3.2:1B-Q4_01BQ4_0131K tokens1.35 GiB727.75 MB
ai/llama3.2:1B-Q8_01BQ8_0131K tokens1.87 GiB1.22 GB
ai/llama3.2:1B-F161BF16131K tokens2.95 GiB2.30 GB
ai/llama3.2:3B-Q4_03BQ4_0131K tokens2.68 GiB1.78 GB
ai/llama3.2:3B-Q4_K_M3BIQ2_XXS/Q4_K_M131K tokens2.77 GiB1.87 GB
ai/llama3.2:3B-F163BF16131K tokens6.89 GiB5.98 GB

¹: VRAM estimated based on model characteristics.

latest3B-Q4_K_M

Use this AI model with Docker Model Runner

First, pull the model:

docker model pull ai/llama3.2

Then run the model:

docker model run ai/llama3.2

For more information on Docker Model Runner, explore the documentation.

Benchmark performance

CapabilityBenchmarkLlama 3.2 1B
GeneralMMLU49.3
Re-writingOpen-rewrite eval41.6
SummarizationTLDR9+ (test)16.8
Instruct. followingIFEval59.5
MathGSM8K (CoT)44.4
MATH (CoT)30.6
ReasoningARC-C59.4
GPQA27.2
Hellaswag41.2
Tool UseBFCL V225.7
Nexus13.5
Long ContextInfiniteBench/En.QA20.3
InfiniteBench/En.MC38.0
NIH/Multi-needle75.0
MultilingualMGSM (CoT)24.5

Tag summary

Content type

Model

Digest

sha256:da80a8418

Size

1.8 GB

Last updated

9 months ago

docker model pull ai/llama3.2:3B-Q4_0

This week's pulls

Pulls:

11,912

Last week