Model

deepseek-r1-distill-llama-8b

Public

DeepSeek R1 Distill Llama 8B by deepseek-ai

Use cases

Minimum system memory

5GB

Tags

8B
llama

README

DeepSeek R1 Distill Llama 8B by deepseek-ai

Supports context length of 128k.

Distilled from DeepSeek's R1 reasoning model.

Tuned for reasoning and chain-of-thought.

Sources

The underlying model files this model uses