The 8B parameter version of the Qwen3 model family.

202.9K Downloads

11 stars

Capabilities

Reasoning

Minimum system memory

5GB

Tags

8B
qwen3

Last updated

Updated on May 24 by lmmy

README

Qwen3 8B by qwen

Supports a context length of up to 131,072 tokens with YaRN (default 32k)

Supports /no_think to disable reasoning: add it at the end of your prompt

Supports both thinking and non-thinking modes, with enhanced reasoning in both for significantly improved mathematics, coding, and commonsense reasoning

Excels at creative writing, role-playing, multi-turn dialogues, and instruction following

Advanced agent capabilities and support for over 100 languages and dialects
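
Two of the points above can be illustrated with a short sketch: the `/no_think` suffix and the ratio between the default and YaRN-extended context lengths. The helper names `with_no_think` and `yarn_scaling_factor` are hypothetical and not part of any Qwen or LM Studio API, and "32k" is assumed here to mean 32,768 tokens.

```python
NO_THINK_TAG = "/no_think"

# Assumption: the "32k" default on this page means 32,768 tokens.
DEFAULT_CONTEXT = 32_768
YARN_CONTEXT = 131_072  # maximum context with YaRN, as stated above


def with_no_think(prompt: str) -> str:
    """Append /no_think to the end of a prompt, unless it is already there."""
    stripped = prompt.rstrip()
    if stripped.endswith(NO_THINK_TAG):
        return stripped
    return f"{stripped} {NO_THINK_TAG}"


def yarn_scaling_factor(default: int = DEFAULT_CONTEXT,
                        extended: int = YARN_CONTEXT) -> float:
    """Ratio by which the extended context exceeds the default one."""
    return extended / default


if __name__ == "__main__":
    print(with_no_think("Summarize this paragraph in one sentence."))
    print(yarn_scaling_factor())
```

Whether a given runtime expects this ratio as an explicit setting, and under which name, depends on the runtime and is not specified on this page.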

Custom Fields

Special features defined by the model author

Enable Thinking: boolean (default=true)

Controls whether the model will think before replying

Parameters

Custom configuration options included with this model

Min P Sampling: 0
Top K Sampling: 20
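
To make these two settings concrete, here is a minimal sketch of how top-k and min-p filtering are commonly defined, written in plain Python. It is an illustration under stated assumptions, not the code any particular runtime uses: the function name `filter_token_probs` is hypothetical, and the order in which a real sampler applies the two filters may differ.

```python
import math


def filter_token_probs(logits, top_k=20, min_p=0.0):
    """Return {token_index: probability} after top-k and min-p filtering.

    Assumed definitions (common, but not confirmed by this page):
    - top-k keeps the k most probable tokens (top_k <= 0 keeps all);
    - min-p drops tokens whose probability is below min_p times the
      probability of the most likely token (min_p = 0 drops nothing).
    """
    # Numerically stable softmax over the raw logits.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Token indices sorted from most to least probable.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept = order[:top_k] if top_k > 0 else order

    # Min-p threshold is relative to the single most likely token.
    threshold = min_p * probs[order[0]]
    kept = [i for i in kept if probs[i] >= threshold]

    # Renormalize so the surviving probabilities sum to 1.
    kept_total = sum(probs[i] for i in kept)
    return {i: probs[i] / kept_total for i in kept}


if __name__ == "__main__":
    example_logits = [float(i) for i in range(30)]
    survivors = filter_token_probs(example_logits, top_k=20, min_p=0.0)
    print(len(survivors))
```

With the values listed above (Top K 20, Min P 0), only the top-k step removes candidates under these definitions; the min-p step is effectively disabled.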

Sources

The underlying model files this model uses