qwen3-235b-a22b

Public

The 235B parameter (MoE) version of the Qwen3 model family.

9.3K Downloads

6 stars

Capabilities

Reasoning

Minimum system memory

134GB

Tags

235B
qwen3moe

Last updated

Updated on May 24by
lmmy's profile picture
lmmy

README

Qwen3 235B A22B

Supports a context length of up to 131,072 tokens with YaRN (default 32k)

Supports /no_think to disable reasoning, just add it at the end of your prompt

MoE model with 22B activated params, 128 total and 8 active experts

Supports both thinking and non-thinking modes withe enhanced reasoning in both for significantly enhanced mathematics, coding, and commonsense

Excels at creative writing, role-playing, multi-turn dialogues, and instruction following

Advanced agent capabilities and support for over 100 languages and dialects

Custom Fields

Special features defined by the model author

Enable Thinking

: boolean

(default=true)

Controls whether the model will think before replying

Parameters

Custom configuration options included with this model

Min P Sampling
0
Top K Sampling
20

Sources

The underlying model files this model uses