8.7K Downloads

qwen/
qwen3-4b-thinking-2507
4B
qwen3moe

Updated thinking version of Qwen3 4B featuring continued scaling of thinking capability, improving both the quality and depth of reasoning

Tool use

Reasoning

Last Updated2 days ago
README

Qwen3 4B Thinking 2507 by qwen

Updated thinking version of Qwen3-4B featuring continued scaling of thinking capability, improving both the quality and depth of reasoning. Qwen3-4B-Thinking-2507 includes the following key enhancements:

Significantly improved performance on reasoning tasks, including logical reasoning, mathematics, science, coding, and academic benchmarks that typically require human expertise. Markedly better general capabilities, such as instruction following, tool usage, text generation, and alignment with human preferences. Enhanced 256K long-context understanding capabilities.

Supports a context length of up to 262,144 tokens.

Note: This model supports only thinking mode. Specifying enable_thinking=True is not required.

sources

The underlying model files this model uses

When you download this model, LM Studio picks the source that will best suit your machine (you can override this)

config

Custom configuration options included with this model

Min P Sampling
0
Repeat Penalty
Disabled
Temperature
0.6
Top K Sampling
20
Top P Sampling
0.95