README

Qwen3 235B A22B Thinking 2507

An enhanced version of Qwen3-235B-A22B with significantly improved thinking and reasoning capabilities, reaching state-of-the-art performance among open-source thinking models.

This mixture-of-experts (MoE) model has 235B total parameters, of which 22B are activated per token: 8 of its 128 experts are active at any time. It delivers markedly improved performance on reasoning tasks, including logical reasoning, mathematics, science, coding, and academic benchmarks that typically require human expertise.
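To make the "8 of 128 experts" idea concrete, here is a minimal sketch of generic top-k expert routing. It is not taken from the Qwen3 implementation: the function name `route_token`, the random router logits, and the choice to apply softmax over only the selected experts are assumptions made for illustration.

```python
import math
import random

NUM_EXPERTS = 128  # total experts, from the description above
TOP_K = 8          # experts active per token, from the description above


def route_token(router_logits, top_k=TOP_K):
    """Return (expert_index, weight) pairs for the top_k highest-scoring experts.

    Assumption: weights are a softmax over the selected logits only,
    so they sum to 1. Real routers may normalize differently.
    """
    # Indices of the top_k largest router logits.
    chosen = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)[:top_k]
    # Numerically stable softmax restricted to the chosen experts.
    peak = max(router_logits[i] for i in chosen)
    exps = [math.exp(router_logits[i] - peak) for i in chosen]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(chosen, exps)]


# Hypothetical router scores for a single token.
rng = random.Random(0)
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
routing = route_token(logits)
print(len(routing), "of", NUM_EXPERTS, "experts selected")
```

Only the selected experts run for a given token, which is why the activated parameter count (22B) is far smaller than the total.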

The model natively supports a context length of up to 262,144 tokens, with enhanced long-context understanding.

It also offers advanced agent capabilities and supports over 100 languages and dialects.

Parameters

Custom configuration options included with this model

Min P Sampling: 0
Repeat Penalty: Disabled
Temperature: 0.6
Top K Sampling: 20
Top P Sampling: 0.95
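As a rough illustration of how these sampling settings interact, the sketch below applies temperature, top-k, top-p, and min-p to a list of logits using only the Python standard library. The order of the filters, the function names `filter_distribution` and `sample`, and the decision to compute top-p over the pre-truncation probabilities are assumptions; actual runtimes may order or normalize these steps differently. Repeat penalty is disabled in the list above, so it is not modeled.

```python
import math
import random

# Values copied from the parameter list above.
TEMPERATURE = 0.6
TOP_K = 20
TOP_P = 0.95
MIN_P = 0.0


def filter_distribution(logits, temperature=TEMPERATURE, top_k=TOP_K,
                        top_p=TOP_P, min_p=MIN_P):
    """Return {token_id: probability} after temperature, top-k, top-p, min-p."""
    # Temperature scaling followed by a numerically stable softmax.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = sorted(((e / total, i) for i, e in enumerate(exps)), reverse=True)

    # Top-k: keep only the k most likely tokens.
    probs = probs[:top_k]

    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for p, i in probs:
        kept.append((p, i))
        cumulative += p
        if cumulative >= top_p:
            break

    # Min-p: drop tokens below min_p times the top probability (0 disables it).
    floor = min_p * kept[0][0]
    kept = [(p, i) for p, i in kept if p >= floor]

    # Renormalize the surviving tokens so their probabilities sum to 1.
    norm = sum(p for p, _ in kept)
    return {i: p / norm for p, i in kept}


def sample(logits, rng):
    """Draw one token id from the filtered distribution."""
    dist = filter_distribution(logits)
    tokens = list(dist)
    return rng.choices(tokens, weights=[dist[t] for t in tokens], k=1)[0]


# Hypothetical logits for a 50-token vocabulary.
example_logits = [float(i) for i in range(50)]
print(sample(example_logits, random.Random(0)))
```

With a temperature below 1 the distribution is sharpened before truncation, so top-k and top-p typically leave only a handful of candidate tokens.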

Sources

The underlying model files this model uses

Based on

🤗 lmstudio-community/Qwen3-235B-A22B-Thinking-2507-GGUF (GGUF)
🤗 lmstudio-community/Qwen3-235B-A22B-Thinking-2507-MLX-4bit (MLX)
🤗 lmstudio-community/Qwen3-235B-A22B-Thinking-2507-MLX-6bit (MLX)
🤗 lmstudio-community/Qwen3-235B-A22B-Thinking-2507-MLX-8bit (MLX)

qwen3-235b-a22b-thinking-2507