10.4K Downloads
Description
Updated version of Qwen3-235B-A22B, featuring significant improvements in general capabilities, including instruction following, logical reasoning, text comprehension, mathematics, science, coding, and tool usage.
Updated on July 23

README
This MoE model activates 22B of its 235B total parameters, routing each token through 8 of its 128 experts. Compared to the original Qwen3-235B-A22B, it delivers substantially better long-tail knowledge coverage across many languages and markedly closer alignment with user preferences on subjective and open-ended tasks.
Natively supports a context length of up to 262,144 (256K) tokens, with enhanced long-context understanding.
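The figures above can be sanity-checked with quick arithmetic (a minimal sketch; all totals are taken from this card):

```python
# Headline numbers from the model card.
TOTAL_PARAMS = 235e9      # 235B total parameters
ACTIVE_PARAMS = 22e9      # 22B activated per token
TOTAL_EXPERTS = 128
ACTIVE_EXPERTS = 8
CONTEXT_LEN = 262_144

# Fraction of experts consulted for each token.
expert_ratio = ACTIVE_EXPERTS / TOTAL_EXPERTS   # 0.0625
# Fraction of weights active in a single forward pass.
param_ratio = ACTIVE_PARAMS / TOTAL_PARAMS      # ~0.094

# The native context window is exactly 256 Ki tokens.
assert CONTEXT_LEN == 256 * 1024

print(f"{expert_ratio:.2%} of experts active, ~{param_ratio:.1%} of parameters")
```

The gap between the two ratios reflects the dense (non-expert) layers, which are active for every token.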
It provides advanced agent capabilities and supports more than 100 languages and dialects.
Note: This model supports only non-thinking mode and does not generate <think></think> blocks in its output.
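Because this model never emits `<think></think>` blocks, client code written to handle both thinking and non-thinking Qwen3 variants can treat stripping as a no-op here. A small hypothetical helper (not part of any official SDK) illustrates the idea:

```python
import re

# Matches a <think>...</think> reasoning block, including trailing whitespace.
THINK_RE = re.compile(r"<think>.*?</think>\s*", re.DOTALL)

def strip_think(text: str) -> str:
    """Remove any <think>...</think> blocks from model output.

    For non-thinking models like this one, the output contains no
    such blocks, so the text passes through unchanged.
    """
    return THINK_RE.sub("", text)

# Output from this model: unchanged.
assert strip_think("The answer is 4.") == "The answer is 4."
# Output from a thinking-mode variant: reasoning block removed.
assert strip_think("<think>2 + 2 = 4</think>The answer is 4.") == "The answer is 4."
```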