The latest version of the Qwen3 model family, featuring 4B, 30B, and 235B dense and MoE models, both thinking and non-thinking variants.
To run the smallest Qwen3, you need at least 2 GB of RAM. The largest one may require up to 134 GB.
Qwen3 models support tool use and reasoning. They are available in gguf and mlx.

Over the past three months, Alibaba Qwen continued to explore the potential of the Qwen3 families, and they now introduce the updated Qwen3-2507 in two variants, Qwen3-Instruct-2507 and Qwen3-Thinking-2507, and multiple sizes.
Qwen3-Instruct-2507 is the updated version of the previous Qwen3 non-thinking mode, featuring the following key enhancements:
Qwen3-Thinking-2507 is the continuation of Qwen3 thinking model, with improved quality and depth of reasoning, featuring the following key enhancements: