Forked from qwen/qwen3-coder-480b
Description
Qwen's most powerful code model, featuring 480B total parameters with 35B activated through Mixture of Experts (MoE) architecture.
Capabilities
Minimum system memory
Tags
Last updated
Updated 2 days agobyREADME
Qwen's most powerful code model, featuring 480B total parameters with 35B activated through Mixture of Experts (MoE) architecture.
Key Features:
Technical Specifications:
Note: This model operates in non-thinking mode only and does not generate <think></think> blocks.
Parameters
Custom configuration options included with this model
Sources
The underlying model files this model uses