ModelsDocsBlogPricingEnterprise

Login or Signup

Models

Developer Docs

Pricing

Enterprise

Careers

Blog

Changelog

Privacy Policy

Terms of Use

👾 Element Labs, Inc. © 2026

LinkedIn GitHub Discord Twitter / X

Product

Download the app Models LM LinkNew LM Studio Hub Beta Releases Changelog

Developer

Developer Docs lmstudio-js lmstudio-python LM Studio CLI (lms)llms.txt llms-full.txt

Company

CareersWe're Hiring!

Enterprise Solutions

Legal

five0/qwen3-coder-480b • LM Studio Hub

qwen3-coder-480b

Public

Forked from qwen/qwen3-coder-480b

Description

Qwen's most powerful code model, featuring 480B total parameters with 35B activated through Mixture of Experts (MoE) architecture.

Capabilities

Trained for tool use

Minimum system memory

250GB

Tags

480B

qwen3_moe

Last updated

Updated on November 26by

README

Qwen3 Coder 480B

Qwen's most powerful code model, featuring 480B total parameters with 35B activated through Mixture of Experts (MoE) architecture.

Key Features:

Agentic Coding: Comparable performance to Claude Sonnet 4 on coding tasks
Repository-Scale Understanding: Optimized for large codebases and complex projects

Technical Specifications:

480B total parameters, 35B activated (MoE with 160 experts, 8 active)
62 layers with Grouped Query Attention (96 Q heads, 8 KV heads)

Parameters

Custom configuration options included with this model

Repeat Penalty

1.05

Temperature

0.7

Top K Sampling

20

Top P Sampling

0.8

Sources

The underlying model files this model uses

Native 262,144 token context length

Note: This model operates in non-thinking mode only and does not generate <think></think> blocks.

Based on

🤗lmstudio-community/Qwen3-Coder-480B-A35B-Instruct-GGUF→

GGUF

🤗lmstudio-community/Qwen3-Coder-480B-A35B-Instruct-MLX-4bit→

MLX

🤗lmstudio-community/Qwen3-Coder-480B-A35B-Instruct-MLX-6bit→

MLX

🤗lmstudio-community/Qwen3-Coder-480B-A35B-Instruct-MLX-8bit→

MLX