ModelsDocsBlogEnterpriseLM LinkNew

Login or Signup

Models

Developer Docs

LM Link

Careers

Blog

Changelog

Enterprise Solutions

Privacy Policy

Terms of Use

👾 Element Labs, Inc. © 2026

LinkedIn GitHub Discord Twitter / X

Product

Download the app Models LM LinkNew LM Studio Hub Beta Releases Changelog

Developer

Developer Docs lmstudio-js lmstudio-python LM Studio CLI (lms)llms.txt llms-full.txt

Company

CareersWe're Hiring!

Enterprise Solutions

Legal

psk11/qwen3.6-28-b-reap-i1 • LM Studio Hub

qwen3.6-28-b-reap-i1

Public

Description

Das Modell lädt selbst mit 205 Experten und voller Kontextlänge 262 k Token in den Speicher (16 GB VRAM auf NVIDIA RTX 5070 TI). Aber: es läuft dann extrem langsam. Empfehlung: 18 Experten, 131 k Kontextlänge, KV-Quant Q8_0/Q5_1, dann sehr schnell!

Last updated

Updated 21 days agoby

Parameters

Structured Output

None