10.6K Downloads
Advanced reasoning model from ByteDance with flexible "thinking budget" control and ability to reflect on the length of its own reasoning
Trained for Ttool use
Reasoning
Advanced reasoning model with flexible thinking budget control and native 512K context support
36B parameters with GQA attention architecture, designed for powerful long-context reasoning, agentic tasks, and general capabilities
Features dynamic reasoning length control, allowing users to adjust thinking budget from 512 tokens to unlimited based on task complexity
Excels at mathematical reasoning, coding tasks, tool usage, and agentic workflows including SWE-Bench and issue resolution
Achieves state-of-the-art performance on multiple benchmarks including MATH (81.7%), LiveCodeBench (67.4%), and RULER long-context (94.6%)
Optimized for international use cases with Apache 2.0 license and research-friendly design
Special features defined by the model author
Thinking Budget
: select
(default=-1)
Sets the maximum number of tokens the model can use for internal reasoning
The underlying model files this model uses
When you download this model, LM Studio picks the source that will best suit your machine (you can override this)
Custom configuration options included with this model