33.3K Downloads
Description
Advanced reasoning model from ByteDance with flexible "thinking budget" control and ability to reflect on the length of its own reasoning
Use cases
Minimum system memory
Tags
Last update
Updated on August 28byREADME
Advanced reasoning model with flexible thinking budget control and native 512K context support
36B parameters with GQA attention architecture, designed for powerful long-context reasoning, agentic tasks, and general capabilities
Features dynamic reasoning length control, allowing users to adjust thinking budget from 512 tokens to unlimited based on task complexity
Excels at mathematical reasoning, coding tasks, tool usage, and agentic workflows including SWE-Bench and issue resolution
Achieves state-of-the-art performance on multiple benchmarks including MATH (81.7%), LiveCodeBench (67.4%), and RULER long-context (94.6%)
Optimized for international use cases with Apache 2.0 license and research-friendly design
Custom Fields
Special features defined by the model author
Thinking Budget
: select
(default=-1)
Sets the maximum number of tokens the model can use for internal reasoning
Parameters
Custom configuration options included with this model
Sources
The underlying model files this model uses