Advanced reasoning model from ByteDance with flexible "thinking budget" control and the ability to reflect on the length of its own reasoning.
To run the smallest Seed-OSS model, you need at least 21 GB of RAM.
Seed-OSS models support tool use and reasoning. They are available in GGUF and MLX formats.

Seed-OSS is a series of open-source large language models developed by ByteDance's Seed Team, designed for strong long-context, reasoning, agentic, and general capabilities, along with versatile developer-friendly features. Although trained on only 12T tokens, Seed-OSS achieves excellent performance on several popular open benchmarks.
You can configure the model's thinking budget within LM Studio's chat interface.
Seed-OSS adopts the popular causal language model architecture with RoPE, grouped-query attention (GQA), RMSNorm, and SwiGLU activation.

| Property | Value |
| --- | --- |
| Parameters | 36B |
| Attention | GQA |
| Activation Function | SwiGLU |
| Number of Layers | 64 |
| Number of QKV Heads | 80 / 8 / 8 |
| Head Size | 128 |
| Hidden Size | 5120 |
| Vocabulary Size | 155K |
| Context Length | 512K |
| RoPE Base Frequency | 1e7 |

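
To make the effect of GQA at long context concrete, the figures in the table above can be turned into a rough key/value cache estimate. This is a back-of-the-envelope sketch, not a measurement: it reads "80 / 8 / 8" as 80 query heads with 8 key and 8 value heads, and it assumes a 16-bit cache and that "512K" means 512 × 1024 tokens. Neither assumption is stated in the source, and the estimate covers the cache only, not the model weights.

```python
# Values taken from the architecture table.
NUM_LAYERS = 64
NUM_Q_HEADS = 80    # assumed reading of "80 / 8 / 8": query heads
NUM_KV_HEADS = 8    # assumed reading of "80 / 8 / 8": key heads and value heads
HEAD_SIZE = 128

# Assumptions not stated in the source.
BYTES_PER_VALUE = 2            # 16-bit (fp16/bf16) cache entries
CONTEXT_TOKENS = 512 * 1024    # "512K" interpreted as 524,288 tokens


def kv_cache_bytes_per_token(num_kv_heads: int) -> int:
    """Bytes of KV cache one token occupies across all layers."""
    # The factor 2 accounts for storing both a key and a value per head.
    return 2 * NUM_LAYERS * num_kv_heads * HEAD_SIZE * BYTES_PER_VALUE


# With GQA, only the 8 KV heads are cached.
gqa_per_token = kv_cache_bytes_per_token(NUM_KV_HEADS)
# Hypothetical comparison: one KV head per query head (plain multi-head attention).
mha_per_token = kv_cache_bytes_per_token(NUM_Q_HEADS)

full_context_gqa = gqa_per_token * CONTEXT_TOKENS

print(f"GQA cache per token:   {gqa_per_token / 1024:.0f} KiB")    # 256 KiB
print(f"GQA cache at 512K:     {full_context_gqa / 2**30:.0f} GiB")  # 128 GiB
print(f"Reduction vs. 80 KV heads: {mha_per_token // gqa_per_token}x")  # 10x
```

Under these assumptions, each token costs about 256 KiB of cache, so filling the full context window would need roughly 128 GiB for the cache alone. In practice, shorter context settings or a quantized cache would lower this figure.
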
Seed-OSS is Apache-2.0 licensed.