@titi14
Joined November 2025
Projects
The 4B parameter version of the Qwen3 model family.
MODEL
Distilled version of the DeepSeek-R1-0528 model, created by continuing the post-training process on the Qwen3 8B Base model using Chain-of-Thought (CoT) from DeepSeek-R1-0528.
MODEL
State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models
MODEL
a 7B Vision Language Model (VLM) from the Qwen2.5 family
MODEL
MiniMax M2 is a 230B MoE (10 active) LLM, built for coding and agentic workflows.
MODEL
State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models
MODEL