titi14

@titi14

Joined November 2025

Projects

qwen3-4b

Public

The 4B parameter version of the Qwen3 model family.

MODEL

Forked from qwen/qwen3-4b • Updated on November 28

Distilled version of the DeepSeek-R1-0528 model, created by continuing the post-training process on the Qwen3 8B Base model using Chain-of-Thought (CoT) from DeepSeek-R1-0528.

State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models.

MODEL

Forked from qwen/qwen3-vl-2b • Updated on November 28

MODEL

Forked from qwen/qwen2.5-vl-7b • Updated on November 28

MODEL

Forked from minimax/minimax-m2 • Updated on November 28

State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models.