titi14's profile picture

titi14

@titi14

Joined November 2025

Projects

Public
Forked from qwen/qwen3-4b

The 4B parameter version of the Qwen3 model family.

MODEL

Updated 16 hours ago

Distilled version of the DeepSeek-R1-0528 model, created by continuing the post-training process on the Qwen3 8B Base model using Chain-of-Thought (CoT) from DeepSeek-R1-0528.

MODEL

Updated 16 hours ago

State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models

MODEL

Updated 16 hours ago
Forked from qwen/qwen3-vl-2b

MODEL

Updated 16 hours ago

a 7B Vision Language Model (VLM) from the Qwen2.5 family

MODEL

Updated 16 hours ago

MiniMax M2 is a 230B MoE (10 active) LLM, built for coding and agentic workflows.

MODEL

Updated 16 hours ago

MODEL

Updated 16 hours ago
Forked from google/gemma-3-4b

State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models

MODEL

Updated 16 hours ago