Model Catalog

New & noteworthy local models you can run on your own computer.

Granite 4.1
3B
8B
30B
Granite 4.1 models are new and improved Granite models that have gone through an improved post-training pipeline, including supervised fine-tuning and reinforcement learning alignment, resulting in enhanced tool calling, instruction following, and chat capabilities.
1.2K
0
3
Updated 3 hours ago
Nemotron 3 Omni
30B
NVIDIA Nemotron 3 Nano Omni is an open, highly efficient multimodal model that powers sub-agents to complete tasks faster across vision, audio, and language.
145K
21
Updated 22 days ago
Qwen3.6
27B
35B
Qwen3.6 prioritizes stability and real-world utility, offering developers a more intuitive, responsive, and genuinely productive coding experience.
1.3M
92
2
Updated 22 days ago
Gemma 4
5.1B
7.9B
26B
31B
Gemma 4 is Google's most capable family of open models, built from Gemini 3 research. Supports vision input and available in multiple sizes for on-device deployment.
4.4M
691
4
Updated 1 month ago
Nemotron 3 Super
120B
NVIDIA Nemotron 3 Super is a 120B open hybrid MoE model (12B active) that supports a context window of up to 1M tokens.
169K
45
Updated 2 months ago
Qwen3.5
2B
4B
9B
27B
35B
Qwen3.5 represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility to empower developers and enterprises with unprecedented capability and efficiency.
3M
315
5
Updated 2 months ago
LFM2-24B-A2B
24B
LFM2 is a family of hybrid models designed for on-device deployment. LFM2-24B-A2B is the largest model in the family, scaling the architecture to 24 billion parameters while keeping inference efficient.
87.1K
18
Updated 2 months ago
Qwen3-Coder-Next
80B
Qwen3 Coder Next is an 80B MoE with 3B active parameters designed for coding agents and local development. Excels at long-horizon reasoning, complex tool usage, and recovery from execution failures.
261.1K
85
Updated 3 months ago
GLM-4.7
30B
Open source coding models by Z.ai, based on a new base model and specializing in coding and tool calling.
299.2K
107
Updated 4 months ago
FunctionGemma
270M
FunctionGemma is a lightweight, open model from Google, built as a foundation for creating your own specialized function calling models.
2.8K
48
Updated 5 months ago
Nemotron 3
30B
General-purpose reasoning and chat model trained from scratch by NVIDIA. Contains 30B total parameters with only 3.5B active at a time for low-latency MoE inference.
149.5K
59
Updated 5 months ago
GLM-4.6V-Flash
9B
GLM 4.6V Flash is a 9B vision-language model optimized for local deployment and low-latency applications.
280.3K
65
Updated 5 months ago
Devstral 2
24B
123B
Second-generation Devstral for agentic coding. Built for tool use to explore codebases, edit multiple files, and power software engineering agents with newly added vision support.
184.7K
63
2
Updated 5 months ago
Rnj-1
8B
Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI.
65K
21
Updated 5 months ago
Ministral 3
3B
3B
8B
8B
14B
14B
The Ministral 3 series is available in three model sizes: 3B, 8B, and 14B parameters. Provides a best-in-class cost-to-performance ratio.
542.7K
127
6
Updated 5 months ago
Qwen3 Next
80B
Hybrid-attention, high-sparsity Mixture-of-Experts 80B model (3B active).
54.2K
31
Updated 5 months ago
Olmo 3
7B
7B
32B
Olmo 3 is a family of open language models designed to enable the science of language models.
38.1K
29
3
Updated 6 months ago
olmOCR 2
7B
The olmOCR 2 model is a Vision Language Model (VLM) from Allen AI.
71.4K
20
Updated 6 months ago
minimax-m2
230B
MiniMax M2 is a 230B MoE (10B active) model built for coding and agentic workflows.
15K
40
Updated 6 months ago
gpt-oss-safeguard
20B
120B
gpt-oss-safeguard-20b and gpt-oss-safeguard-120b are open safety models from OpenAI, building on gpt-oss. Trained to help classify text content based on customizable policies.
7.6K
36
2
Updated 6 months ago
Qwen3-VL
2B
4B
8B
30B
32B
Qwen's latest vision-language model. Includes comprehensive upgrades to visual perception, spatial reasoning, and image understanding.
751.5K
136
5
Updated 6 months ago
Granite 4.0
3B
3B
7B
32B
Granite 4.0 language models are lightweight, state-of-the-art open models that natively support multilingual capabilities, coding tasks, RAG, tool use, and JSON output.
76K
54
4
Updated 6 months ago
seed-oss
36B
Advanced reasoning model from ByteDance with flexible "thinking budget" control and the ability to reflect on the length of its own reasoning.
55.1K
23
Updated 6 months ago
Qwen3
4B
4B
30B
30B
235B
235B
The latest version of the Qwen3 model family, featuring 4B, 30B, and 235B dense and MoE models, both thinking and non-thinking variants.
489.2K
184
6
Updated 6 months ago
gpt-oss
20B
120B
OpenAI's first open source LLM. Comes in 2 sizes: 20B and 120B. Supports configurable reasoning effort (low, medium, high). Trained for tool use. Apache 2.0 licensed.
1.7M
361
2
Updated 6 months ago
Qwen3-Coder
30B
480B
State-of-the-art, Mixture-of-Experts local coding model with native support for 256K context length. Available in 30B (3B active) and 480B (35B active) sizes.
415K
155
2
Updated 6 months ago
Ernie-4.5
21B
Medium-size Mixture-of-Experts model from Baidu's new Ernie 4.5 line of foundation models.
22.7K
12
Updated 6 months ago
LFM2
350M
700M
1.2B
LFM2 is a new generation of hybrid models developed by Liquid AI, specifically designed for edge AI and on-device deployment. It sets a new standard in terms of quality, speed, and memory efficiency.
70.1K
52
3
Updated 6 months ago
Devstral
23.6B
24B
Devstral is a coding model from Mistral AI. It excels at using tools to explore codebases, editing multiple files, and powering software engineering agents.
79.7K
41
2
Updated 5 months ago
gemma-3n
4.5B
6.9B
Gemma 3n is a generative AI model optimized for use in everyday devices, such as phones, laptops, and tablets.
250.5K
96
2
Updated 6 months ago
Mistral Small
24B
Mistral Small is a 'knowledge-dense' 24B multimodal (image input) local model that supports a context length of up to 128K tokens.
81.2K
21
Updated 6 months ago
Magistral
23.6B
24B
MistralAI's open-weight reasoning model. 24B dense transformer model supporting up to 128K token context window. The model is capable of long chains of reasoning traces before providing answers.
129.7K
51
2
Updated 6 months ago
mistral-nemo
12B
General-purpose dense transformer designed for multilingual use cases. Built in collaboration between MistralAI and NVIDIA.
50K
13
Updated 6 months ago
qwen2.5-vl
3B
7B
32B
72B
Qwen2.5-VL is a performant vision-language model, capable of recognizing common objects and text. Supports a context length of 128K tokens and a variety of human languages.
216.2K
30
4
Updated 6 months ago
gemma-3
270M
1B
4B
12B
27B
State-of-the-art image + text input models from Google, built from the same research and tech used to create the Gemini models.
2M
219
5
Updated 6 months ago
phi-4-reasoning
3.8B
14.7B
14.7B
Phi-4-mini-reasoning is a lightweight open model built upon synthetic data with a focus on high-quality, reasoning-dense data.
184.8K
53
3
Updated 6 months ago
phi-4
3B
14B
phi-4 is a state-of-the-art open model built upon a blend of synthetic datasets, data from filtered public domain websites, and acquired academic books and Q&A datasets.
38.5K
19
2
Updated 6 months ago
Codestral
22B
Mistral AI's latest coding model, Codestral can handle both instructions and code completions with ease in over 80 programming languages.
58.8K
32
Updated 6 months ago
Mistral
7B
One of the most popular open-source LLMs, Mistral's 7B Instruct model's balance of speed, size, and performance makes it a great general-purpose daily driver.
141.7K
49
Updated 6 months ago
Qwen3 (1st Generation)
4B
8B
14B
30B
32B
235B
The first batch of Qwen3 models (Qwen3-2504), a collection of dense and MoE models ranging from 4B to 235B. These are general-purpose models that score highly on benchmarks.
540.9K
54
6
Updated 6 months ago
deepseek-r1
7B
8B
8B
14B
32B
70B
Distilled version of the DeepSeek-R1-0528 model, created by continuing the post-training process on the Qwen3 8B Base model using Chain-of-Thought (CoT) from DeepSeek-R1-0528.
857.2K
247
6
Updated 6 months ago