Changelog • LM Studio 0.3.15

New features and improvements in LM Studio 0.3.15:

Support for NVIDIA RTX 50 series GPUs with CUDA 12
Support for GLM4 models
tool_choice parameter support in OpenAI-like REST API
Tool streaming bug fixes
UI touchups
Preview: ability to publish and download presets from the community! (enable in Settings > General)

Build 11

Build 10

Preview: Add the ability to publish and download presets from the community (head to Settings to enable)
Add tool_choice parameter support to OpenAI-like REST API
- "tool_choice": "none" - Model will not call any tools
- "tool_choice": "auto" - Model decides whether or not to call tools
- "tool_choice": "required" - Forces model to only output tools (llama.cpp engines only)
Added an option to log each generated fragment to API server logs
Fixed the erroneous "Client disconnected. Stopping generation..." message when using the API server
Fixed a front end error when using the preset selection in the developer page
Fix for GLM prompt template
Fix Llama 4 prompt template bug "Unknown ArrayValue filter: trim" when using tools

Build 9

Fix: Ensure OpenAI-like REST API chunk "finish_reason" is "tool_calls" when appropriate
Fixes "N/A" token count in system prompt editor when model is loaded

Build 8

Experimental feature behind flag in Chat Appearance, smooth autoscroll latest chat message to top

Build 7

[CUDA12] Fix incorrect VRAM capacity showing on Hardware page on some machines
Fix Llama 4 crashes when using GPU settings: priority order, limit offload to dedicated GPU memory
[GGUF] Fixed bug where top-k sampling parameter could not be set to 0
[MLX] Removed the checkbox from top-k sampling parameter

Build 6

Build 5

[CUDA] CUDA 12 engine auto-upgrade if driver is compatible and any GPU is 50-series and above
[MLX] Add top-k sampler

Build 4

New: CUDA 12 support in LM Studio's llama.cpp engines (Windows/Linux)
- Dramatically faster first-time model load times on RTX 50-series GPUs
- Initial compatibility requirements:
  - NVIDIA driver version:
    - Windows: 551.61 or newer
    - Linux: 550.54.14 or newer
  - At least one GPU of the following:
    - GeForce RTX 5090, RTX 5080, RTX 5070 Ti, or RTX 5070
    - Datacenter GPU with Hopper or Blackwell micro-architecture
- App will automatically upgrade you if your machine is compatible
- Check your system compatibility by running nvidia-smi in terminal
Added support for sorting models by last load time in the model loader (the new default)
Adds new system prompt editor UI
Adds a toggle to hide/show advanced settings while loading models
Fix Cogito jinja parsing error "Unexpected character: ~"
Fixes downloads pane resize bug

Build 3

Build 2

Build 1

UI touchups:
- New and improved chat input box
- Neatened up app action bar layout
- Slimmer app sidebar
- Chat sidebar segments: Context and Model