0.3.15 - Release Notes
New features and improvements in LM Studio 0.3.15:
- Support for NVIDIA RTX 50 series GPUs with CUDA 12
- Support for GLM4 models
tool_choice parameter support in OpenAI-like REST API
- Tool streaming bug fixes
- UI touchups
- Preview: ability to publish and download presets from the community! (enable in Settings > General)
Build 11
- Llama 4 prompt template fixes to improve tool call reliability
Build 10
- Preview: Add the ability to publish and download presets from the community (head to Settings to enable)
- Add
tool_choice parameter support to OpenAI-like REST API
"tool_choice": "none" - Model will not call any tools
"tool_choice": "auto" - Model decides whether or not to call tools
"tool_choice": "required" - Forces model to only output tools (llama.cpp engines only)
- Added an option to log each generated fragment to API server logs
- Fixed the erroneous "Client disconnected. Stopping generation..." message when using the API server
- Fixed a front end error when using the preset selection in the developer page
- Fix for GLM prompt template
- Fix Llama 4 prompt template bug "Unknown ArrayValue filter: trim" when using tools
Build 9
- Fix: Ensure OpenAI-like REST API chunk "finish_reason" is "tool_calls" when appropriate
- Fixes "N/A" token count in system prompt editor when model is loaded
Build 8
- Experimental feature behind flag in Chat Appearance, smooth autoscroll latest chat message to top
Build 7
- [CUDA12] Fix incorrect VRAM capacity showing on Hardware page on some machines
- Fix Llama 4 crashes when using GPU settings: priority order, limit offload to dedicated GPU memory
- [GGUF] Fixed bug where top-k sampling parameter could not be set to 0
- [MLX] Removed the checkbox from top-k sampling parameter
Build 6
- Chat terminal message styling updates
- Conversation font scale introduced in chat Appearance tab
- Conversation font weight introduced in chat Appearance tab
Build 5
- [CUDA] CUDA 12 engine auto-upgrade if driver is compatible and any GPU is 50-series and above
- [MLX] Add top-k sampler
Build 4
- New: CUDA 12 support in LM Studio's llama.cpp engines (Windows/Linux)
- Dramatically faster first-time model load times on RTX 50-series GPUs
- Initial compatibility requirements:
- NVIDIA driver version:
- Windows: 551.61 or newer
- Linux: 550.54.14 or newer
- At least one GPU of the following:
- GeForce RTX 5090, RTX 5080, RTX 5070 Ti, or RTX 5070
- Datacenter GPU with Hopper or Blackwell micro-architecture
- App will automatically upgrade you if your machine is compatible
- Check your system compatibility by running
nvidia-smi in terminal
- Added support for sorting models by last load time in the model loader (the new default)
- Adds new system prompt editor UI
- Adds a toggle to hide/show advanced settings while loading models
- Fix Cogito jinja parsing error "Unexpected character: ~"
- Fixes downloads pane resize bug
Build 3
- Fixed lms CLI sometimes not initializing properly on MacOS
Build 2
- Fixes bug where the chat sidebar labels would overflow
- Fixes bug where the downloads pane would open at wrong position
Build 1
- UI touchups:
- New and improved chat input box
- Neatened up app action bar layout
- Slimmer app sidebar
- Chat sidebar segments: Context and Model