Fixed "Cannot read properties of undefined" when using a text-only model in a chat with images
Fixed a path resolution issue on Windows that caused LM Runtimes on certain machines to act unexpectedly
CUDA model load crash, "llm_engine_cuda.node. The file cannot be accessed by the system"
ROCm garbled model generation
Fixed a bug where RAG messages in chats created with older versions of the app were not showing
Fixed an Input Method Editor (IME) bug: now upon pressing Enter the message will not send unless the composition is completed
Build 5
Fixed an API bug where reasoning_content setting was not respected when streaming DeepSeek R1 chat completion responses
Build 4
New Experimental API: send reasoning_content in a separate field in chat completion responses (both streaming and non-streaming)
Works for models that generate content within <think></think> tags (like DeepSeek R1)
Turn on in App Settings > Developer
Build 3
New: Add a Chat Appearance option to auto-expand newly added Thinking UI blocks
New: Show quick access to guardrail config when the app gives an insufficient system resources error notification
Fixed a bug where if the non-default models directory is deleted, new models will not be indexed
Fixed a bug in hardware detection that sometimes incorrectly filtered out GPUs in multi-GPU setups when using the Vulkan backend
Fixed a bug in the model load UI where F32 cache types without flash attention were not recognized as a valid configuration for the llama.cpp Metal runtime
Build 2
New: Added support for downloading models from nested folders in Hugging Face repositories
Improved support for searching with Hugging Face URLs directly
New: Automatically update selected Runtime Extension Packs (you can turn this off in Settings)
New: Added an option to use LM Studio's Hugging Face proxy. This can help users who have trouble accessing Hugging Face directly
New: KV Cache Quantization for MLX models (requires mlx-engine/0.3.0)
My Models tab refresh: neater model names, and sidebar categories for model types
Can toggle back to showing full file names in App Settings > General
To see raw model metadata (previously: (i) button), right-click on the model name and choose "View Raw Metadata"
Fixed a bug where clearing Top K in Sampling Settings would trigger an error
Build 1
New: TTL - optionally auto-unload unused API models after a certain amount of time (ttl field in request payload)