February 1, 2025

LM Studio 0.3.9

0.3.9 - Release Notes

Build 6

  • Fixed "Cannot read properties of undefined" when using a text-only model in a chat with images
  • Fixed a path resolution issue on Windows that caused LM Runtimes on certain machines to act unexpectedly
    • CUDA model load crash, "llm_engine_cuda.node. The file cannot be accessed by the system"
    • ROCm garbled model generation
  • Fixed a bug where RAG messages in chats created with older versions of the app were not showing
  • Fixed an Input Method Editor (IME) bug: now upon pressing Enter the message will not send unless the composition is completed

Build 5

  • Fixed an API bug where reasoning_content setting was not respected when streaming DeepSeek R1 chat completion responses

Build 4

  • New Experimental API: send reasoning_content in a separate field in chat completion responses (both streaming and non-streaming)
    • Works for models that generate content within <think> </think> tags (like DeepSeek R1)
    • Turn on in App Settings > Developer

Build 3

  • New: Add a Chat Appearance option to auto-expand newly added Thinking UI blocks
  • New: Show quick access to guardrail config when the app gives an insufficient system resources error notification
  • Fixed a bug where if the non-default models directory is deleted, new models will not be indexed
  • Fixed a bug in hardware detection that sometimes incorrectly filtered out GPUs in multi-GPU setups when using the Vulkan backend
  • Fixed a bug in the model load UI where F32 cache types without flash attention were not recognized as a valid configuration for the llama.cpp Metal runtime

Build 2

  • New: Added support for downloading models from nested folders in Hugging Face repositories
  • Improved support for searching with Hugging Face URLs directly
  • New: Automatically update selected Runtime Extension Packs (you can turn this off in Settings)
  • New: Added an option to use LM Studio's Hugging Face proxy. This can help users who have trouble accessing Hugging Face directly
  • New: KV Cache Quantization for MLX models (requires mlx-engine/0.3.0)
  • My Models tab refresh: neater model names, and sidebar categories for model types
    • Can toggle back to showing full file names in App Settings > General
    • To see raw model metadata (previously: (i) button), right-click on the model name and choose "View Raw Metadata"
  • Fixed a bug where clearing Top K in Sampling Settings would trigger an error

Build 1

  • New: TTL - optionally auto-unload unused API models after a certain amount of time (ttl field in request payload)
  • New: Auto-Evict - optionally auto-unload previously loaded API models before loading new ones (control in App Settings)
  • Fixed a bug where equations inside model thinking blocks would sometimes generate empty space below the block
  • Fixed cases where text in toast notifications was not scrollable
  • Fixed a bug where unchecking and checking Structured Output JSON would make the schema value disappear
  • Fixed a bug where auto-scroll while generating would sometimes not allow scrolling up
  • [Developer] Moved logging options to the Developer Logs panel header (••• menu)
  • Fixed Chat Appearance font size option not scaling text in Thoughts block