All components run on one machine. LM Studio acts as both client and server.
┌────────────────────────────────────────────────────────────────────┐
│                           MacBook Pro M5                           │
│                                                                    │
│  ┌──────────────────────────────────────────────────────────────┐  │
│  │                        LM Studio App                         │  │
│  │                                                              │  │
│  │  ┌────────────────────────────────────────────────────────┐  │  │
│  │  │  Plugin: user-docs                                     │  │  │
│  │  │                                                        │  │  │
│  │  │  vision-capability-primer: qwen/qwen3-vl-4b            │  │  │
│  │  └────────────────────────────────────────────────────────┘  │  │
│  │                              │                               │  │
│  │                              │  OpenAI-compat. API           │  │
│  │                              ▼                               │  │
│  │  ┌────────────────────────────────────────────────────────┐  │  │
│  │  │  LM Studio Server (local)                              │  │  │
│  │  │  baseUrl: http://127.0.0.1:1234/v1                     │  │  │
│  │  │  Agent Model: qwen/qwen3.6-27b                         │  │  │
│  │  │  Embedding Model: ggml-org/bge-m3-Q8_0-GGUF            │  │  │
│  │  │  Vision Model: qwen/qwen3-vl-8b                        │  │  │
│  │  └────────────────────────────────────────────────────────┘  │  │
│  └──────────────────────────────────────────────────────────────┘  │
│                                                                    │
└────────────────────────────────────────────────────────────────────┘
graph TD
    subgraph MacBookProM5["MacBook Pro M5"]
        subgraph LMStudioApp["LM Studio App"]
            Plugin["Plugin: user-docs<br>vision-capability-primer:<br> qwen/qwen3-vl-4b"]
            LMServerLocal["LM Studio Server (local)<br>baseUrl: http://127.0.0.1:1234/v1<br>Agent Model: qwen/qwen3.6-27b<br>Embedding Model: ggml-org/bge-m3-Q8_0-GGUF<br>Vision Model: qwen/qwen3-vl-8b"]
        end
    end
    Plugin -->|"OpenAI-compat. API"| LMServerLocal
Plugin settings (defaults):
| Setting | Value |
|---|---|
| model | qwen/qwen3.6-35b-a3b |
| baseUrl / embeddingBaseUrl | http://127.0.0.1:1234/v1 |
| embeddingModel | ggml-org/bge-m3-Q8_0-GGUF |
| qwen3VlModelPath | qwen/qwen3-vl-8b |
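
Before relying on these defaults, it can help to confirm that the server at `baseUrl` actually reports the configured models. The sketch below is one way to do that, assuming the local server exposes the OpenAI-compatible `GET /v1/models` listing and that the ids it reports match the values in the table exactly. The names `SETTINGS`, `MODEL_KEYS`, `models_url`, `missing_models`, and `fetch_model_ids` are hypothetical helpers for illustration, not part of the plugin.

```python
import json
import urllib.request

# Defaults copied from the settings table above.
SETTINGS = {
    "model": "qwen/qwen3.6-35b-a3b",
    "baseUrl": "http://127.0.0.1:1234/v1",
    "embeddingBaseUrl": "http://127.0.0.1:1234/v1",
    "embeddingModel": "ggml-org/bge-m3-Q8_0-GGUF",
    "qwen3VlModelPath": "qwen/qwen3-vl-8b",
}

# Settings whose values are model identifiers (hypothetical grouping).
MODEL_KEYS = ("model", "embeddingModel", "qwen3VlModelPath")


def models_url(base_url: str) -> str:
    """Return the OpenAI-compatible model-listing URL for a base URL."""
    return base_url.rstrip("/") + "/models"


def missing_models(settings: dict, available_ids: list) -> list:
    """Return configured model ids that the server did not report."""
    available = set(available_ids)
    return [settings[key] for key in MODEL_KEYS if settings[key] not in available]


def fetch_model_ids(base_url: str, timeout: float = 5.0) -> list:
    """Query GET <base_url>/models and return the listed model ids."""
    with urllib.request.urlopen(models_url(base_url), timeout=timeout) as resp:
        payload = json.load(resp)
    return [entry["id"] for entry in payload.get("data", [])]
```

With the server running, `missing_models(SETTINGS, fetch_model_ids(SETTINGS["baseUrl"]))` should return an empty list if every configured model is available. Whether the reported ids match the table verbatim is an assumption to verify against your own server.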
The LM Studio client and the local LM Studio backend share one machine, while the LM Studio Server used for agent inference runs on a separate, dedicated, more powerful machine.
Plugin settings:
| Setting | Value |
|---|---|
| model | qwen/qwen3.6-35b-a3b |
| baseUrl | http://127.0.0.1:1234/v1 |
| embeddingModel | ggml-org/bge-m3-Q8_0-GGUF |
| qwen3VlModelPath | qwen/qwen3-vl-8b |
| embeddingBaseUrl | http://127.0.0.1:1234/v1 |
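
To illustrate how `embeddingBaseUrl` and `embeddingModel` fit together, here is a minimal sketch of an embeddings call in the OpenAI-compatible request shape (`POST /embeddings` with `model` and `input`). It assumes the local server implements that endpoint. `build_embedding_request` and `parse_embeddings` are hypothetical helper names, not the plugin's own code.

```python
import json
import urllib.request


def build_embedding_request(base_url: str, model: str, texts: list) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style embeddings request."""
    url = base_url.rstrip("/") + "/embeddings"
    body = json.dumps({"model": model, "input": texts}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_embeddings(payload: dict) -> list:
    """Extract the embedding vectors from an OpenAI-style response body."""
    return [item["embedding"] for item in payload["data"]]


def embed(base_url: str, model: str, texts: list, timeout: float = 30.0) -> list:
    """Send the request and return one vector per input text."""
    request = build_embedding_request(base_url, model, texts)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return parse_embeddings(json.load(resp))
```

For example, `embed("http://127.0.0.1:1234/v1", "ggml-org/bge-m3-Q8_0-GGUF", ["hello"])` would return a list containing one vector, provided the embedding model is loaded on the local server.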
┌────────────────────────────────────────────────────────────────────┐
│                            MacBook Neo                             │
│                                                                    │
│  ┌──────────────────────────────────────────────────────────────┐  │  ┌───────────────────────────────┐
│  │                        LM Studio App                         │  │  │      Mac Studio M3 Ultra      │
│  │                                                              │  │  │                               │
│  │  ┌────────────────────────────────────────────────────────┐  │  │  │                               │
│  │  │  Plugin: user-docs                                     │  │  │  │  LM Studio Server             │
│  │  │                                                        │  │  │  │                               │
│  │  │  vision-capability-primer: qwen/qwen3-vl-4b            │  │  │  │                               │
│  │  └────────────────────────────────────────────────────────┘  │  │  │  Agent Model:                 │
│  │                              │                               │  │  │    qwen/qwen3.6-27b           │
│  │                              │  OpenAI-compat. API ───────────────▶│  http://<studio-ip>:1234/v1   │
│  │                              ▼                               │  │  │                               │
│  │  ┌────────────────────────────────────────────────────────┐  │  │  │                               │
│  │  │  LM Studio Server (local)                              │  │  │  │                               │
│  │  │  baseUrl: http://127.0.0.1:1234/v1                     │  │  │  │                               │
│  │  │  Agent Model: qwen/qwen3.6-27b                         │  │  │  │                               │
│  │  │  Embedding Model: ggml-org/bge-m3-Q8_0-GGUF            │  │  │  │                               │
│  │  │  Vision Model: qwen/qwen3-vl-8b                        │  │  │  │                               │
│  │  └────────────────────────────────────────────────────────┘  │  │  └───────────────────────────────┘
│  └──────────────────────────────────────────────────────────────┘  │
│                                                                    │
└────────────────────────────────────────────────────────────────────┘
graph LR
    subgraph MacBookNeo["MacBook Neo"]
        subgraph LMStudioApp["LM Studio App"]
            Plugin["Plugin: user-docs<br>vision-capability-primer:<br> qwen/qwen3-vl-4b"]
            LMServerLocal["LM Studio Server (local)<br>baseUrl: http://127.0.0.1:1234/v1<br>Agent Model: qwen/qwen3.6-27b<br>Embedding Model: ggml-org/bge-m3-Q8_0-GGUF<br>Vision Model: qwen/qwen3-vl-8b"]
        end
    end
    subgraph MacStudio["Mac Studio M3 Ultra"]
        LMSServerRemote["LM Studio Server<br><br>Agent Model:<br> qwen/qwen3.6-27b<br><br>http://<studio-ip>:1234/v1"]
    end
    Plugin -->|"OpenAI-compat. API"| LMServerLocal
    LMStudioApp -->|"OpenAI-compat. API"| LMSServerRemote
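
In the two-machine layout the remote server is addressed as `http://<studio-ip>:1234/v1`, so a quick reachability check from the MacBook can save some debugging. The following is a rough sketch under the assumption that the remote LM Studio Server accepts connections from the local network and answers the OpenAI-compatible `GET /v1/models` request. `remote_base_url` and `is_reachable` are hypothetical helper names, and the host must be supplied by you, since this document leaves `<studio-ip>` as a placeholder.

```python
import urllib.request


def remote_base_url(host: str, port: int = 1234) -> str:
    """Build the base URL of the remote server from its host or IP."""
    return f"http://{host}:{port}/v1"


def is_reachable(base_url: str, timeout: float = 3.0) -> bool:
    """Return True if GET <base_url>/models answers with HTTP 200."""
    url = base_url.rstrip("/") + "/models"
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except OSError:
        # Covers refused connections, timeouts, DNS failures, and HTTP errors
        # (urllib.error.URLError and HTTPError both derive from OSError).
        return False
```

Calling `is_reachable(remote_base_url(studio_ip))` with your Mac Studio's address in `studio_ip` returns `False` when the server is not running, the port is blocked, or the address is wrong. It does not tell you which of those is the cause.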