MLX
•
mistral
A state-of-the-art 12B model with 128k context length, built in collaboration with NVIDIA, and released under the Apache 2.0 license
Model info
Model
Mistral Nemo Instruct 2407 4bit
Author
mlx-community
Arch
mistral
Parameters
12B
Format
safetensors
Size on disk
about 6.91 GB
Download the model using lms
— LM Studio's developer CLI.
lms get mistral-nemo
curl http://localhost:1234/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "mistral-nemo",
"messages": [
{ "role": "system", "content": "Always answer in rhymes." },
{ "role": "user", "content": "Introduce yourself." }
],
"temperature": 0.7,
"max_tokens": -1,
"stream": true
}'
lms log stream
to see your prompts as they are sent to the LLM.lmstudio.js
- LM Studio SDK documentation (TypeScript)lms log stream
- Stream server logslms
- LM Studio's CLI documentation