Skip to main content

Run Llama 3.1 in LM Studio

LM Studio Team

Meta's newest Llama: Llama 3.1 is here!

TLDR: Relatively small, fast, and supremely capable open-weights model you can run on your laptop.

MetaAI's newest generation of their Llama models, Llama 3.1, is now available.

How to download and run Llama 3.1 locally in your LM Studio

  • Install LM Studio 0.2.28 from https://lmstudio.ai
  • Search for Meta-Llama-3.1-8B-Instruct-GGUF or use this direct download link.
  • When the download is complete, go ahead and load the model.
  • That's it! Now you're running Llama 3.1 locally.

If you're a developer, you can also use Llama 3.1 via LM Studio's built-in OpenAI-like server. See the docs for more details.

What's new with Llama 3.1?

  • New longer context window supporting up to 128k tokens.
  • Now available in 3 different sizes, including a new 405B parameter flagship model, and upgraded 70B & 8B versions.
  • Available across 8 different languages, including English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
  • Competitive with other leading, closed-source foundational models, including GPT-4, GPT-4o, and Claude 3.5 Sonnet.

Llama 3.1 comes in three sizes: 8B, 70B, and 405B.

NameDescription
8BRelatively small, fast model, and supremely capable LLM you can run on your laptop.
70BMedium-large variant that enables diverse use cases that may require more complex reasoning.
405BThe most extensively trained open LLM to date. Requires an extremely capable setup to run

To use MetaAI's Llama 3.1 in LM Studio, download or update to LM Studio 0.2.28 or later.

Download LM Studio from the LM Studio website.