Meta Llama 3.1 8B

Meta

llama

The latest in Meta's long-running Llama series, Llama 3.1 is another jack of all trades and master of some, now in 8 languages and up to 128k tokens.

Model info

Model

Meta Llama 3.1 8B

Author

Meta

Arch

llama

Parameters

8B

Size on disk

about 4.92 GB

Format

gguf

Download and run Meta Llama 3.1 8B

Open in LM Studio to view download options

Use Meta Llama 3.1 8B in your code

💡 LM Studio needs to be installed and run at least once for this to work. Don't have it yet? Get it here.

CLI Bootstrap

npx lmstudio install-cli # (only needed once)

Model Load

lms load lmstudio-community/meta-llama-3.1-8b-instruct-gguf
Alternatively, load the model in the LM Studio app.

Use Meta Llama 3.1 8B via an OpenAI-like API

Reuse your existing OpenAI client code and point it to LM Studio instead.

Python example
# Example: reuse your existing OpenAI client code
from openai import OpenAI

# Point to the local server
client = OpenAI(base_url="http://localhost:1234/v1", 
                api_key="lm-studio") # not used

completion = client.chat.completions.create(
  model="lmstudio-community/meta-llama-3.1-8b-instruct-gguf",
  messages=[
    {"role": "system", "content": "Always answer in rhymes."},
    {"role": "user", "content": "Introduce yourself."}
  ],
  temperature=0.7,
)

print(completion.choices[0].message)

Develop

Learn more