Hermes 3 Llama 3.1 8B

NousResearch

llama

A fine-tune of Meta's Llama 3.1, Hermes is further trained on hand-curated datasets as well and synthetic data. Excels in dialogue and code generation.

Model info

Model

Hermes 3 Llama 3.1 8B

Author

NousResearch

Arch

llama

Parameters

8B

Size on disk

about 4.92 GB

Format

gguf

Download and run Hermes 3 Llama 3.1 8B

Open in LM Studio to view download options

Use Hermes 3 Llama 3.1 8B in your code

💡 LM Studio needs to be installed and run at least once for this to work. Don't have it yet? Get it here.

CLI Bootstrap

npx lmstudio install-cli # (only needed once)

Model Load

lms load nousresearch/hermes-3-llama-3.1-8b-gguf
Alternatively, load the model in the LM Studio app.

Use Hermes 3 Llama 3.1 8B via an OpenAI-like API

Reuse your existing OpenAI client code and point it to LM Studio instead.

Python example
# Example: reuse your existing OpenAI client code
from openai import OpenAI

# Point to the local server
client = OpenAI(base_url="http://localhost:1234/v1", 
                api_key="lm-studio") # not used

completion = client.chat.completions.create(
  model="nousresearch/hermes-3-llama-3.1-8b-gguf",
  messages=[
    {"role": "system", "content": "Always answer in rhymes."},
    {"role": "user", "content": "Introduce yourself."}
  ],
  temperature=0.7,
)

print(completion.choices[0].message)

Develop

Learn more