Gemma 2 2B

Google

gemma2

"Google's Llama", Gemma benefits from Google's experience training its flagship Gemini model to provide excellent performance on low power or for autocompletion/drafting tasks.

Model info

Model

Gemma 2 2B

Author

Google

Arch

gemma2

Parameters

2B

Size on disk

about 1.71 GB

Format

gguf

Download and run Gemma 2 2B

Open in LM Studio to view download options

Use Gemma 2 2B in your code

💡 LM Studio needs to be installed and run at least once for this to work. Don't have it yet? Get it here.

CLI Bootstrap

npx lmstudio install-cli # (only needed once)

Model Load

lms load lmstudio-community/gemma-2-2b-it-gguf
Alternatively, load the model in the LM Studio app.

Use Gemma 2 2B via an OpenAI-like API

Reuse your existing OpenAI client code and point it to LM Studio instead.

Python example
# Example: reuse your existing OpenAI client code
from openai import OpenAI

# Point to the local server
client = OpenAI(base_url="http://localhost:1234/v1", 
                api_key="lm-studio") # not used

completion = client.chat.completions.create(
  model="lmstudio-community/gemma-2-2b-it-gguf",
  messages=[
    {"role": "system", "content": "Always answer in rhymes."},
    {"role": "user", "content": "Introduce yourself."}
  ],
  temperature=0.7,
)

print(completion.choices[0].message)

Develop

Learn more