memory

A retrieval-augmented memory for local models. Before the model answers, this prompt-preprocessor retrieves the most relevant snippets from a folder of your notes/docs and injects them as context — grounding answers in your own knowledge base.

Part of lmstudio-suite.

How it works

On each message it embeds your query with an LM Studio embedding model, finds the top matching chunks from your knowledge directory (cosine similarity over a local vector index), and prepends them to the prompt. Indexing is cached and only re-runs when files change. Any failure passes your message through unchanged — retrieval never blocks you.

Configuration

Global:

Tool	What it does
`remember`	Save a fact as a markdown note (frontmatter + tags) under `memories/`.
`recall`	Keyword-search saved memories and return matches with their ids.
`forget`	Delete a saved memory by id.

memory

memory

memory

How it works

Configuration

Writable memory (closing the loop)

Use