Draw Things Chat – LM Studio Plugin

Image generation plugin for LM Studio using a Draw Things backend (HTTP or gRPC).

Tools provided:

generate_image — Text-to-Image and Image-to-Image with variants, previews, and file outputs
review_image — Request a one-shot re-view of specific earlier images (attachments/variants/images/pictures)

Draw Things gRPC uses strict TLS with bundled certs.

Requirements

Node.js >= 18
LM Studio (plugin support + local server if using the orchestrator)
Optional: Draw Things app/API (for Draw Things backend)

Setup and Running

Running in LM Studio (local plugin)

This repo is intended to be loaded locally via the runner script at .lmstudio/production.js.

Build once: npm install && npm run build
Point LM Studio at .lmstudio/production.js (it loads dist/index.js)

The script checks for toolchain prerequisites (on macOS), installs dependencies, and runs a smoke test for sharp.

Build

The build pipeline uses SWC to transpile to dist-temp/src, then Rollup bundles it to dist/index.js (for the plugin) and dist/cli.js (for command-line use). Runtime assets (protos, TLS certs, helpers) are copied to dist/interfaces/** and dist/helpers/**.

CLI usage (optional):

Run directly: node dist/cli.js
Note: there is no npm start script; the plugin is normally run via LM Studio.

Core Library

The shared library draw-things-chat-core is temporarily not included in this repository. Since revision 21, the core source files are bundled into a single src/core-bundle.mjs via esbuild. This was necessary because the LM Studio Hub enforces a 128-file limit per plugin, and the plugin's own sources combined with the core files exceed that budget. Our request to raise the limit (sent to team@lmstudio.ai on 15 January 2026) has not been answered yet.

To set up the core bundle for local development:

After the first setup, any change in core only requires npm run build in draw-things-chat-core — the build script automatically re-bundles and syncs core-bundle.mjs + fpzip files into this repo's src/.

The following generated files are gitignored and must be synced from core:

src/core-bundle.mjs — bundled core library (~300 KB)
src/core-bundle.d.mts — TypeScript type stub

Development with LM Studio

To run in development mode, which requires the LM Studio CLI:

Maintenance scripts

Cleanup orphaned LM Studio working directories (macOS)

LM Studio removes ~/.lmstudio/conversations/<Chat-ID>.conversation.json and attachments, but can leave ~/.lmstudio/working-directories/<Chat-ID> behind.

This repo includes a small helper to find such orphan folders and (optionally) move them to the macOS Trash.

Serving generated images over HTTP

If you run a local HTTP server that serves files from LM Studio chat working directories, the plugin can return HTTP URLs (instead of file:// URLs) in tool responses.

Configuration:

Global plugin setting: HTTP_SERVER_PORT (default: 54760)
The plugin does not start a server; it health-checks http://127.0.0.1:<port>/__healthz and only emits HTTP links when:
- HTTP status is 200, and
- the response header x-mcp-image-server equals 1.

File Layout:

Originals: stored in ~/.lmstudio/working-directories/{chatId}/generated-image-*.png
Previews: stored in ~/.lmstudio/working-directories/{chatId}/preview-*.jpg

Preview Generation:

All previews use unified settings: 1024px max dimension, 85% JPEG quality
Single-pass encoding (no iterative quality reduction)
Format: preview-generated-image-{timestamp}-v{N}.jpg
For attachments: preview-attachment-image-{timestamp}.jpg

Global Plugin Settings (in LM Studio)

All user-configurable settings are defined in src/config.ts (single source of truth).

Tool output / links

PREVIEW_IN_CHAT (boolean; default: false; UI label: Simple Previews in Chat): When enabled, tool responses include inline preview images (simple mode).
HTTP_SERVER_PORT (number; default: 54760): If a compatible local image server is healthy, the tool returns http://127.0.0.1:<port>/<chatId>/<file> URLs instead of file:// URLs.

Draw Things connection

DRAW_THINGS_HOST (string; default: 127.0.0.1)
DRAW_THINGS_HTTP_PORT (number; default: 7860)
DRAW_THINGS_GRPC_PORT (number; default: 7859)

Orchestrator (agent model)

model (select; default: qwen/qwen3.5-35b-a3b): The local vision-capable model used by the generator/orchestrator.
baseUrl (string; default: http://127.0.0.1:1234/v1): LM Studio OpenAI-compatible server base URL.
apiKey (string; default: empty): Only needed if your local server requires auth.

Backends

Only Draw Things is supported.

On startup, the plugin probes both transports and auto-selects:

Prefer gRPC when reachable
Fall back to HTTP when gRPC is unavailable or fails to initialize

Draw Things HTTP

Host: DRAW_THINGS_HOST (plugin setting)
Port: DRAW_THINGS_HTTP_PORT (plugin setting; default 7860)

Draw Things gRPC

Recommended backend service: Self-host gRPCServerCLI
Host: DRAW_THINGS_HOST (plugin setting)
Port: DRAW_THINGS_GRPC_PORT (plugin setting; default 7859)
TLS uses bundled certificates located at src/interfaces/tls/** (copied to dist/).

Tools

`generate_image`

Parameters (minimal UI):

prompt: string
model: 'auto'|'z-image'|'qwen-image'|'flux'|'custom' (selects model overlay with optimized parameters)
imageFormat: 'square'|'landscape'|'portrait' (maps to width/height)

Edit Mode Parameters

mode='edit' enables multi-reference image editing using a canvas image plus optional moodboard references.

Important constraints:

Edit mode requires the Draw Things gRPC backend (HTTP does not support edit mode).
Legacy parameters source, sourceVariant, sourceAttachment are removed; use canvas and moodboard.
There is no auto-fill: only explicitly selected references are used.

Parameter	Type	Description	Example
`mode`	string	Must be `"edit"`	`"edit"`
`prompt`	string	Editing instruction; keep it short and action-oriented	`"add sunglasses"`
`canvas`	string	Primary reference image (required unless there is exactly one source available). Notation: `a1`, `v2`, `p1` (`a`/`v`/`p` shorthand → `*1`).	`"a1"`, `"v2"`
`moodboard`	string[]	Additional reference images (style/context). Same notation as `canvas`.	`["a2","v1"]`

Reference limit:

The selected model enforces a maxReferenceImages limit (canvas + moodboard). For qwen-image this is currently 4 (and flux is also 4).

Note: qwen-edit is not a user-selectable model id. Edit capabilities/limits depend on the effective backend model used (e.g. qwen-image edit models).

Examples:

Model Overlays:

The model parameter accepts symbolic names that map to complete parameter sets optimized for specific models:

auto: Use default parameters (no overlay; defaults currently use qwen-image edit models for edit mode and z-image for most other modes)
z-image: Maps to z_image_turbo_1.0_q8p.ckpt with optimized settings
qwen-image: Maps to qwen_image_2512_bf16_q8p.ckpt with optimized settings
: Maps to with optimized settings

Model overlays define all necessary parameters including:

Actual model filename (.ckpt file)
Optimal sampler, steps, guidance scale
LoRA weights and configurations
Other model-specific optimizations

The overlay system works for both HTTP and gRPC backends. When a model is selected, the symbolic name is used only for overlay lookup—the actual model filename from the overlay is sent to the backend.

MODEL_IDS vs MODEL_FAMILIES (architecture):

The codebase distinguishes between two lists:

MODEL_IDS (internal): All model identifiers including sub-categories like qwen-edit. Used for overlay lookups and internal routing (e.g. selectAutoModel("image2image") → "qwen-edit").
MODEL_FAMILIES (user-facing): Only the selectable families (, , , , ). Used for reports, snapshots, and UI model selection.

Sub-categories (like qwen-edit) are resolved internally but not exposed as separate user-selectable families. For example, when the user selects qwen-image and the mode is image2image, the system internally routes to qwen-edit overlays.

_dt_i2i_profile (internal routing):

For image2image and edit modes, the core layer sets _dt_i2i_profile on the service input:

_dt_i2i_profile: "img2img" → use defaultParamsDrawThingsImg2Img and getEffectiveOverlay(modelId, "img2img")
_dt_i2i_profile: "edit" → use defaultParamsDrawThingsEdit and getEffectiveOverlay(modelId, "edit")

This ensures that image2image + moodboard uses img2img defaults (e.g. strength: 0.9) instead of edit defaults (e.g. strength: 1), even though both modes share the generateImageEdit() code path for multi-reference inputs.

The field is typed in ImageGenerationParams (see src/services/schemas.ts) and stripped from user input via stripInternalToolKeys().

Model used → family (display-only metadata):

This repo can build a dtc.model-mapping-snapshot.v1 payload from the currently loaded Draw Things overlays + custom configs. That snapshot can be prefixed into a ceveyne/draw-things-index/index_image query so the index plugin can enrich search results:

model stays raw (machine-safe basename)
model_display becomes a human-friendly display string
model_use_hints.model_use_by_mode provides VALID mode+model tool args for generate_image

Constraints:

Display-only: does not change validation and does not accept filenames as user inputs.
No filename heuristics: families are derived from overlays/defaults/custom presets, not inferred from filename prefixes.

Token economy note (snapshot prefix):

When the orchestrator injects a leading model-mapping snapshot JSON object into index_image.query for tool execution, it strips that prefix back out when serializing the tool call into the agent-model prompt history. The stripping is signature-based (only JSON objects with schema: "dtc.model-mapping-snapshot.v1" are removed), and the remaining query text (the part authored by the model) is preserved verbatim.

Optional verification:

Generate a Markdown cross-check report: npx tsx scripts/model-mapping-report.ts
Writes scripts/model-mapping-report.md (supports --custom-configs and --out).

PREVIEW_IN_CHAT Behavior:

PREVIEW_IN_CHAT=true: Tool returns preview images as Markdown in the response.
PREVIEW_IN_CHAT=false (default): Tool returns only text/links (preview + original URLs and a compact JSON summary). The orchestrator can inject Markdown for generated images into the chat stream after the tool call completes.

Vision Promotion (pixels to the agent model):

Independently of UI display, the generator/orchestrator can “promote” recent images into the same tool-continuation turn by injecting a synthetic role=user message containing image_url parts (placed directly after the tool result).

How it’s controlled (see src/config.ts):

Promotion only happens when the selected orchestrator model supports vision.
visionPromotionPersistent=false (default): idempotent mode — promote only when there are new promotable items.
visionPromotionPersistent=true: persistent mode — promote up to 5 attachments + 3 variants every turn.

Tool-Result Image Harvesting (external tool images):

When a non-generate_image tool returns image URLs (for example from a web/image search tool), the orchestrator can “harvest” those images into the current chat working directory and inject a compact Markdown table with previews.

Only runs for allowlisted tool calls and HTTPS URLs.
Downloads are bounded (timeouts / max bytes) and previews are generated as small thumbnails.
Harvested images are stored as in chat state and can be referenced via / using notation (e.g. ).

Source selection (canvas):

Canvas / moodboard notation:

a1 = attachment 1, v2 = variant 2, p1 = picture 1
a// shorthand means //

Returns:

If PREVIEW_IN_CHAT=true: writes previews into the active chat folder and returns Markdown images.
Otherwise: returns text + links (no inline images in the tool result; UI display is handled by orchestrator injection).
Also includes links to originals/previews and a compact JSON summary.

Variants:

The requested number of variants is interpreted via variants (or aliases n / num_images) and validated centrally (1–3).
Draw Things maps this to batch_size and returns all images in one call.
All images returned are saved as generated-image-<timestamp>-v1..vN.png and previewed as preview-generated-image-...-v1..vN.*.

FPS Resolution (Video Models):

When the output is a video (e.g. LTX-2), the fps value for the gRPC payload is resolved in this exact priority order (highest → lowest):

Priority	Source	Notes
1	Tool parameter — `filtered.fps`	Explicit `fps` sent in the `generate_image` call. If the LLM provides it, it always wins unconditionally.
2	Capability map — `video.defaultFps` in `IMAGE_MODEL_CAPABILITY_MAP`	Set per backend model file in `src/capabilities.ts` (e.g. `ltx-2-distilled` → 25, derived from the Draw Things Model Zoo `framesPerSecondForModel`). Applies when no tool-parameter value was given.
3	`defaultParams.fps` (via `effective.fps`)	From , currently . Applies when neither the tool parameter nor the capability map supplies a value.

Custom Config / Model Overlay values for fps are blacklisted: fps is stripped from overlayParamsNoSize alongside width, height, batch_count, batch_size, batchCount, batchSize, upscaler, upscaler_scale, and upscalerScale. Overlay sources (both Custom Configs and Model Overlays) therefore have no effect on fps resolution.

Summary of the resolution expression (all three gRPC call sites):

effective.fps reflects defaultParams.fps (= 24) because overlay fps is blacklisted and ...filtered (which may carry tool-param fps) is spread over defaultParams in the effective object — but that path is shadowed by the filtered.fps check in priority 1 anyway.

File Outputs:

Originals: ~/.lmstudio/working-directories/{chatId}/generated-image-<timestamp>-vN.png
Previews: ~/.lmstudio/working-directories/{chatId}/preview-<basename>.jpg|webp

`review_image`

review_image schedules a one-shot re-view/promotion of existing media items already present in the chat working directory state (chat_media_state.json).

Accepted targets (canonical):

aN = attachment
vN = generated variant
iN = harvested tool-result image
pN = picture (e.g. Draw-Things index result)

Strict validation (default):

strict=true by default.
Any target that cannot be resolved to an existing record in chat_media_state.json is rejected.

Tolerated aliases (not advertised):

Some models tend to reference provenance fields (e.g. values taken from tool results like imagePaths or httpPreviewUrls) instead of using aN/vN/iN/pN.

We deliberately do not advertise these aliases in the tool schema/description, but we tolerate them for robustness:

Tool-result rewriting (agent-model input)

The plugin may rewrite tool results only for the agent model input, to keep the model aligned with stable notations and to avoid accidental reliance on unstable paths/URLs.

Draw-Things index (`ceveyne/draw-things-index/index_image`)

The original tool payload contains images[] entries with imagePaths[] and/or httpPreviewUrls[].

Each images[] entry may also include optional provenance metadata:

sourceInfo.type: one of generate_image_variant | attachment | saved_image | draw_things_project
sourceInfo.imageType (optional): producer-provided origin string (recent addition; used primarily for saved_image)

When rendering the Draw-Things index markdown table, sourceInfo.imageType is surfaced as an additional metadata field:

**Origin:** <imageType>

For the agent model, we add a strictly additive field to each images[] entry:

index: ["pN", ...]

The array is aligned by position with the union of imagePaths/httpPreviewUrls (same length as the max of those arrays). Each entry is either:

a directly usable review_image target (pN), derived from the stable picture index in chat_media_state.json, or
null when there is no path/preview at that position.

This is intentionally additive: all original fields remain unchanged.

Logs: logs/generate-image-plugin.log, logs/error.log, logs/generate-image-plugin.audit.jsonl

Image Processing (Image2Image + Edit)

When using mode='image2image' or mode='edit', the plugin normalizes every input image before it is sent to the backend. This applies uniformly to all source pools:

User attachments (a1, a2, ...)
Generated variants (v1, v2, ...)
Harvested pictures from supported external tool results (, , ...)

The processing is identical across transports and modes; only the number of inputs differs (moodboard multi-reference is gRPC-only).

Normalization Rules (per input image):

Output sizing: The backend renders at a constrained requested_effective size. If requested_raw exceeds render limits, the upscaler is enabled and the final output is postprocessed back to exact requested_raw dimensions.

Processing Flow:

Configuration:

All img2img/txt2img limits are centralized in src/services/drawthingsLimits.ts:

Error Handling:

Fail-fast approach: Jimp resize/crop failures throw exceptions (no silent fallbacks)
Dimension validation before backend send (auto-corrects non-compliant dimensions)
Dev runner only: may write debug buffers to /tmp/debug-i2i-buffer-*.png (the main TypeScript plugin code does not)

See src/core/tools.ts and src/helpers/imageUtils.ts for implementation details.

Notes on environment variables

This plugin exports the LM Studio plugin settings into process.env internally for compatibility with existing service code. Treat environment variables as internal wiring; user-facing configuration lives in src/config.ts and is set via LM Studio.

Troubleshooting

gRPC TLS: ensure dist/interfaces/tls/server_crt.crt + root_ca.crt exist.
Decoder helpers: dist/helpers/GRPCBin2PNG.js + dist/helpers/PNG2GRPCBin.js should exist (executed via ). Optional native binaries can be provided via /.

License

MIT

draw-things-chat