read-pdf

Transcribe a PDF to Markdown by rasterising every page to a PNG and asking the vision-language model loaded in LM Studio to read it.

Why this exists

LM Studio has no built-in OCR, and the existing alternatives (MinerU, Marker, Docling…) require a heavy Python/Tesseract install. If you already have a VL model loaded, a PDF is just a stack of images — this plugin runs that loop, locally, with no network call.

Requirements

LM Studio with a vision-capable model loaded.
Tested OK with: Qwen3-VL 8B Q4 on a 4-page native PDF. Add your own combinations to this list as you try them.

Tool exposed

read_pdf_to_markdown({ path, pages?, page_render_scale?, transcription_hint?, include_page_separators? }) returns { markdown, pages_processed, pages_failed, total_chars, ms_total, ms_per_page, warnings } or { error } on hard failures (file missing, encrypted PDF, no VL loaded…).

pages accepts forms like "1", "1-3", , , .

read-pdf

read-pdf

Why this exists

Requirements

Tool exposed

Configuration

How fast

Limitations

License