README.md
A tool provider for LM Studio that enables AI models to browse and extract content from web pages using Puppeteer. This extension provides tools to fetch, clean, and return structured HTML content from URLs.
fetch or fetch_all.fetch_all.Before using this tool provider, ensure your environment meets the following requirements:
@lmstudio/sdk)libgbm-dev on Linux, or standard Chromium dependencies).To use this tool provider within LM Studio:
Fetches and cleans the content of a single web page.
url (string): A valid URL to fetch.{ "name": "fetch", "arguments": { "url": "https://example.com" } }
Fetches and cleans the content of multiple web pages concurrently.
urls (array): An array of valid URLs to fetch.{ "name": "fetch_all", "arguments": { "urls": ["https://example.com", "https://lmstudio.ai"] } }
To optimize the output for LLM consumption, the scraper performs aggressive cleaning:
<script>, <style>, <link>, <form>, <input>, <button>, <header>, <footer>, <nav>, <svg>, <iframe>, <audio>, <video>, etc.src and href.domcontentloaded and networkidle0.30 * 1000).