Fetcher MCP logo

Fetcher MCP

by jae-jae

Fetcher MCP is an MCP server that fetches web page content using the Playwright headless browser, enabling JavaScript support and intelligent content extraction. It supports both HTML and Markdown output formats and allows for parallel processing of multiple URLs.

View on GitHub

Last updated: N/A

What is Fetcher MCP?

Fetcher MCP is a server that utilizes Playwright to fetch web page content, supporting JavaScript execution and intelligent content extraction. It's designed to overcome the limitations of traditional web scrapers by handling dynamic web content.

How to use Fetcher MCP?

You can run Fetcher MCP directly with npx -y fetcher-mcp. First-time users need to install the required browser with npx playwright install chromium. Debug mode can be enabled with the --debug option. It can be configured in Claude Desktop by adding a configuration in the claude_desktop_config.json file.

Key features of Fetcher MCP

  • JavaScript Support

  • Intelligent Content Extraction

  • Flexible Output Format (HTML and Markdown)

  • Parallel Processing

  • Resource Optimization

  • Robust Error Handling

  • Configurable Parameters

Use cases of Fetcher MCP

  • Extracting content from dynamic websites

  • Batch processing of multiple URLs

  • Integrating web scraping into downstream applications

  • Automated data collection

  • Fetching content for AI models

FAQ from Fetcher MCP

How do I handle websites with anti-crawler mechanisms?

Use the prompt 'Please wait for the page to fully load' to enable waitForNavigation: true and increase the timeout duration with 'Please set the page loading timeout to 60 seconds'.

How do I preserve the original HTML structure?

Use the prompt 'Please preserve the original HTML content' which sets extractContent: false and returnHtml: true.

How do I fetch the complete webpage content?

Use the prompt 'Please fetch the complete webpage content instead of just the main content' which sets extractContent: false.

How do I return content in HTML format?

Use the prompt 'Please return the content in HTML format' which sets returnHtml: true.

How do I enable debug mode?

Use the prompt 'Please enable debug mode for this fetch operation' which sets debug: true. Alternatively, run the server with the --debug flag.