Website Downloader MCP Server

by pskill9

This MCP server provides a tool to download entire websites using wget. It preserves the website structure and converts links to work locally.

What is Website Downloader MCP Server?

The Website Downloader MCP Server lets you download entire websites for offline access. It uses wget to fetch a site recursively, preserving the original directory structure and rewriting links so that they work in the local copy.

How to use Website Downloader MCP Server?

To use the server, first install wget for your operating system (macOS, Linux, or Windows). Next, build the server by running npm install followed by npm run build. Then add the server to your MCP settings, with node as the command and the path to the built index.js file as the argument. Once the server is registered, call the download_website tool with the url parameter and, optionally, the outputPath and depth parameters.
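As a rough illustration of the settings step, the sketch below builds one possible settings entry and prints it as JSON. Only the command (node) and the single argument (the path to the built index.js) come from the description above. The top-level mcpServers key is a convention used by several MCP clients and is an assumption here, as are the server name website-downloader and the placeholder path. Python is used purely for illustration; check your client's documentation for the exact file layout.

```python
import json


def settings_entry(index_js_path: str, server_name: str = "website-downloader") -> dict:
    """Build a settings entry that launches the server with node.

    The "mcpServers" key and the default server name are assumptions;
    only "command" and "args" are taken from the description above.
    """
    return {
        "mcpServers": {
            server_name: {
                "command": "node",
                "args": [index_js_path],
            }
        }
    }


# Placeholder path: replace it with the real location of the built index.js.
print(json.dumps(settings_entry("/path/to/website-downloader/build/index.js"), indent=2))
```

The printed JSON can be merged into the settings file your MCP client reads.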

Key features of Website Downloader MCP Server

  • Downloads recursively, with unlimited depth by default

  • Includes all page requisites (CSS, images, etc.)

  • Converts links to work locally

  • Adds appropriate extensions to files

  • Restricts downloads to the same domain

  • Preserves the website structure
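The features above correspond closely to standard wget options. The following sketch shows one way such a command line could be assembled. Every flag is a documented wget option, but whether the server passes exactly this set is an assumption, and build_wget_args is a hypothetical helper name.

```python
from urllib.parse import urlparse


def build_wget_args(url: str, output_path: str = ".") -> list[str]:
    """Return a wget argument list covering the features listed above."""
    host = urlparse(url).hostname
    if host is None:
        raise ValueError(f"not an absolute URL: {url!r}")
    return [
        "wget",
        "--recursive",                        # follow links from the start page
        "--level=inf",                        # no depth limit (wget's own default is 5)
        "--page-requisites",                  # CSS, images, and other assets a page needs
        "--convert-links",                    # rewrite links so the copy works offline
        "--adjust-extension",                 # add .html/.css suffixes where missing
        f"--domains={host}",                  # stay on the starting domain
        f"--directory-prefix={output_path}",  # directory the mirror is written to
        url,
    ]


print(build_wget_args("https://example.com/docs/"))
```

The resulting list can be passed to subprocess.run as-is, which avoids shell quoting problems with unusual URLs.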

Use cases of Website Downloader MCP Server

  • Archiving websites for offline access

  • Creating local copies of websites for development or testing

  • Downloading documentation or resources for offline use

  • Analyzing website structure and content

  • Backing up websites

FAQ about Website Downloader MCP Server

What is wget?

Wget is a free software package for retrieving files using HTTP, HTTPS, and FTP, the most widely used Internet protocols. It is a non-interactive command-line tool, so it can easily be called from scripts, cron jobs, terminals without X-Windows support, and so on.

What operating systems are supported?

The server supports macOS, Linux (Debian/Ubuntu, Red Hat/Fedora), and Windows, provided that wget is installed on the system.

How do I specify the output directory?

Use the outputPath parameter of the download_website tool to set the directory where the website is saved. If outputPath is omitted, the site is saved in the current directory.
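For reference, an MCP client invokes a tool by sending a JSON-RPC tools/call request. The sketch below builds such a request for download_website and leaves out outputPath and depth when they are not supplied, so the server's defaults apply. The helper name make_tool_call, the example URL, and the ./mirror directory are illustrative assumptions; in practice the MCP client constructs this message for you.

```python
import json
from typing import Optional


def make_tool_call(url: str, output_path: Optional[str] = None,
                   depth: Optional[int] = None, request_id: int = 1) -> dict:
    """Build a JSON-RPC "tools/call" request for the download_website tool."""
    arguments: dict = {"url": url}
    if output_path is not None:
        arguments["outputPath"] = output_path  # omitted: current directory
    if depth is not None:
        arguments["depth"] = depth             # omitted: unlimited depth
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "download_website", "arguments": arguments},
    }


print(json.dumps(make_tool_call("https://example.com", output_path="./mirror"), indent=2))
```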

How do I limit the download depth?

Use the depth parameter of the download_website tool to limit how far the download recurses. A depth of 0 downloads only the specified page, a depth of 1 also downloads the pages it links to directly, and so on. If depth is omitted, the recursion depth is unlimited.
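The depth semantics described here can be sketched as a mapping onto wget options. One detail to keep in mind is that wget itself treats --level=0 as unlimited, so this sketch expresses a depth of 0 by leaving recursion off entirely. The function depth_flags is hypothetical, and how the server actually translates the parameter is an assumption.

```python
from typing import Optional


def depth_flags(depth: Optional[int] = None) -> list[str]:
    """Translate the tool's depth parameter into wget recursion flags."""
    if depth is None:
        # Parameter omitted: recurse without a depth limit.
        return ["--recursive", "--level=inf"]
    if depth < 0:
        raise ValueError("depth must be zero or a positive integer")
    if depth == 0:
        # Only the specified page: no recursion flags at all,
        # because wget reads --level=0 as "unlimited".
        return []
    return ["--recursive", f"--level={depth}"]


for value in (None, 0, 1, 3):
    print(value, depth_flags(value))
```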

What happens if I don't have wget installed?

The server will not function correctly without wget. You need to install wget based on your operating system before using the server.
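A quick way to confirm that wget is available before starting the server is to look it up on the PATH. This is a minimal sketch using Python's standard library; the helper name find_executable is an illustrative choice, and the server itself may or may not perform a similar check.

```python
import shutil
from typing import Optional


def find_executable(name: str = "wget") -> Optional[str]:
    """Return the full path of an executable on PATH, or None if it is missing."""
    return shutil.which(name)


path = find_executable()
if path is None:
    print("wget not found on PATH; install it before starting the server")
else:
    print(f"wget found at {path}")
```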