Firecrawl

by mendableai

Firecrawl is an API service that takes a URL, crawls it, and converts it into clean markdown or structured data. It gives AI applications scraping, crawling, and data extraction capabilities.

What is Firecrawl?

Firecrawl is an API service that crawls a URL and converts its content into clean markdown or structured data. It crawls all accessible subpages and returns clean data for each one, with no sitemap required.

How to use Firecrawl?

Firecrawl is available as a hosted API, and you can also self-host the backend. The API documentation and a playground are on the Firecrawl website. SDKs are available for Python, Node, Go, and Rust, and Firecrawl also integrates with LLM frameworks and low-code platforms.
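
As a rough illustration of calling the hosted API without an SDK, the sketch below builds a single-page scrape request using only the Python standard library. The base URL, the /scrape path, the "url" and "formats" field names, and the Bearer-token header are assumptions not stated on this page, so check them against the official API documentation before relying on them.

```python
import json
import urllib.request

# Assumed base URL and version prefix for the hosted API (not confirmed here).
API_BASE = "https://api.firecrawl.dev/v1"


def build_scrape_request(url, api_key, formats=("markdown",)):
    """Build (but do not send) a POST request for one page in the given formats."""
    # "url" and "formats" are assumed field names.
    payload = {"url": url, "formats": list(formats)}
    return urllib.request.Request(
        f"{API_BASE}/scrape",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


def scrape(url, api_key):
    """Send the request and return the decoded JSON response body."""
    request = build_scrape_request(url, api_key)
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)
```

Keeping request construction separate from sending makes it possible to inspect or log the payload without a network call or a valid key.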

Key features of Firecrawl

  • Scrape a URL and get its content in LLM-ready formats (markdown, structured data, screenshot, HTML, links, metadata)

  • Crawl a URL and all of its accessible subpages

  • Map a website and get all of its URLs

  • Extract structured data from single pages, multiple pages, or entire websites with AI

  • Handles the hard parts: proxies, anti-bot mechanisms, dynamic (JavaScript-rendered) content, output parsing, and orchestration

  • Customizability: exclude tags, crawl behind auth walls with custom headers, max crawl depth, etc.

  • Media parsing: PDFs, DOCX, images

  • Reliability first: designed to get the data you need, no matter how hard it is

  • Actions: click, scroll, input, wait and more before extracting data

  • Batching: scrape thousands of URLs at the same time with a new async endpoint
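
The crawl, map, and batch features listed above differ mainly in the request body they take. The sketch below shows one plausible shape for each body. The endpoint paths and the field names ("limit", "scrapeOptions", "urls", "formats") are assumptions rather than details given on this page, and the helper functions are hypothetical.

```python
import json

# Assumed endpoint paths; verify against the official API reference.
ENDPOINTS = {
    "crawl": "/v1/crawl",         # a URL plus its accessible subpages
    "map": "/v1/map",             # list the URLs found on a site
    "batch": "/v1/batch/scrape",  # many URLs in one asynchronous job
}


def crawl_payload(url, limit=100, formats=("markdown",)):
    """Body for a crawl job capped at `limit` pages (assumed field names)."""
    return {"url": url, "limit": limit, "scrapeOptions": {"formats": list(formats)}}


def map_payload(url):
    """Body for a map request, which only needs the site URL."""
    return {"url": url}


def batch_payload(urls, formats=("markdown",)):
    """Body for a batch scrape; duplicate URLs are dropped, order is kept."""
    unique_urls = list(dict.fromkeys(urls))
    return {"urls": unique_urls, "formats": list(formats)}


if __name__ == "__main__":
    bodies = {
        "crawl": crawl_payload("https://example.com", limit=10),
        "map": map_payload("https://example.com"),
        "batch": batch_payload(["https://example.com/a", "https://example.com/b"]),
    }
    for name, body in bodies.items():
        print(ENDPOINTS[name], json.dumps(body))
```

Because crawl and batch jobs are asynchronous, a real client would also need to poll for results or register a webhook. That part is omitted here.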

Use cases of Firecrawl

  • Building AI chat applications

  • Training AI models with web data

  • Extracting structured data for analysis

  • Monitoring website content changes

  • Creating knowledge bases from web sources

FAQ from Firecrawl

What formats does Firecrawl support?

Firecrawl supports markdown, structured data, screenshot, HTML, links, and metadata.

Does Firecrawl handle dynamic content?

Yes, Firecrawl handles dynamic content rendered with JavaScript.
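
For pages that only show their content after some interaction, the request can carry a list of actions to run before extraction. The sketch below is a hypothetical way to assemble such a list. The action type names ("wait", "click", "scroll", "write") and their fields are assumptions, not details confirmed on this page.

```python
def wait(milliseconds):
    """Pause so that JavaScript-rendered content has time to load."""
    return {"type": "wait", "milliseconds": int(milliseconds)}


def click(selector):
    """Click the element matched by a CSS selector."""
    return {"type": "click", "selector": selector}


def scroll(direction="down"):
    """Scroll the page; only 'up' and 'down' are accepted in this sketch."""
    if direction not in ("up", "down"):
        raise ValueError(f"unsupported scroll direction: {direction!r}")
    return {"type": "scroll", "direction": direction}


def write(text):
    """Type text into the currently focused input (assumed action name)."""
    return {"type": "write", "text": text}


def scrape_with_actions(url, actions, formats=("markdown",)):
    """Body for a scrape that runs `actions` in order before extracting data."""
    return {"url": url, "formats": list(formats), "actions": list(actions)}
```

A typical sequence would be `scrape_with_actions(url, [wait(2000), click("#load-more"), scroll()])`, where the selector is a placeholder for whatever element the target page uses.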

Can I customize the crawling process?

Yes, you can customize the crawling process by excluding tags, crawling behind authentication walls with custom headers, and setting a maximum crawl depth.
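
The three customizations mentioned above could be combined in a single crawl request along these lines. This is a sketch under assumptions: the field names "maxDepth", "scrapeOptions", "excludeTags", and "headers" are not given on this page, and the helper function is hypothetical.

```python
def custom_crawl_payload(url, max_depth, exclude_tags=(), auth_headers=None):
    """Build a crawl body with a depth cap, excluded tags, and custom headers."""
    if max_depth < 0:
        raise ValueError("max_depth must be zero or greater")

    scrape_options = {"formats": ["markdown"]}
    if exclude_tags:
        # HTML tags or selectors to strip from each page (assumed field name).
        scrape_options["excludeTags"] = list(exclude_tags)
    if auth_headers:
        # Headers sent with each page request, e.g. a session cookie.
        scrape_options["headers"] = dict(auth_headers)

    return {"url": url, "maxDepth": max_depth, "scrapeOptions": scrape_options}
```

For example, `custom_crawl_payload("https://example.com/docs", 2, exclude_tags=["nav", "footer"], auth_headers={"Cookie": "session=..."})` would limit the crawl to two levels, drop navigation and footer markup, and send a cookie with every page request.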

Is there a free tier?

Please refer to the Firecrawl website for pricing and free tier information.

How do I get an API key?

Sign up on firecrawl.dev to get an API key.
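
Once you have a key, a common pattern is to keep it out of source code and read it from the environment. The sketch below assumes an environment variable named FIRECRAWL_API_KEY and a Bearer-token header. Both are assumptions, so adjust them to match the official documentation.

```python
import os


def load_api_key(env=None, var_name="FIRECRAWL_API_KEY"):
    """Return the API key from the environment, or raise if it is missing."""
    env = os.environ if env is None else env
    key = env.get(var_name, "").strip()
    if not key:
        raise RuntimeError(f"Set the {var_name} environment variable first")
    return key


def auth_headers(api_key):
    """Headers to attach to each API request (assumed Bearer scheme)."""
    return {"Authorization": f"Bearer {api_key}"}
```

Passing a plain dictionary as `env` makes the loader easy to exercise in tests without touching the real environment.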

Firecrawl - MCP Server | MCP Directory