Skip to main content

Firecrawl

Firecrawl is a powerful tool designed to scrape web pages and retrieve content in various formats. It allows users to extract data efficiently and even capture full-page screenshots with ease.

Scrape

Submit a scraping request to Firecrawl to retrieve webpage content and data in specified formats.

Fields

  • URL: The URL of the webpage you wish to scrape. This should be the full web address, including http:// or https://. Ensure the URL is correct and accessible. Example: https://docs.example.com.

  • Output Formats: Specify the formats you want the content in. Options include: markdown, html, rawHtml, links, screenshot, extract, screenshot@fullPage. Separate multiple formats with commas. Example: markdown, html.

  • Only Main Content: Indicate if only the main content of the page should be returned, excluding headers, navs, footers. Set to true or false. Default is true.

  • Request Timeout: Set the maximum time in milliseconds for the request to be completed. Default is 30000. Ensure you provide a valid integer.

Output

The output will be the scraped content of the webpage in the specified formats. If multiple formats are requested, each will be provided as a separate output.

Scrape - Full page screenshot

Scrape a web page and capture a screenshot with optional wait time.

Fields

  • URL: The URL of the web page you want to scrape and capture a screenshot of. Make sure the URL is valid and accessible. Example: https://example.com.

  • Only Main Content: If set to true, only the main content of the page will be returned, excluding elements like headers, footers, and navigation bars. Default is true.

  • Wait Time: Specify wait time in milliseconds before executing further actions. This allows time for page elements to fully load. Default is 0. Provide a valid integer.

Output

The output will be a full-page screenshot of the specified web page, captured after the optional wait time. The screenshot will be provided in the format specified (screenshot@fullPage).