Firecrawl
Firecrawl is a tool for scraping web pages and retrieving their content in several formats, including full-page screenshots.
Scrape
Submit a scraping request to Firecrawl to retrieve webpage content and data in specified formats.
Fields
- URL: The URL of the webpage you wish to scrape. This should be the full web address, including http:// or https://. Ensure the URL is correct and accessible. Example: https://docs.example.com.
- Output Formats: Specify the formats you want the content in. Options include: markdown, html, rawHtml, links, screenshot, extract, screenshot@fullPage. Separate multiple formats with commas. Example: markdown, html.
- Only Main Content: Indicate if only the main content of the page should be returned, excluding headers, navs, footers. Set to true or false. Default is true.
- Request Timeout: Set the maximum time in milliseconds for the request to be completed. Default is 30000. Ensure you provide a valid integer.
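The field rules above can be checked before a request is submitted. The following is a minimal sketch, not Firecrawl's own code: the function name `build_scrape_request` and the dictionary keys it returns are hypothetical, and treating the timeout as strictly positive is an assumption. Only the format names and the defaults come from the field descriptions.

```python
from urllib.parse import urlparse

# Format names taken from the Output Formats field description.
ALLOWED_FORMATS = {
    "markdown", "html", "rawHtml", "links",
    "screenshot", "extract", "screenshot@fullPage",
}


def build_scrape_request(url, output_formats,
                         only_main_content=True, request_timeout=30000):
    """Validate the Scrape fields and return them as a dict.

    Hypothetical helper: the key names below are illustrative only.
    """
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("URL must be a full address with http:// or https://")

    # Split the comma-separated list and trim whitespace around each entry.
    formats = [f.strip() for f in output_formats.split(",") if f.strip()]
    unknown = [f for f in formats if f not in ALLOWED_FORMATS]
    if not formats or unknown:
        raise ValueError(f"Missing or unsupported output formats: {unknown}")

    if not isinstance(only_main_content, bool):
        raise ValueError("Only Main Content must be true or false")

    # bool is a subclass of int in Python, so it is rejected explicitly.
    # Requiring a value above zero is an assumption of this sketch.
    if (isinstance(request_timeout, bool)
            or not isinstance(request_timeout, int)
            or request_timeout <= 0):
        raise ValueError("Request Timeout must be a positive integer (ms)")

    return {
        "url": url,
        "formats": formats,
        "only_main_content": only_main_content,
        "timeout_ms": request_timeout,
    }


request = build_scrape_request("https://docs.example.com", "markdown, html")
print(request)
```

Here the example values from the field descriptions are passed in, and the defaults for Only Main Content and Request Timeout are left in place.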
Output
The output will be the scraped content of the webpage in the specified formats. If multiple formats are requested, each will be provided as a separate output.
Scrape - Full page screenshot
Scrape a web page and capture a screenshot with optional wait time.
Fields
- URL: The URL of the web page you want to scrape and capture a screenshot of. Make sure the URL is valid and accessible. Example: https://example.com.
- Only Main Content: If set to true, only the main content of the page will be returned, excluding elements like headers, footers, and navigation bars. Default is true.
- Wait Time: Specify wait time in milliseconds before executing further actions. This allows time for page elements to fully load. Default is 0. Provide a valid integer.
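The screenshot fields can be validated the same way. This is a sketch under stated assumptions: `build_screenshot_request` and its dictionary keys are hypothetical names, and rejecting negative wait times is an assumption. The fixed screenshot@fullPage format and the defaults come from this section.

```python
from urllib.parse import urlparse


def build_screenshot_request(url, only_main_content=True, wait_time=0):
    """Validate the full-page screenshot fields and return them as a dict.

    Hypothetical helper: the key names below are illustrative only.
    """
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("URL must be a full address with http:// or https://")

    if not isinstance(only_main_content, bool):
        raise ValueError("Only Main Content must be true or false")

    # bool is a subclass of int in Python, so it is rejected explicitly.
    # Disallowing negative values is an assumption of this sketch.
    if (isinstance(wait_time, bool)
            or not isinstance(wait_time, int)
            or wait_time < 0):
        raise ValueError("Wait Time must be a non-negative integer (ms)")

    return {
        "url": url,
        # This action always returns a full-page screenshot.
        "formats": ["screenshot@fullPage"],
        "only_main_content": only_main_content,
        "wait_ms": wait_time,
    }


request = build_screenshot_request("https://example.com", wait_time=2000)
print(request)
```

A wait of 2000 ms is used here only as an illustration of giving slow page elements time to load; the default of 0 captures the page without waiting.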
Output
The output will be a full-page screenshot of the specified web page, captured after the optional wait time. The screenshot will be provided in the format specified (screenshot@fullPage).