Firecrawl
Firecrawl is a tool for scraping web pages and retrieving their content in several formats, including markdown, HTML, extracted links, and full-page screenshots.
Scrape
Submit a scraping request to Firecrawl to retrieve webpage content and data in specified formats.
Fields
- URL: The URL of the webpage you wish to scrape. This should be the full web address, including http:// or https://. Ensure the URL is correct and accessible. Example: https://docs.example.com.
- Output Formats: Specify the formats you want the content in. Options include: markdown, html, rawHtml, links, screenshot, extract, screenshot@fullPage. Separate multiple formats with commas. Example: markdown, html.
- Only Main Content: Indicate whether only the main content of the page should be returned, excluding headers, navs, and footers. Set to true or false. Default is true.
- Request Timeout: Set the maximum time in milliseconds for the request to be completed. Default is 30000. Ensure you provide a valid integer.
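The field rules above can be checked before a request is submitted. The following is a minimal sketch, not Firecrawl's own client: the function name build_scrape_request and the dictionary keys it returns are hypothetical, and the requirements that at least one format is given and that the timeout is positive are assumptions beyond what the field descriptions state.

```python
from urllib.parse import urlparse

# Format names listed under the Output Formats field.
ALLOWED_FORMATS = {
    "markdown", "html", "rawHtml", "links",
    "screenshot", "extract", "screenshot@fullPage",
}


def build_scrape_request(url, output_formats, only_main_content=True, timeout_ms=30000):
    """Validate the Scrape fields and return them as a dict.

    The function name and dict keys are hypothetical; they are not
    taken from Firecrawl's documentation.
    """
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("URL must be a full address starting with http:// or https://")

    # Split the comma-separated string and drop surrounding whitespace.
    formats = [f.strip() for f in output_formats.split(",") if f.strip()]
    if not formats:
        # Assumption: an empty format list is treated as an error.
        raise ValueError("at least one output format is required")
    unknown = [f for f in formats if f not in ALLOWED_FORMATS]
    if unknown:
        raise ValueError(f"unsupported output formats: {unknown}")

    # bool is a subclass of int in Python, so reject it explicitly.
    # Assumption: a timeout of zero or less is not meaningful.
    if isinstance(timeout_ms, bool) or not isinstance(timeout_ms, int) or timeout_ms <= 0:
        raise ValueError("timeout must be a positive integer number of milliseconds")

    return {
        "url": url,
        "formats": formats,
        "only_main_content": bool(only_main_content),
        "timeout_ms": timeout_ms,
    }


request = build_scrape_request("https://docs.example.com", "markdown, html")
print(request)
```

Validating up front means a malformed URL or a misspelled format name is reported before any request is made, instead of surfacing later as a failed scrape.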
Output
The output will be the scraped content of the webpage in the specified formats. If multiple formats are requested, each will be provided as a separate output.
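As an illustration of "one output per requested format", the sketch below splits a result into separate (format, content) pairs. The shape of the result (a dictionary keyed by format name), the helper name split_outputs, and the sample content are all assumptions made for this example; the actual output structure may differ.

```python
def split_outputs(result, requested_formats):
    """Return one (format, content) pair per requested format, in request order.

    `result` is assumed to be a dict keyed by format name; this shape is
    hypothetical and used only for illustration.
    """
    outputs = []
    for name in requested_formats:
        if name not in result:
            raise KeyError(f"no output returned for requested format: {name}")
        outputs.append((name, result[name]))
    return outputs


# Placeholder data standing in for a scrape result.
sample_result = {
    "markdown": "# Title\n\nBody text.",
    "html": "<h1>Title</h1><p>Body text.</p>",
}

for name, content in split_outputs(sample_result, ["markdown", "html"]):
    print(name, len(content))
```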
Scrape - Full page screenshot
Scrape a web page and capture a screenshot with optional wait time.
Fields
- URL: The URL of the web page you want to scrape and capture a screenshot of. Make sure the URL is valid and accessible. Example: https://example.com.
- Only Main Content: If set to true, only the main content of the page will be returned, excluding elements like headers, footers, and navigation bars. Default is true.
- Wait Time: Specify the wait time in milliseconds before executing further actions. This allows time for page elements to fully load. Default is 0. Provide a valid integer.
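The screenshot fields can be validated in the same spirit. This is a minimal sketch under stated assumptions: build_screenshot_request and its dictionary keys are hypothetical names, the http:// or https:// prefix check is carried over from the Scrape action, and rejecting negative wait times is an assumption rather than a documented rule.

```python
def build_screenshot_request(url, only_main_content=True, wait_ms=0):
    """Validate the full-page screenshot fields and return them as a dict.

    The function name and dict keys are hypothetical; they are not
    taken from Firecrawl's documentation.
    """
    # Assumption: a valid URL here means a full http:// or https:// address.
    if not url.startswith(("http://", "https://")):
        raise ValueError("URL must start with http:// or https://")

    # bool is a subclass of int in Python, so reject it explicitly.
    # Assumption: a negative wait time is treated as invalid.
    if isinstance(wait_ms, bool) or not isinstance(wait_ms, int) or wait_ms < 0:
        raise ValueError("wait time must be a non-negative integer number of milliseconds")

    return {
        "url": url,
        # This action always captures the full page.
        "formats": ["screenshot@fullPage"],
        "only_main_content": bool(only_main_content),
        "wait_ms": wait_ms,
    }


# Wait 2 seconds so late-loading elements have time to render.
screenshot_request = build_screenshot_request("https://example.com", wait_ms=2000)
print(screenshot_request)
```

A non-zero wait time is mainly useful for pages that render content with JavaScript after the initial load; for static pages the default of 0 is usually enough.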
Output
The output will be a full-page screenshot of the specified web page, captured after the optional wait time. The screenshot will be provided in the screenshot@fullPage format.