These parameters control how the crawler behaves and which pages it visits.
List of starting URLs. The crawler will begin from these.

Maximum number of pages to scrape.
0 means no limit.

How many link levels to follow from each starting URL.
1 means only direct links. 2 means follow links from those pages, and so on.

Only follow URLs that match this regex.
Example: (blog/.+|about/.+) matches /blog/post-1, /about/company, etc.

Skip URLs that match this regex.
Example: (admin/.+|private/.+) skips /admin/login, /private/settings, etc.

outputFormat, extractRules, headers, proxyType, and others.
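
To make the interaction between the page limit, the link depth, and the two regex filters concrete, here is a minimal sketch in Python that simulates the traversal over a small in-memory link graph. It is not the crawler's actual implementation. The names `crawl_order`, `limit`, `max_depth`, `include`, `exclude`, and `SITE` are hypothetical, as are the example.com URLs. The sketch also assumes that the regexes are searched anywhere in the full URL, that the filters apply only to discovered links and not to the starting URLs, and that starting URLs count toward the page limit; the real service may differ on each of these points.

```python
import re
from collections import deque


def crawl_order(links, start_urls, limit=0, max_depth=1, include=None, exclude=None):
    """Return the URLs a breadth-first crawl would visit, in order.

    links      -- dict mapping a URL to the URLs it links to (stand-in for real pages)
    limit      -- maximum number of pages to visit; 0 means no limit
    max_depth  -- link levels to follow from each starting URL (1 = direct links only)
    include    -- regex; only follow discovered URLs that match it (assumed: re.search)
    exclude    -- regex; skip discovered URLs that match it (assumed: re.search)
    """
    include_re = re.compile(include) if include else None
    exclude_re = re.compile(exclude) if exclude else None

    def allowed(url):
        if include_re and not include_re.search(url):
            return False
        if exclude_re and exclude_re.search(url):
            return False
        return True

    visited = []
    seen = set(start_urls)
    queue = deque((url, 0) for url in start_urls)  # starting URLs sit at depth 0

    while queue:
        if limit and len(visited) >= limit:
            break  # page limit reached
        url, depth = queue.popleft()
        visited.append(url)
        if depth >= max_depth:
            continue  # do not follow links beyond the configured depth
        for nxt in links.get(url, []):
            if nxt not in seen and allowed(nxt):
                seen.add(nxt)
                queue.append((nxt, depth + 1))
    return visited


# Hypothetical site structure used only for illustration.
SITE = {
    "https://example.com/": [
        "https://example.com/blog/post-1",
        "https://example.com/about/company",
        "https://example.com/admin/login",
    ],
    "https://example.com/blog/post-1": ["https://example.com/blog/post-2"],
}

if __name__ == "__main__":
    order = crawl_order(
        SITE,
        ["https://example.com/"],
        max_depth=2,
        exclude=r"(admin/.+|private/.+)",
    )
    for url in order:
        print(url)
```

With a depth of 1 the simulated crawl stops at the pages linked directly from the starting URL; raising it to 2 also reaches /blog/post-2, and the exclude pattern keeps /admin/login out in both cases.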