Features
Structured Data Extraction
Use the extractRules
parameter to extract specific content from the page using CSS selectors. This is useful when you want structured JSON output without parsing raw HTML manually.
The value is a simple object where each key is the name of the field you want, and the value is the CSS selector used to extract it.
Use extractRules
when you need fast, lightweight structured data without running a full AI model.
Format
Each selector will return the text content of the matched element.
Example: Extract Page Title
This extracts the text of the first <h1>
on the page and returns it under the title
key.
Example: Extract Multiple Fields
This returns:
Attribute Extraction
You can extract an attribute by using @attribute
syntax:
Notes
- Only the first match per selector is returned
- If the selector is not found, the value will be
null