Scrapes the given URL and extracts structured data using Gemini 2.5 Flash Lite.
Response includes:
markdown: Cleaned, readable markdown contenthtml: Cleaned HTML (main content only)rawHtml: Original unprocessed HTMLmetadata: Page metadata (title, description, og:*, author, dates, etc.)links: Internal and external links found on the pagejson: LLM-extracted structured data based on provided schemastats: Processing time statisticspages: Array of crawled pages (if maxDepth > 0)Required:
schema: JSON Schema defining the structure for extractionOptions:
prompt: Custom extraction prompt (max 2000 chars)onlyMainContent: Extract only main content (default: true)contentSelector: Custom CSS selector for content extractionmaxDepth: Number of additional pages to crawl (0-10, +2 credits each)depthPrompt: LLM prompt for selecting relevant linksexcludePaths: URL paths to exclude (e.g., [“/login”, “/admin”])includeOnlyPaths: Only crawl matching paths (e.g., [“/blog”])ignoreSitemap: Skip sitemap URL discovery (default: false)Browser Rendering:
Use waitFor (selector or ms) or actions to force browser rendering.
Credits: 5 + (maxDepth * 2) credits per request.
API key with 'ApiKey ' prefix (e.g., ApiKey ck_...)