Send a URL and CSS selectors. Get clean, structured data back. No scraping, no parsing, no HTML to wade through.
-H "X-API-Key: sk-..." -H "Content-Type: application/json" -d '{ "url": "https://news.ycombinator.com", "fields": [ {"name": "title", "selector": ".titleline a"}, {"name": "score", "selector": ".score", "multiple": true} ] }' { "success": true, "data": { "title": "Show HN: StructAPI", "score": ["127", "89", "342", ...] } }
Define exactly what fields you want using familiar CSS selectors. Extract text, attributes, or HTML — with single values or arrays.
No selectors? No problem. POST /auto and get title, headings, links, images, and paragraphs auto-extracted.
Every other scraping API returns raw HTML. We return clean JSON. Skip the parsing entirely.
Diffbot starts at $299/mo. We start at free. Same idea — structured extraction — without the enterprise price tag.
Free tier gives you 100 requests/month. No credit card. Just curl and go.
curl, Python, JavaScript — all in the docs. Copy-paste examples that just work.