Get Scrape Job Status
Scrape Endpoints
Get Scrape Job Status
Poll for job progress and results
GET
Get Scrape Job Status
Documentation Index
Fetch the complete documentation index at: https://docs.spidra.io/llms.txt
Use this file to discover all available pages before exploring further.
Polling Pattern
Scrape jobs are processed asynchronously. When you submit a job you get ajobId back immediately. You then poll this endpoint every 2-5 seconds until status is completed or failed.
Status Values
| Status | Meaning |
|---|---|
waiting | In queue, not started yet |
active | Running right now |
completed | Done, results are ready |
failed | Something went wrong, check error |
Response Structure
Whenstatus is completed, everything you need is inside result.
result.content
This is the main output field. What it contains depends on whether you provided aprompt:
- With
prompt: the AI-extracted result, formatted according tooutput("markdown"or"json") - Without
prompt: the raw scraped page content as markdown
content still returns the raw markdown as a fallback, and ai_extraction_failed is set to true so you can detect this.
result.data
An array with one entry per URL you submitted. Each entry contains:| Field | Description |
|---|---|
url | The URL that was scraped |
title | The page title from the browser |
markdownContent | The full raw scraped content for this URL as markdown. If you used forEach, this contains all the collected items formatted as ## Item 1, ## Item 2, etc. |
success | true if the page was scraped successfully, false if it failed |
screenshotUrl | URL to the screenshot on S3, or null if you did not request one |
result.stats
Timing and usage information for the job.| Field | Description |
|---|---|
durationMs | How long the whole job took in milliseconds |
captchaSolvedCount | Number of CAPTCHAs that were automatically solved |
inputTokens | Tokens sent to the AI model |
outputTokens | Tokens returned from the AI model |
totalTokens | Total tokens used (input + output) |
Failed Jobs
Whenstatus is failed, the error field contains the reason:
Authorizations
Path Parameters
The job ID returned from POST /scrape

