Get Crawled Pages

GET /crawl/{jobId}/pages
Once a crawl job completes, this endpoint returns every page that was processed, along with the data the AI extracted. Each page record includes the original URL, the extraction status, and a data field whose shape matches your transformInstruction. The response also includes time-limited signed URLs for the raw HTML and markdown files stored in Spidra's object storage; these URLs are valid for one hour.

Example Request

curl https://api.spidra.io/api/crawl/abc-123/pages \
  -H "x-api-key: YOUR_API_KEY"
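The same request in Python, as a minimal sketch using only the standard library (the `build_pages_request` and `get_crawled_pages` helpers are illustrative, not part of any official SDK; the endpoint and header name are taken from the curl example above):

```python
import json
import urllib.request

API_BASE = "https://api.spidra.io/api"

def build_pages_request(job_id: str, api_key: str) -> urllib.request.Request:
    """Build the GET /crawl/{jobId}/pages request with the x-api-key header."""
    url = f"{API_BASE}/crawl/{job_id}/pages"
    return urllib.request.Request(url, headers={"x-api-key": api_key})

def get_crawled_pages(job_id: str, api_key: str) -> dict:
    """Fetch the endpoint and decode the JSON body: {"pages": [...]}."""
    req = build_pages_request(job_id, api_key)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Usage: `pages = get_crawled_pages("abc-123", "YOUR_API_KEY")["pages"]`.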

Response Fields

| Field | Type | Description |
| --- | --- | --- |
| pages | array | List of all pages processed by this job |
| pages[].id | string | Unique page ID. Pass it to the retry endpoint to re-run a failed page |
| pages[].url | string | The URL of this specific page |
| pages[].title | string | Page title as detected during crawling |
| pages[].status | string | `success`, `failed`, or `pending` |
| pages[].data | object or string | The AI-extracted data for this page; the shape matches your transformInstruction |
| pages[].error_message | string or null | Error details if status is `failed` |
| pages[].html_url | string or null | Signed URL to the raw HTML file (valid for 1 hour) |
| pages[].markdown_url | string or null | Signed URL to the markdown version of the page (valid for 1 hour) |
| pages[].created_at | string | ISO 8601 timestamp of when this page was processed |
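A lightweight typed wrapper can make the fields above easier to work with in application code; a sketch (the `PageRecord` class is illustrative, not part of any SDK):

```python
from dataclasses import dataclass
from typing import Any, Optional

@dataclass
class PageRecord:
    id: str
    url: str
    title: Optional[str]
    status: str                   # "success", "failed", or "pending"
    data: Any                     # shape follows your transformInstruction; None on failure
    error_message: Optional[str]  # set when status is "failed"
    html_url: Optional[str]       # signed URL, valid for 1 hour
    markdown_url: Optional[str]   # signed URL, valid for 1 hour
    created_at: str               # ISO 8601 timestamp

    @classmethod
    def from_dict(cls, raw: dict) -> "PageRecord":
        # Pick only the known fields, tolerating extras in the API response.
        return cls(**{f: raw.get(f) for f in cls.__dataclass_fields__})
```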

Example Response

{
  "pages": [
    {
      "id": "page-uuid-1",
      "url": "https://example.com/blog/how-to-scrape",
      "title": "How to Scrape the Web Without Getting Blocked",
      "status": "success",
      "data": {
        "title": "How to Scrape the Web Without Getting Blocked",
        "author": "Jane Smith",
        "published": "2025-11-20",
        "summary": "A guide to rotating proxies and handling JavaScript-heavy pages."
      },
      "error_message": null,
      "html_url": "https://storage.spidra.io/signed/...",
      "markdown_url": "https://storage.spidra.io/signed/...",
      "created_at": "2025-12-17T15:02:10Z"
    },
    {
      "id": "page-uuid-2",
      "url": "https://example.com/blog/javascript-rendering",
      "title": "JavaScript Rendering Explained",
      "status": "failed",
      "data": null,
      "error_message": "AI transformation failed: content too short to extract",
      "html_url": null,
      "markdown_url": null,
      "created_at": "2025-12-17T15:02:55Z"
    }
  ]
}

Handling Failed Pages

Pages with status: "failed" still appear in the response, so you get a full picture of what was and was not processed. The error_message field explains what went wrong. To re-run extraction on a specific failed page, pass its id to the retry endpoint available on your account.
The signed URLs for html_url and markdown_url expire after one hour. If you need permanent access to the raw content, download it promptly or use Download Crawl Results to get a full ZIP archive.
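One way to act on this, sketched in Python (field names come from the response above; the `partition_pages` and `failed_page_ids` helpers are illustrative, and the retry call itself is omitted since it goes through your account's retry endpoint):

```python
def partition_pages(pages: list[dict]) -> tuple[list[dict], list[dict]]:
    """Split page records into (succeeded, failed) by their status field."""
    succeeded = [p for p in pages if p["status"] == "success"]
    failed = [p for p in pages if p["status"] == "failed"]
    return succeeded, failed

def failed_page_ids(pages: list[dict]) -> list[str]:
    """Collect the page ids to pass to the retry endpoint."""
    return [p["id"] for p in pages if p["status"] == "failed"]
```

Pages still marked `pending` fall into neither bucket, so check for them separately if the job may not have fully settled.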

Authorizations

x-api-key (string, header, required)

Path Parameters

jobId (string, required)

Response

List of crawled pages with extracted data

pages (object[])