Download Crawl Results

Downloads a ZIP archive of all successfully crawled pages from a completed job. Each page is saved as one or more files inside the archive, organized by hostname and path. Use the include parameter to control which content types are bundled in the ZIP.

Content Types

Value	What is included
`html`	Raw HTML file for each page
`markdown`	Markdown version of each page
`data`	AI-extracted data in JSON, CSV, or Markdown format (format is auto-detected from your `transformInstruction`)

If you omit the include parameter, all three types are included by default.

Example Requests

# Download everything (HTML, markdown, and extracted data)
curl -OJ "https://api.spidra.io/api/crawl/abc-123/download" \
  -H "x-api-key: YOUR_API_KEY"

# Download only the extracted data
curl -OJ "https://api.spidra.io/api/crawl/abc-123/download?include=data" \
  -H "x-api-key: YOUR_API_KEY"

# Download markdown and data only
curl -OJ "https://api.spidra.io/api/crawl/abc-123/download?include=markdown,data" \
  -H "x-api-key: YOUR_API_KEY"

ZIP Archive Structure

When a single content type is requested, files are placed at the root of the archive with appropriate extensions:

crawl-abc-123.zip
  example.com_blog_post-one.json
  example.com_blog_post-two.json

When multiple content types are requested, each page gets its own folder:

crawl-abc-123.zip
  example.com_blog_post-one/
    data.json
    index.html
    markdown.md
  example.com_blog_post-two/
    data.json
    index.html
    markdown.md

Response

The response is a binary ZIP file with the following headers:

Header	Value
`Content-Type`	`application/zip`
`Content-Disposition`	`attachment; filename=crawl-{jobId}.zip`

Only pages with status: "success" are included in the download. If no successful pages exist, the API returns a 404 error.

Authorizations

x-api-key

string

header

required

Path Parameters

jobId

string

required

The ID of the completed crawl job to download

Query Parameters

include

string

Comma-separated list of content types to include in the ZIP. Accepted values: html, markdown, data. Defaults to all three. Example: include=data,markdown

Response

ZIP archive containing the crawl results

The response is of type file.

Using the API

Scrape Endpoints

Logs

Crawl Endpoints

Account

Download Crawl Results

Content Types

Example Requests

ZIP Archive Structure

Response

Authorizations

Path Parameters

Query Parameters

Response

Using the API

Scrape Endpoints

Logs

Crawl Endpoints

Account

Documentation Index

​Content Types

​Example Requests

​ZIP Archive Structure

​Response

Authorizations

Path Parameters

Query Parameters

Response

Content Types

Example Requests

ZIP Archive Structure

Response