⚙️ API Endpoints

1️⃣ Submit PDF for PageIndex Computation

  • Endpoint (POST): https://api.vectify.ai/pageindex
  • Description: Initiates the conversion of a PDF document into a structured hierarchical tree format. Immediately returns a task identifier (task_id).

Request Body:

| Parameter | Type   | Required | Description              |
|-----------|--------|----------|--------------------------|
| file      | binary | Yes      | PDF file to be processed |

Example Request:

import requests

# Open the PDF in binary mode and upload it as multipart form data.
with open('./2023-annual-report.pdf', 'rb') as f:
    response = requests.post(
        "https://api.vectify.ai/pageindex",
        headers={'api_key': 'YOUR_API_KEY_HERE'},
        files={'file': f}
    )

See here for the example PDF document.

Example Response:

{
  "task_id": "abc123def456"
}
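The request and response above can be wrapped in a small helper that returns the task_id directly. This is a minimal sketch, not an official client: the function name `submit_pdf`, the injectable `post` parameter (there so the helper can be exercised without network access), and the `ValueError` on a missing `task_id` are all assumptions made for illustration.

```python
SUBMIT_URL = "https://api.vectify.ai/pageindex"


def submit_pdf(pdf_path, api_key, post=None):
    """Upload a PDF and return the task_id from the response body.

    `post` is a hypothetical hook with the same call shape as
    requests.post; it defaults to requests.post when omitted.
    """
    if post is None:
        import requests  # third-party; imported lazily so tests can run offline
        post = requests.post

    # Send the file as multipart form data under the "file" field.
    with open(pdf_path, "rb") as f:
        response = post(
            SUBMIT_URL,
            headers={"api_key": api_key},
            files={"file": f},
        )

    # Raise on 4xx/5xx instead of trying to parse an error body.
    response.raise_for_status()

    body = response.json()
    if "task_id" not in body:
        raise ValueError(f"no task_id in response: {body!r}")
    return body["task_id"]
```

A typical call would be `task_id = submit_pdf('./2023-annual-report.pdf', 'YOUR_API_KEY_HERE')`.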

2️⃣ Check Status and Retrieve Results

  • Endpoint (POST): https://api.vectify.ai/pageindex/status
  • Description: Checks computation status and retrieves results once processing is complete.

Request Body:

| Parameter | Type   | Required | Description                  |
|-----------|--------|----------|------------------------------|
| task_id   | string | Yes      | Task ID from submit response |

Computation Status:

The status field in the response indicates the progress of the PDF processing task:

| Status     | Description                                    |
|------------|------------------------------------------------|
| queued     | Task is queued and waiting to begin processing |
| processing | Task is currently being processed              |
| completed  | Task processing is complete; results are ready |
| failed     | Task processing encountered an error           |

Example Request:

import requests

# Send the task_id from the submit response as a JSON body.
response = requests.post(
    "https://api.vectify.ai/pageindex/status",
    headers={'api_key': 'YOUR_API_KEY_HERE'},
    json={"task_id": "abc123def456"}
)

Example Response (Processing):

{
  "task_id": "abc123def456",
  "status": "processing"
}
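Because a task may stay queued or processing for a while, a client generally has to call the status endpoint repeatedly until it reports completed or failed. The following is a minimal polling sketch under stated assumptions: the names `wait_for_result` and `fetch_status_http`, the 5-second interval, and the 10-minute timeout are illustrative choices, not values documented by the API.

```python
import time

STATUS_URL = "https://api.vectify.ai/pageindex/status"


def fetch_status_http(task_id, api_key):
    """Call the status endpoint once and return the parsed JSON body."""
    import requests  # third-party; imported lazily so the loop can be tested offline

    response = requests.post(
        STATUS_URL,
        headers={"api_key": api_key},
        json={"task_id": task_id},
    )
    response.raise_for_status()
    return response.json()


def wait_for_result(task_id, fetch_status, interval=5.0, timeout=600.0,
                    sleep=time.sleep):
    """Poll until the task is completed or failed.

    `fetch_status` is any callable that takes a task_id and returns the
    status response as a dict. interval and timeout are assumed defaults.
    """
    deadline = time.monotonic() + timeout
    while True:
        body = fetch_status(task_id)
        status = body.get("status")
        if status == "completed":
            return body["result"]
        if status == "failed":
            raise RuntimeError(f"PageIndex task {task_id} failed")
        # "queued" and "processing" both mean: wait and ask again.
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task {task_id} still {status!r} after {timeout}s")
        sleep(interval)
```

It could be used as `tree = wait_for_result(task_id, lambda t: fetch_status_http(t, 'YOUR_API_KEY_HERE'))`.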

Example Response (Completed):

{
  "task_id": "abc123def456",
  "status": "completed",
  "result": [
    ...
    {
        "title": "Financial Stability",
        "node_id": "0006",
        "start_index": 21,
        "end_index": 22,
        "summary": "The Federal Reserve maintains financial stability by...",
        "child_nodes": [
            {
                "title": "Monitoring Financial Vulnerabilities",
                "node_id": "0007",
                "start_index": 22,
                "end_index": 28,
                "summary": "The Federal Reserve's monitoring focuses on..."
            },
            {
                "title": "Domestic and International Cooperation and Coordination",
                "node_id": "0008",
                "start_index": 28,
                "end_index": 31,
                "summary": "In 2023, the Federal Reserve collaborated internationally..."
            }
        ]
    }
    ...
  ]
}

See here for a complete example output structure generated by PageIndex from the above example PDF document.
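Since each node in result may contain its own child_nodes list, the tree can be walked recursively. The sketch below flattens it into an indented outline. The helper names `iter_nodes` and `outline` are hypothetical, and the sample data is an abridged copy of the response above with the summary fields left out; start_index and end_index are printed as-is because this section does not define their meaning.

```python
def iter_nodes(nodes, depth=0):
    """Yield (depth, node) pairs in document order, parents before children."""
    for node in nodes:
        yield depth, node
        # Leaf nodes in the example response have no "child_nodes" key.
        yield from iter_nodes(node.get("child_nodes", []), depth + 1)


def outline(nodes):
    """Return one indented line per node: id, title, and index range."""
    return [
        f"{'  ' * depth}{node['node_id']} {node['title']} "
        f"[{node['start_index']}-{node['end_index']}]"
        for depth, node in iter_nodes(nodes)
    ]


# Abridged sample taken from the completed response shown earlier.
sample = [
    {
        "title": "Financial Stability",
        "node_id": "0006",
        "start_index": 21,
        "end_index": 22,
        "child_nodes": [
            {
                "title": "Monitoring Financial Vulnerabilities",
                "node_id": "0007",
                "start_index": 22,
                "end_index": 28,
            },
            {
                "title": "Domestic and International Cooperation and Coordination",
                "node_id": "0008",
                "start_index": 28,
                "end_index": 31,
            },
        ],
    }
]

print("\n".join(outline(sample)))
```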


⚠️ API Response Codes

| Status Code | Description                                    | Resolution                                  |
|-------------|------------------------------------------------|---------------------------------------------|
| 200         | Request successful                             | -                                           |
| 400         | Bad request due to missing/invalid parameters  | Check request parameters                    |
| 401         | Unauthorized; invalid or missing API key       | Ensure API key is correct                   |
| 404         | Task or PDF file not found                     | Verify task_id and PDF path                 |
| 413         | File size too large                            | Use smaller file or contact support         |
| 500         | Internal server error                          | Retry later; if persistent, contact support |
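One way to act on these codes in client code is to turn any non-200 status into an exception that carries the suggested resolution. This is a sketch only: `PageIndexAPIError` and `check_status_code` are hypothetical names, and treating 500 as the only retryable code is an assumption drawn from the "Retry later" wording in the table.

```python
# Resolutions copied from the response-code table; 200 needs no action.
RESOLUTIONS = {
    400: "Check request parameters",
    401: "Ensure API key is correct",
    404: "Verify task_id and PDF path",
    413: "Use smaller file or contact support",
    500: "Retry later; if persistent, contact support",
}


class PageIndexAPIError(Exception):
    """Hypothetical exception type; not part of any official client."""

    def __init__(self, status_code, hint):
        super().__init__(f"HTTP {status_code}: {hint}")
        self.status_code = status_code
        self.hint = hint
        # Assumption: only 500 is described as worth retrying later.
        self.retryable = status_code == 500


def check_status_code(status_code):
    """Return None for 200; raise PageIndexAPIError for anything else."""
    if status_code == 200:
        return
    hint = RESOLUTIONS.get(status_code, "Undocumented status code")
    raise PageIndexAPIError(status_code, hint)
```

With the requests library, this would be called as `check_status_code(response.status_code)` before reading `response.json()`.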