Submit a document for OCR processing to extract text, detect tables, forms, and other features. Supports PDFs, images, and scanned documents. Returns a job ID that can be used to track processing status.
API key starting with sk_case_
URL or S3 path to the document to process
"https://example.com/contract.pdf"
Optional custom document identifier
"contract-2024-001"
URL to receive completion webhook
"https://your-app.com/webhooks/ocr-complete"
OCR engine to use
doctr, paddleocr "doctr"
OCR features to extract
{
"text": true,
"tables": true,
"forms": false
}
S3 bucket to store results
"my-ocr-results"
S3 key prefix for results
"ocr/2024/"
OCR job created successfully
Unique job identifier
Current job status
queued, processing, completed, failed Document identifier
OCR engine used
Number of pages detected
Job creation timestamp
Estimated completion time