Perform document ocr
POST/documents/:documentId/ocr
Document optical character recognition (OCR) request; extract text and data from a document;
Tesseract available for all editions, but Textract engine and tables and forms options available as an Add-On Module
Request
Path Parameters
Document Identifier
Query Parameters
Site Identifier
- application/json
Body
Array [
]
textractQueries
object[]
Natural-language question
Label to identify this query in the result
Page selection
OCR Parse types - TEXT, FORMS, TABLES, QUERIES
Rewrite PDF document, converting any Image text to searchable text
Type of OCR Engine to use
Possible values: [TESSERACT, TEXTRACT]
Number of pages to OCR (from start) (-1 all)
OCR Engine output format (textract table only)
Possible values: [CSV]
Responses
- 200
200 OK
Response Headers
Access-Control-Allow-Origin
string
Access-Control-Allow-Methods
string
Access-Control-Allow-Headers
string
- application/json
- Schema
- Example (from schema)
Schema
OCR processing message
{
"message": "string"
}