Extract text from a document using AI. Supports PDF, DOCX, ODT, and Markdown files. Output formats: “md” (raw standard markdown), “mdd” (Autype extended markdown with document settings and defaults), “json” (full Autype document JSON with sections). For PDF files with “md” format you can optionally specify which pages to process. Cost: 4 credits per page.
API Key (starts with ak_...)
File ID of the document to process (PDF, DOCX, ODT, or Markdown)
"550e8400-e29b-41d4-a716-446655440000"
Page specifications (e.g. "1", "2-5", "3-"). If omitted, all pages are processed. Only applicable to PDF files with "md" output format.
["1", "3-5"]
Output format. "md" returns raw standard markdown (Mistral OCR). "mdd" returns Autype extended markdown with document settings and defaults. "json" returns full Autype document JSON with sections. Note: "mdd" and "json" formats are only supported for PDF files — DOCX, ODT, and Markdown files always return "md" regardless of this setting.
md, mdd, json "md"
Optional webhook configuration
Lens OCR job created
Job ID
"550e8400-e29b-41d4-a716-446655440000"
Action that was performed
"pdf.merge"
Current job status
PENDING, PROCESSING, COMPLETED, FAILED "PENDING"
Input file IDs
["file-id-1", "file-id-2"]
Output file ID (available when COMPLETED)
null
Error message (available when FAILED)
null
Job creation timestamp
Job start timestamp
Job completion timestamp
Structured job result data (e.g. OCR markdown/JSON, generated filename, PDF metadata, form fields). Available when the job produces a direct result instead of an output file.
Deprecated — use result instead. Additional metadata, duplicated from result for backward compatibility.