High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic for scalable extraction.
table-extraction pymupdf document-extraction azure-openai intelligent-document-processing gpt4-vision rag-pipeline vision-ocr complex-layout-analysis batch-ocr text-digitization
-
Updated
Nov 29, 2025 - Python