Datalab's Marker and OCR models are now available on Replicate for document parsing and text extraction. Marker converts PDFs, DOCX, PPTX, and images into markdown or JSON, handling tables, math, code, and structured data extraction via JSON schemas. OCR detects text in 90 languages and returns reading order and table grids.
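
To make the structured-extraction idea concrete, here is a minimal sketch of a JSON Schema and the kind of input payload one might assemble around it. The schema fields, the `build_marker_input` helper, and the payload keys (`file`, `output_format`, `page_schema`) are all assumptions for illustration and are not taken from the post; the model's page on Replicate lists the real input names.

```python
import json

# Hypothetical JSON Schema naming the fields to extract from an invoice.
INVOICE_SCHEMA = {
    "type": "object",
    "properties": {
        "invoice_number": {"type": "string"},
        "total": {"type": "number"},
        "line_items": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {
                    "description": {"type": "string"},
                    "amount": {"type": "number"},
                },
                "required": ["description", "amount"],
            },
        },
    },
    "required": ["invoice_number", "total"],
}


def build_marker_input(file_url, schema, output_format="json"):
    """Assemble an input payload for a document-parsing model.

    The key names below are assumed, not confirmed by the source.
    """
    if output_format not in ("markdown", "json"):
        raise ValueError(f"unsupported output format: {output_format}")
    return {
        "file": file_url,                    # assumed key: document to parse
        "output_format": output_format,      # assumed key: markdown or json
        "page_schema": json.dumps(schema),   # assumed key: schema as a JSON string
    }


payload = build_marker_input("https://example.com/invoice.pdf", INVOICE_SCHEMA)
print(json.dumps(payload, indent=2))
```

With the Replicate Python client, a payload like this would typically be passed as the `input` argument of `replicate.run`, once the key names have been checked against the model's published inputs.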

3m read time From replicate.com
Table of contents
Run Marker · Run OCR · Structured extraction · Performance · Pricing
