A robust OCR tool using the Llama 3.2-Vision model for high accuracy text recognition in images, preserving the original text formatting. It supports multiple image formats (JPG, JPEG, PNG) and allows customizable recognition prompts. Outputs can be in Markdown format with comprehensive error handling.

1 Comment

Sort: