unknown

A robust OCR tool using the Llama 3.2-Vision model for high accuracy text recognition in images, preserving the original text formatting. It supports multiple image formats (JPG, JPEG, PNG) and allows customizable recognition prompts. Outputs can be in Markdown format with comprehensive error handling.

bytefer/ollama-ocr: Implementing OCR with a local visual model run by ollama.

Get 1% better daily. Development resource sharing made easy. Post, comment, or upvote to show your contribution and support other developers! 🙌

⚠️ Please do not post any promotional resources.

Dev Squad

Ollama-OCR makes it easy to recognize high-quality text with just a few lines of code!

Features
- 🚀 High accuracy text recognition using Llama 3.2-Vision model
- 📝 Preserves original text formatting and structure
- 🖼️ Supports multiple image formats: JPG, JPEG, PNG
- ⚡️ Customizable recognition prompts and models
- 🔍 Markdown output format option
- 💪 Robust error handling

<p>Ollama-OCR makes it easy to recognize high-quality text with just a few lines of code!

Features
- 🚀 High accuracy text recognition using Llama 3.2-Vision model
- 📝 Preserves original text formatting and structure
- 🖼️ Supports multiple image formats: JPG, JPEG, PNG
- ⚡️ Customizable recognition prompts and models
- 🔍 Markdown output format option
- 💪 Robust error handling</p>

<p>Ollama-OCR also supports customization of the visual models supported by Ollama, such as minicpm-v. Further Reading: <a href="https://dev.to/bytefer/ollama-ocr-for-high-precision-ocr-with-ollama-4o31" target="_blank" rel="noopener nofollow">Ollama-OCR for High-Precision OCR with Ollama</a></p>