LLM Sherpa provides open-source APIs to accelerate large language model projects. It supports various file formats and OCR integration, and now includes LayoutPDFReader, a PDF parser that preserves the document's hierarchical structure to address common PDF parsing issues. Users are encouraged to self-host using the provided Docker image.
