LangExtract is a Python library by Google that uses Large Language Models to extract structured information from unstructured text documents. It provides precise source grounding by mapping extractions to exact locations in source text, supports various LLMs including Gemini and local models via Ollama, and generates

9m read timeFrom github.com
Post cover image
Table of contents
Table of ContentsIntroductionWhy LangExtract?Quick StartInstallationAPI Key Setup for Cloud ModelsMore ExamplesContributingTestingDevelopmentTroubleshootingDisclaimer

Sort: