BerriAI's LiteLLM is a Python SDK and proxy server that allows users to call over 100 LLM APIs using the OpenAI format. It provides features like translation of inputs, consistent output, retry/fallback logic, and budget/rate limits per project. It supports synchronous, asynchronous, and streaming responses. The proxy server enables load balancing and tracking of expenditures across projects. The latest stable release can be installed via Docker, and the system supports various logging and observability tools.
Table of contents
🚅 LiteLLMUsage ( Docs )LiteLLM Proxy Server (LLM Gateway) - ( Docs )EnterpriseSupport / talk with foundersWhy did we build thisContributorsSort: