MLflow AI Gateway now supports configurable guardrails that enforce content policies at the gateway layer, before requests reach LLMs or responses reach users. Three built-in types are available: Safety (toxicity filtering), PII Detection, and Custom. Each guardrail can be set either to block requests with an HTTP 400 error or to sanitize (redact) the offending content. Guardrails run in their configured order on each endpoint, can be attached to the request (before) or response (after) stage of the pipeline, and use a separate LLM judge model for evaluation. Configuration is done through the MLflow UI, with no application code changes required.
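Because enforcement happens at the gateway, the only client-visible effect of a blocking guardrail is the HTTP 400 it returns. Below is a minimal sketch of handling that from a caller's perspective, assuming a local gateway at localhost:5000 with a chat endpoint; the URL, payload shape, and error body here are illustrative, not the exact MLflow schema.

```python
import requests

# Hypothetical gateway endpoint; adjust host and endpoint name
# to match your deployment.
GATEWAY_URL = "http://localhost:5000/gateway/chat/invocations"

payload = {
    "messages": [
        # Deliberately includes a placeholder SSN to trip a PII guardrail.
        {"role": "user", "content": "My SSN is 123-45-6789, can you store it?"}
    ]
}

resp = requests.post(GATEWAY_URL, json=payload, timeout=30)

if resp.status_code == 400:
    # A blocking guardrail rejected the request; the response body
    # typically describes the violated policy (exact shape may vary).
    print("Request blocked by guardrail:", resp.text)
else:
    resp.raise_for_status()
    # With a sanitizing guardrail, the call succeeds and the offending
    # content arrives redacted instead of triggering an error.
    print(resp.json())
```

The same handling works for response-stage guardrails: a blocking one surfaces as an error, while a sanitizing one simply returns redacted output.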
Table of contents
- How Guardrails Work
- Built-in Types
- Creating a Guardrail
- What Blocking Looks Like
- Managing Guardrails
- Getting Started