Configure AI Gateway on model serving endpoints

Note: This feature is in Preview.

In this article, you learn how to configure Mosaic AI Gateway on a model serving endpoint.

Requirements

  • A Databricks workspace in one of the following regions:
    • A region that supports external models.
    • A region that supports Foundation Model APIs, either provisioned throughput or pay-per-token.
  • A model serving endpoint. You can use one of the preconfigured pay-per-token endpoints in your workspace.

Configure AI Gateway using the UI

In the AI Gateway section of the endpoint creation page, you can individually configure AI Gateway features.

The following table summarizes how to configure AI Gateway during endpoint creation using the Serving UI.

Configure AI Guardrails in the UI

The following list describes how to enable each supported guardrail and any limits that apply.

  • Safety
    • How to enable: Select Safety to enable safeguards that prevent your model from interacting with unsafe and harmful content.
  • Personally identifiable information (PII) detection
    • How to enable: Select PII detection to detect PII such as names, addresses, and credit card numbers.
  • Valid topics
    • How to enable: Type topics directly into this field. If you have multiple entries, press Enter after each topic. Alternatively, you can upload a .csv or .txt file.
    • Details: A maximum of 50 valid topics can be specified. Each topic cannot exceed 100 characters.
  • Invalid keywords
    • How to enable: Type keywords directly into this field. If you have multiple entries, press Enter after each keyword. Alternatively, you can upload a .csv or .txt file.
    • Details: A maximum of 50 invalid keywords can be specified. Each keyword cannot exceed 100 characters.
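
If you prepare topic or keyword lists outside the UI, it can help to check them against the limits above before you enter or upload them. The following Python sketch is one way to do that. The function names and the dictionary layout are hypothetical and chosen only for illustration; they are not taken from this article or from a Databricks API. Dropping blank entries is likewise a choice made in this sketch.

```python
from typing import Dict, List

# Limits stated above for both valid topics and invalid keywords.
MAX_ENTRIES = 50
MAX_ENTRY_LENGTH = 100


def validate_entries(entries: List[str], label: str) -> List[str]:
    """Return trimmed, non-blank entries, or raise ValueError if a limit is exceeded."""
    # Assumption of this sketch: surrounding whitespace and blank entries are dropped.
    cleaned = [entry.strip() for entry in entries if entry.strip()]
    if len(cleaned) > MAX_ENTRIES:
        raise ValueError(
            f"{label}: {len(cleaned)} entries exceeds the maximum of {MAX_ENTRIES}"
        )
    for entry in cleaned:
        if len(entry) > MAX_ENTRY_LENGTH:
            raise ValueError(
                f"{label}: an entry exceeds {MAX_ENTRY_LENGTH} characters: {entry[:20]}..."
            )
    return cleaned


def build_guardrails(
    valid_topics: List[str],
    invalid_keywords: List[str],
    safety: bool = False,
    pii_detection: bool = False,
) -> Dict[str, object]:
    """Collect the four guardrail settings in one dictionary.

    The key names are hypothetical; they mirror the UI labels, not a real API schema.
    """
    return {
        "safety": safety,
        "pii_detection": pii_detection,
        "valid_topics": validate_entries(valid_topics, "valid topics"),
        "invalid_keywords": validate_entries(invalid_keywords, "invalid keywords"),
    }
```

For example, `build_guardrails(["billing", "returns"], ["password"], safety=True)` returns a dictionary with both lists checked, while a list of 51 topics raises `ValueError` before anything is entered in the UI.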
