January 2025

These features and Databricks platform improvements were released in January 2025.

Note

Releases are staged. Your Databricks account might not be updated until a week or more after the initial release date.

AI Gateway now supports provisioned throughput (Public Preview)

January 10, 2025

Mosaic AI Gateway now supports Foundation Model APIs provisioned throughput workloads on model serving endpoints.

You can now enable the following governance and monitoring features on your model serving endpoints that use provisioned throughput:

  • Permission and rate limiting to control who has access and how much access.

  • Payload logging to monitor and audit data being sent to model APIs using inference tables.

  • Usage tracking to monitor operational usage on endpoints and associated costs using system tables.

  • AI Guardrails to prevent unwanted data and unsafe data in requests and responses.

  • Traffic routing to minimize production outages during and after deployment.