Lakehouse monitoring example notebook: InferenceLog classification analysis
User requirements
- You must have access to run commands on a cluster with access to Unity Catalog.
- You must have `USE CATALOG` privilege on at least one catalog, and you must have `USE SCHEMA` privileges on at least one schema. This notebook creates tables in the `main.default` schema. If you do not have the required privileges on the `main.default` schema, you must edit the notebook to change the default catalog and schema to ones that you do have privileges on.
System requirements:
- Your workspace must be enabled for Unity Catalog.
- Databricks Runtime 12.2 LTS ML or above.
- A Single user or Assigned cluster.
This notebook illustrates how to train and deploy a classification model and monitor its corresponding batch inference table.
For more information about Lakehouse monitoring, see the documentation (AWS | Azure).
Setup
- Verify cluster configuration
- Install Python SDK
- Define catalog, schema, model and table names
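A minimal sketch of this setup, with illustrative table and model names (assumptions, not the notebook's exact values; adjust them to a catalog and schema you have privileges on). The install lines shown in comments are the usual notebook pattern for the Python SDK rather than something prescribed here:

```python
# The Databricks Python SDK is typically installed in its own notebook cell, e.g.:
#   %pip install -U databricks-sdk
#   dbutils.library.restartPython()

# Illustrative names (assumptions; change to a catalog/schema you have privileges on)
CATALOG = "main"
SCHEMA = "default"
model_name = f"{CATALOG}.{SCHEMA}.adult_census_model"            # hypothetical registered model name
inference_table = f"{CATALOG}.{SCHEMA}.adult_census_inference"   # table the monitor will be attached to
baseline_table = f"{CATALOG}.{SCHEMA}.adult_census_baseline"     # baseline / reference table
```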
Helper methods
The functions here are for cleanup in case the notebook has been run multiple times. You would not typically use these functions in a normal setup.
Background
The following are required to create an inference log monitor:
- A Delta table in Unity Catalog that you own. The data can be batch scored data or inference logs. The following columns are required:
  - `timestamp` (TimeStamp): Used for windowing and aggregation when calculating metrics.
  - `model_id` (String): Model version/id used for each prediction.
  - `prediction` (String): Value predicted by the model.
- The following column is optional:
  - `label` (String): Ground truth label.
You can also provide an optional baseline table to track performance changes in the model and drifts in the statistical characteristics of features.
- To track performance changes in the model, consider using the test or validation set.
- To track drifts in feature distributions, consider using the training set or the associated feature tables.
- The baseline table must use the same column names as the monitored table, and must also have a `model_version` column.
Databricks recommends enabling Delta's Change-Data-Feed (AWS|Azure) table property for better metric computation performance for all monitored tables, including the baseline table. This notebook shows how to enable Change Data Feed when you create the Delta table.
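For example, the property can be set when the Delta table is first created via a writer option; this is a sketch using the illustrative names defined above, with the SQL `ALTER TABLE` form shown as an alternative for existing tables:

```python
# Sketch: enable Change Data Feed at table-creation time (df and table names are illustrative)
(df
    .write
    .format("delta")
    .mode("overwrite")
    .option("delta.enableChangeDataFeed", "true")
    .saveAsTable(inference_table))

# For a table that already exists, the property can be set with SQL instead:
# spark.sql(f"ALTER TABLE {inference_table} SET TBLPROPERTIES (delta.enableChangeDataFeed = true)")
```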
User Journey
- Create Delta table: Read raw input and features data and create training and inference sets.
- Train a model and register it in the MLflow Model Registry.
- Generate predictions on the test set and create the baseline table.
- Generate predictions on `scoring_df1`. This is the inference table.
- Create the monitor on the inference table and analyze profile/drift metrics and fairness and bias metrics.
- Simulate drifts in 3 relevant features in `scoring_df2` and generate/materialize predictions.
- Add/Join ground-truth labels to the monitoring table and refresh the monitor.
- [Optional] Calculate custom metrics.
- [Optional] Delete the monitor.
1. Read dataset and prepare data
Dataset used for this example: UCI's Adult Census
- Add a dummy identifier
- Clean and standardize missing values
1.1 Split data
Split data into a training set, baseline test table, and inference table.
- The baseline test data will serve as the table with reference feature distributions.
- The inference table will then be split into two dataframes, `scoring_df1` and `scoring_df2`: they will function as new incoming batches for scoring. We will further simulate drifts on the `scoring_df`(s).
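A minimal sketch of such a split, assuming a prepared DataFrame named `data_df` (the fractions and seed are assumptions, not the notebook's exact values):

```python
# Split into training, baseline test, and inference sets (fractions and seed are illustrative)
train_df, baseline_test_df, inference_df = data_df.randomSplit([0.60, 0.20, 0.20], seed=42)
```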
2. Train a random forest model
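A minimal sketch of this step, assuming scikit-learn for the model and MLflow for registration; the feature lists, label column (`income`), hyperparameters, and registry URI are assumptions rather than the notebook's exact choices:

```python
import mlflow
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder

# Illustrative feature lists for the Adult Census data
categorical_cols = ["workclass", "education", "marital_status", "occupation", "gender"]
numeric_cols = ["age", "hours_per_week"]

pipeline = Pipeline([
    ("prep", ColumnTransformer(
        [("cat", OneHotEncoder(handle_unknown="ignore"), categorical_cols)],
        remainder="passthrough")),
    ("rf", RandomForestClassifier(n_estimators=100, random_state=42)),
])

train_pdf = train_df.toPandas()
mlflow.set_registry_uri("databricks-uc")  # assumption: registering to Unity Catalog
with mlflow.start_run():
    pipeline.fit(train_pdf[categorical_cols + numeric_cols], train_pdf["income"])
    mlflow.sklearn.log_model(pipeline, artifact_path="model", registered_model_name=model_name)
```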
4. Generate predictions on incoming scoring data
Example pre-processing step
- Extract ground-truth labels (in practice, labels might arrive later)
- Split into two batches
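A sketch of these two steps, assuming the `inference_df` from the split above has an `id` column and an `income` label (column names and fractions are assumptions):

```python
# Keep the ground-truth labels aside; in practice they often arrive later
labels_df = inference_df.select("id", "income")
scoring_features_df = inference_df.drop("income")

# Split into two batches that will be scored at different points in time
scoring_df1, scoring_df2 = scoring_features_df.randomSplit([0.5, 0.5], seed=42)
```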
4.1 Write scoring data with predictions out
- Add `model_version` column and write to the table that we will attach a monitor to.
- Add ground-truth `label_col` column with empty/NaN values.

Set `mergeSchema` to `True` to enable appending dataframes without the label column available.
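A sketch of this write, assuming a scored DataFrame named `scoring_df1_with_pred` and the illustrative table names defined above (the model version literal is a placeholder):

```python
from pyspark.sql import functions as F

(scoring_df1_with_pred
    .withColumn("model_version", F.lit("1"))                # placeholder model version/id
    .withColumn("label_col", F.lit(None).cast("string"))    # ground truth not yet available
    .write
    .format("delta")
    .mode("append")
    .option("mergeSchema", "true")
    .saveAsTable(inference_table))
```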
5. Create the monitor
Use `InferenceLog` type analysis.
Make sure to drop any column that you don't want to track or that doesn't make sense from a business or use-case perspective; alternatively, create a VIEW containing only the columns of interest and monitor that view instead.
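A minimal sketch of the monitor creation, assuming a recent `databricks-sdk` where monitors are exposed as `quality_monitors` (older SDK versions used `lakehouse_monitors`); the column names, granularity, output schema, and assets directory are assumptions:

```python
from databricks.sdk import WorkspaceClient
from databricks.sdk.service.catalog import MonitorInferenceLog, MonitorInferenceLogProblemType

w = WorkspaceClient()
monitor_info = w.quality_monitors.create(
    table_name=inference_table,
    inference_log=MonitorInferenceLog(
        timestamp_col="timestamp",
        granularities=["1 day"],
        model_id_col="model_version",          # the column written out in section 4.1
        prediction_col="prediction",
        label_col="label_col",
        problem_type=MonitorInferenceLogProblemType.PROBLEM_TYPE_CLASSIFICATION,
    ),
    baseline_table_name=baseline_table,
    output_schema_name=f"{CATALOG}.{SCHEMA}",              # where the metrics tables are written
    assets_dir="/Workspace/Shared/lakehouse_monitoring",   # any workspace path you can write to
)
```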
5.1 Inspect the metrics tables
By default, the metrics tables are saved in the default database.
The `create_monitor` call created two new tables: the profile metrics table and the drift metrics table.
These two tables record the outputs of analysis jobs. The tables use the same name as the primary table to be monitored, with the suffixes `_profile_metrics` and `_drift_metrics`.
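For example, once the initial analysis has completed, the tables can be read directly; the names below simply follow the suffix convention described above and assume the metrics tables land next to the monitored table:

```python
# Inspect the generated metrics tables (names are illustrative)
profile_df = spark.table(f"{inference_table}_profile_metrics")
drift_df = spark.table(f"{inference_table}_drift_metrics")
display(profile_df)
display(drift_df)
```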
Orientation to the profile metrics table
The profile metrics table has the suffix `_profile_metrics`. For a list of statistics that are shown in the table, see the documentation (AWS|Azure).
- For every column in the primary table, the profile table shows summary statistics for the baseline table and for the primary table. The column `log_type` shows `INPUT` to indicate statistics for the primary table, and `BASELINE` to indicate statistics for the baseline table. The column from the primary table is identified in the column `column_name`.
- For `TimeSeries` type analysis, the `granularity` column shows the granularity corresponding to the row. For baseline table statistics, the `granularity` column shows `null`.
- The table shows statistics for each value of each slice key in each time window, and for the table as a whole. Statistics for the table as a whole are indicated by `slice_key` = `slice_value` = `null`.
- In the primary table, the `window` column shows the time window corresponding to that row. For baseline table statistics, the `window` column shows `null`.
- Some statistics are calculated based on the table as a whole, not on a single column. In the column `column_name`, these statistics are identified by `:table`.
Orientation to the drift metrics table
The drift metrics table has the suffix `_drift_metrics`. For a list of statistics that are shown in the table, see the documentation (AWS|Azure).
- For every column in the primary table, the drift table shows a set of metrics that compare the current values in the table to the values at the time of the previous analysis run and to the baseline table. The column `drift_type` shows `BASELINE` to indicate drift relative to the baseline table, and `CONSECUTIVE` to indicate drift relative to a previous time window. As in the profile table, the column from the primary table is identified in the column `column_name`.
  - At this point, because this is the first run of this monitor, there is no previous window to compare to, so there are no rows where `drift_type` is `CONSECUTIVE`.
- For `TimeSeries` type analysis, the `granularity` column shows the granularity corresponding to that row.
- The table shows statistics for each value of each slice key in each time window, and for the table as a whole. Statistics for the table as a whole are indicated by `slice_key` = `slice_value` = `null`.
- The `window` column shows the time window corresponding to that row. The `window_cmp` column shows the comparison window. If the comparison is to the baseline table, `window_cmp` is `null`.
- Some statistics are calculated based on the table as a whole, not on a single column. In the column `column_name`, these statistics are identified by `:table`.
6. Create data drift(s) in 3 features
Simulate distribution changes for `workclass`, `gender`, and `hours_per_week`, as sketched below.
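A sketch of such a simulation on `scoring_df2`; the exact transformations are assumptions, meant only to shift these three distributions:

```python
from pyspark.sql import functions as F

scoring_df2_drifted = (
    scoring_df2
    .withColumn("workclass", F.lit("Private"))                                    # collapse categories
    .withColumn("gender", F.when(F.rand(seed=42) < 0.8, F.lit("Female"))
                           .otherwise(F.col("gender")))                           # shift class proportions
    .withColumn("hours_per_week", (F.col("hours_per_week") * 1.5).cast("int"))    # scale a numeric feature
)
```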
6.1 Generate predictions on drifted observations and update inference tables
- Add the column `model_id`
7. (Ad-hoc) Join/Update ground-truth labels to inference table
Note: if the ground-truth value can change for a given id over time, consider also joining/merging on the timestamp column.
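A sketch of the merge, assuming a `labels_df` of ids and late-arriving labels as in section 4 (the key and column names are assumptions):

```python
from delta.tables import DeltaTable

target = DeltaTable.forName(spark, inference_table)
(target.alias("t")
    .merge(labels_df.alias("l"), "t.id = l.id")     # add "AND t.timestamp = l.timestamp" if labels can change over time
    .whenMatchedUpdate(set={"label_col": "l.income"})
    .execute())
```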
8. Refresh metrics and inspect the dashboard
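With the SDK client from section 5, a metrics refresh can be triggered roughly like this (a sketch, not the notebook's exact call):

```python
# Trigger a refresh after new data or labels land in the monitored table
refresh_info = w.quality_monitors.run_refresh(table_name=inference_table)
print(refresh_info.state)
```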
9. [Optional] Delete the monitor
Uncomment the following line of code to clean up the monitor (if you wish to run the quickstart on this table again).
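Under the same assumptions as the SDK sketch in section 5, the cleanup call would look roughly like the following (left commented, matching the note above):

```python
# w.quality_monitors.delete(table_name=inference_table)
```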