Basic example for Feature Engineering in Unity Catalog

This notebook illustrates how you can use Databricks Feature Engineering in Unity Catalog to create, store, and manage Unity Catalog Features to train ML models and make batch predictions, including with features whose value is only available at the time of prediction. In this example, the goal is to predict the wine quality using a ML model with a variety of static wine features and a realtime input.

This notebook shows how to:

Create a feature table and use it to build a training dataset for a machine learning model.
Modify the feature table and use the updated table to create a new version of the model.
Use the Databricks Features UI to determine how features relate to models.
Perform batch scoring using automatic feature lookup.

Requirements

Databricks Runtime 13.2 for Machine Learning or above.
- If you do not have access to Databricks Runtime for Machine Learning, you can run this notebook on Databricks Runtime 13.2 or above. To do so, run %pip install databricks-feature-engineering at the start of this notebook.

You can also use create_table without providing a dataframe, and then later populate the feature table using fe.write_table.

Example:

fe.create_table(
    name=table_name,
    primary_keys=["wine_id"],
    schema=features_df.schema,
    description="wine features"
)

fe.write_table(
    name=table_name,
    df=features_df,
    mode="merge"
)

To view the logged model, navigate to the MLflow Experiments page for this notebook. To access the Experiments page, click the Experiments icon on the left navigation bar:

Find the notebook experiment in the list. It has the same name as the notebook, in this case, "Basic example for Feature Engineering in Unity Catalog".

Click the experiment name to display the experiment page. The packaged Feature Engineering in UC model, created when you called fe.log_model appears in the Artifacts section of this page. You can use this model for batch scoring.

The model is also automatically registered in the Unity Catalog.

feature-store-with-uc-basic-example(Python)

Basic example for Feature Engineering in Unity Catalog

Requirements

Load dataset

Create a new catalog or reuse an existing catalog

Create a new schema in the catalog

Create the feature table

Train a model with Feature Engineering in Unity Catalog

Batch scoring

Modify feature table

Train a new model version using the updated feature table

Control permissions for and delete feature tables