bert-batch-inference-ai-query(Python)

Batch inference using a BERT model for named entity recognition

This notebook demonstrates how to do the following tasks:

  • Build a pyfunc model encapsulating a BERT language model for named entity recognition (NER).
  • Deploy the pyfunc model to a Mosaic AI Model Serving endpoint.
  • Perform batch inference using the ai_query function on the Mosaic AI Model Serving endpoint.

To test the model before deploying, run this notebook on a cluster with a GPU.

Download and import libraries

Download and install the latest versions of torch, torchvision, transformers, and mlflow.
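The original install cells are not preserved in this export; a minimal equivalent, assuming no pinned versions, would be:

```python
%pip install --upgrade torch torchvision transformers mlflow
```

```python
# Run in a separate cell: restart Python so the upgraded packages are picked up.
dbutils.library.restartPython()
```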

Set registry for the model

The following sets the model registry to use the Unity Catalog model registry.
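With MLflow this is a one-liner:

```python
import mlflow

# Point the MLflow client at the Unity Catalog model registry.
mlflow.set_registry_uri("databricks-uc")
```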

Define PyFunc to load and create pipeline

Define an MLflow pyfunc to take input text and return the NER results from the model.
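A minimal sketch of such a wrapper follows; the exact checkpoint is not preserved in this export, so `dslim/bert-base-NER` is an assumption:

```python
import mlflow
import torch
from transformers import pipeline


class NERModel(mlflow.pyfunc.PythonModel):
    def load_context(self, context):
        # Build a token-classification pipeline, on GPU when one is available.
        device = 0 if torch.cuda.is_available() else -1
        self.pipeline = pipeline("ner", model="dslim/bert-base-NER", device=device)

    def predict(self, context, model_input):
        # model_input is a pandas DataFrame with a "text" column; return one
        # dict of NER results per row, matching the sample output further down.
        return [{"ner": self.pipeline(text)} for text in model_input["text"]]
```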

Test the BERT model
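Instantiate the wrapper directly and run it on a couple of sentences; the inputs below are reconstructed to match the sample endpoint output shown later:

```python
import pandas as pd

test_df = pd.DataFrame(
    {
        "text": [
            "My name is Wolfgang and I live in Berlin",
            "My name is Colton and I'm at Databricks",
        ]
    }
)

ner_model = NERModel()
ner_model.load_context(None)  # No MLflow context is needed for a local test.
ner_model.predict(None, test_df)
```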

Register the BERT model

First, create an MLflow signature to tell MLflow what inputs and outputs the model expects.
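For example, derive the signature from the test inputs and outputs with `infer_signature` (depending on your MLflow version, deeply nested outputs may need a simplified schema):

```python
from mlflow.models.signature import infer_signature

signature = infer_signature(test_df, ner_model.predict(None, test_df))
```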

Create the dependencies list so that your endpoint has all of the necessary libraries at runtime.
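One way to do this is to pin the versions installed on the cluster:

```python
import mlflow
import torch
import torchvision
import transformers

# Pin the endpoint's dependencies to the versions used in this notebook.
pip_requirements = [
    f"torch=={torch.__version__}",
    f"torchvision=={torchvision.__version__}",
    f"transformers=={transformers.__version__}",
    f"mlflow=={mlflow.__version__}",
]
```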

Register the model in the model registry
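A sketch of the logging call; the three-level Unity Catalog name is a placeholder to adjust for your own catalog and schema:

```python
with mlflow.start_run():
    model_info = mlflow.pyfunc.log_model(
        artifact_path="ner_model",
        python_model=NERModel(),
        signature=signature,
        pip_requirements=pip_requirements,
        registered_model_name="main.default.bert_ner",  # hypothetical UC name
    )
```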

Deploy model to Mosaic AI Model Serving endpoint

Use the Databricks Python SDK to create the model serving endpoint. Building the container and bringing it online can take some time, so the timeout is set to 30 minutes.
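A sketch with the SDK; the endpoint name `ner` is inferred from the sample response below (`served_model_name='ner'`), and the workload settings are assumptions:

```python
from datetime import timedelta

from databricks.sdk import WorkspaceClient
from databricks.sdk.service.serving import (
    EndpointCoreConfigInput,
    ServedEntityInput,
)

w = WorkspaceClient()

w.serving_endpoints.create_and_wait(
    name="ner",
    config=EndpointCoreConfigInput(
        served_entities=[
            ServedEntityInput(
                entity_name="main.default.bert_ner",  # registered model from above
                entity_version="1",
                workload_size="Small",
                workload_type="GPU_SMALL",  # serve on a small GPU
                scale_to_zero_enabled=True,
            )
        ]
    ),
    timeout=timedelta(minutes=30),  # container build and startup can be slow
)
```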

Now, you can run a test query.
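For example, query the endpoint with the same two sentences used earlier:

```python
w.serving_endpoints.query(
    name="ner",
    dataframe_records=[
        {"text": "My name is Wolfgang and I live in Berlin"},
        {"text": "My name is Colton and I'm at Databricks"},
    ],
)
```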

```
QueryEndpointResponse(choices=[], created=None, data=[], id=None, model=None, object=None, predictions=[{'ner': [{'entity': 'B-PER', 'score': 0.9939519762992859, 'index': 4, 'word': 'wolfgang', 'start': 11, 'end': 19}, {'entity': 'B-LOC', 'score': 0.9978950023651123, 'index': 9, 'word': 'berlin', 'start': 34, 'end': 40}]}, {'ner': [{'entity': 'B-PER', 'score': 0.995381772518158, 'index': 4, 'word': 'colton', 'start': 11, 'end': 17}, {'entity': 'B-ORG', 'score': 0.9861130118370056, 'index': 9, 'word': 'data', 'start': 29, 'end': 33}, {'entity': 'I-ORG', 'score': 0.9936231970787048, 'index': 10, 'word': '##brick', 'start': 33, 'end': 38}, {'entity': 'I-ORG', 'score': 0.9867315888404846, 'index': 11, 'word': '##s', 'start': 38, 'end': 39}]}], served_model_name='ner', usage=None)
```

Batch inference using ai_query

To run inference over many records at a time, you can perform batch inference with the ai_query function.
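A sketch of the batch inference query; the source table `main.default.ner_text` and its `text` column are assumptions for illustration:

```python
result_df = spark.sql(
    """
    SELECT
      text,
      -- On some runtime versions a custom model may also need an explicit returnType.
      ai_query('ner', text) AS ner
    FROM main.default.ner_text
    LIMIT 1000
    """
)
```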

Call the display function to execute the Spark code above.
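Because Spark evaluates the query lazily, inference runs when the result is displayed:

```python
display(result_df)
```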

Now you have performed batch inference for 1000 data points on a BERT NER model serving endpoint!