Graph Analysis Tutorial with GraphFrames

This tutorial notebook shows you how to use GraphFrames to perform graph analysis. To run the notebook:

  1. Create the GraphFrames library from the Spark-Packages repository. You must ensure you are using the right version of the library for the version of Databricks Runtime your cluster is running.

  2. Install the library into your cluster.

  3. Download the data from Kaggle and unzip it. You must sign into Kaggle using third-party authentication or create and sign into a Kaggle account.

  4. Upload station.csv and trip.csv using the Create table UI.

    The tables are named station_csv and trip_csv.