cardinality
Collection function: Returns the number of elements in the array, or the number of key-value pairs in the map, stored in the column.
For the corresponding Databricks SQL function, see cardinality function.
Syntax
Python
from pyspark.databricks.sql import functions as dbf
dbf.cardinality(col=<col>)
Parameters
| Parameter | Type | Description |
|---|---|---|
| `col` | `pyspark.sql.Column` or `str` | Target column to compute on. |
Returns
pyspark.sql.Column: length of the array/map.
Examples
Python
from pyspark.databricks.sql import functions as dbf
df = spark.createDataFrame([([1, 2, 3],),([1],),([],)], ['data'])
df.select(dbf.cardinality("data")).show()
Output
+-----------------+
|cardinality(data)|
+-----------------+
|                3|
|                1|
|                0|
+-----------------+