kll_sketch_get_n_float
Returns the number of items collected in the KLL float sketch.
Syntax
Python
from pyspark.sql import functions as sf
sf.kll_sketch_get_n_float(col)
Parameters
Parameter | Type | Description |
|---|---|---|
|
| The KLL float sketch binary representation. |
Returns
pyspark.sql.Column: The count of items in the sketch.
Examples
Example 1: Get count of items in KLL float sketch
Python
from pyspark.sql import functions as sf
df = spark.createDataFrame([1.0,2.0,3.0,4.0,5.0], "FLOAT")
sketch_df = df.agg(sf.kll_sketch_agg_float("value").alias("sketch"))
sketch_df.select(sf.kll_sketch_get_n_float("sketch")).show()
Output
+------------------------------+
|kll_sketch_get_n_float(sketch)|
+------------------------------+
| 5|
+------------------------------+