kll_sketch_get_rank_double

Returns the rank of an input quantile value within a KLL double sketch. The quantile can be a single value or an array of values.

Syntax

Python
from pyspark.sql import functions as sf

sf.kll_sketch_get_rank_double(sketch, quantile)

Parameters

Parameter | Type | Description
sketch | pyspark.sql.Column or str | The KLL double sketch binary representation.
quantile | pyspark.sql.Column or str | The quantile value(s) to look up.

Returns

pyspark.sql.Column: The rank value(s) (between 0.0 and 1.0).

Examples

Example 1: Get rank from KLL double sketch

Python
from pyspark.sql import functions as sf
df = spark.createDataFrame([1.0,2.0,3.0,4.0,5.0], "DOUBLE")
sketch_df = df.agg(sf.kll_sketch_agg_double("value").alias("sketch"))
sketch_df.select(sf.kll_sketch_get_rank_double("sketch", sf.lit(3.0))).show()
Output
+---------------------------------------+
|kll_sketch_get_rank_double(sketch, 3.0)|
+---------------------------------------+
|                                    0.6|
+---------------------------------------+
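The 0.6 in the output above is consistent with reading the rank as the fraction of values less than or equal to the queried value: three of the five inputs (1.0, 2.0, 3.0) are at most 3.0. The following is a minimal pure-Python sketch of that interpretation. It assumes inclusive rank semantics, inferred only from this example, and it computes the rank exactly, whereas a KLL sketch approximates it for large inputs. The helper names `exact_rank` and `exact_ranks` are hypothetical and are not part of PySpark.

```python
from bisect import bisect_right


def exact_rank(values, query):
    """Return the fraction of values <= query (assumed inclusive rank)."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    # bisect_right counts how many sorted elements are <= query.
    return bisect_right(ordered, query) / len(ordered)


def exact_ranks(values, queries):
    """Apply exact_rank to each query, mirroring the array form of the input."""
    return [exact_rank(values, q) for q in queries]


data = [1.0, 2.0, 3.0, 4.0, 5.0]
print(exact_rank(data, 3.0))           # 0.6
print(exact_ranks(data, [1.0, 5.0]))   # [0.2, 1.0]
```

Values below the minimum map to 0.0 and values at or above the maximum map to 1.0, which matches the documented return range.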