avg
Returns the average of the values in a group.
Syntax
Python
from pyspark.sql import functions as sf
sf.avg(col)
Parameters
Parameter | Type | Description |
|---|---|---|
|
| Target column to compute on. |
Returns
pyspark.sql.Column: the column for computed results.
Examples
Example 1: Calculating the average age
Python
import pyspark.sql.functions as sf
df = spark.createDataFrame([(1982, 15), (1990, 2)], ["birth", "age"])
df.select(sf.avg("age")).show()
Output
+--------+
|avg(age)|
+--------+
| 8.5|
+--------+
Example 2: Calculating the average age with None
Python
import pyspark.sql.functions as sf
df = spark.createDataFrame([(1982, None), (1990, 2), (2000, 4)], ["birth", "age"])
df.select(sf.avg("age")).show()
Output
+--------+
|avg(age)|
+--------+
| 3.0|
+--------+