Skip to main content

cov (DataFrame)

Calculate the sample covariance for the given columns, specified by their names, as a double value. DataFrame.cov and DataFrameStatFunctions.cov are aliases.

Syntax

cov(col1: str, col2: str)

Parameters

Parameter

Type

Description

col1

str

The name of the first column.

col2

str

The name of the second column.

Returns

float: Covariance of two columns.

Examples

Python
df = spark.createDataFrame([(1, 12), (10, 1), (19, 8)], ["c1", "c2"])
df.cov("c1", "c2")
# -18.0
df = spark.createDataFrame([(11, 12), (10, 11), (9, 10)], ["small", "bigger"])
df.cov("small", "bigger")
# 1.0