Skip to main content

crossJoin

Returns the cartesian product with another DataFrame.

Syntax

crossJoin(other: "DataFrame")

Parameters

Parameter

Type

Description

other

DataFrame

Right side of the cartesian product.

Returns

DataFrame: Joined DataFrame.

Examples

Python
from pyspark.sql import Row
df = spark.createDataFrame(
[(14, "Tom"), (23, "Alice"), (16, "Bob")], ["age", "name"])
df2 = spark.createDataFrame(
[Row(height=80, name="Tom"), Row(height=85, name="Bob")])
df.crossJoin(df2.select("height")).select("age", "name", "height"
).orderBy("age", "name", "height").show()
# +---+-----+------+
# |age| name|height|
# +---+-----+------+
# | 14| Tom| 80|
# | 14| Tom| 85|
# | 16| Bob| 80|
# | 16| Bob| 85|
# | 23|Alice| 80|
# | 23|Alice| 85|
# +---+-----+------+