randn
Generates a random column with independent and identically distributed (i.i.d.) samples from the standard normal distribution. Supports Spark Connect.
For the corresponding Databricks SQL function, see randn function.
Syntax
Python
from pyspark.databricks.sql import functions as dbf
dbf.randn(seed=<seed>)
Parameters
| Parameter | Type | Description |
|---|---|---|
| seed | int, optional | Seed value for the random generator. |
Returns
pyspark.sql.Column: A column of i.i.d. random values drawn from the standard normal distribution.
Examples
Python
from pyspark.databricks.sql import functions as dbf
spark.range(0, 2, 1, 1).select("*", dbf.randn()).show() # doctest: +SKIP
Output
+---+--------------------------+
| id|randn(3968742514375399317)|
+---+--------------------------+
|  0|      -0.47968645355788...|
|  1|       -0.4950952457305...|
+---+--------------------------+
Python
from pyspark.databricks.sql import functions as dbf
spark.range(0, 2, 1, 1).select("*", dbf.randn(seed=42)).show() # doctest: +SKIP
Output
+---+------------------+
| id|         randn(42)|
+---+------------------+
|  0| 2.384479054241...|
|  1|0.1920934041293...|
+---+------------------+