date_diff
Returns the number of days from start to end.
For the corresponding Databricks SQL function, see date_diff function.
Syntax
Python
from pyspark.databricks.sql import functions as dbf
dbf.date_diff(end=<end>, start=<start>)
Parameters
Parameter | Type | Description |
|---|---|---|
|
| to date column to work on. |
|
| from date column to work on. |
Returns
pyspark.sql.Column: difference in days between two dates.
Examples
Python
from pyspark.databricks.sql import functions as dbf
df = spark.createDataFrame([('2015-04-08','2015-05-10')], ['d1', 'd2'])
df.select('*', dbf.date_diff('d1', 'd2')).show()
df.select('*', dbf.date_diff(df.d2, df.d1)).show()