Skip to main content

input_file_block_start

Returns the start offset of the block being read, or -1 if not available.

Syntax

Python
from pyspark.sql import functions as sf

sf.input_file_block_start()

Examples

Example 1: Get input file block start offset

Python
from pyspark.sql import functions as sf
df = spark.read.text("python/test_support/sql/ages_newlines.csv", lineSep=",")
df.select(sf.input_file_block_start()).show()
Output
+------------------------+
|input_file_block_start()|
+------------------------+
| 0|
| 0|
| 0|
| 0|
| 0|
| 0|
| 0|
| 0|
+------------------------+