%md
# Apache Spark Built-in and Higher-Order Functions Examples

Apache Spark Built-in and Higher-Order Functions Examples

Last refresh: Never

%md
## For array types

For array types

Last refresh: Never

%md
#### array_distinct(array&lt;T&gt;): array&lt;T&gt;
Removes duplicate values from the given array.

array_distinct(array<T>): array<T>

Removes duplicate values from the given array.

Last refresh: Never

SELECT array_distinct(array(1, 2, 3, null, 3));

array_distinct(array(1, 2, 3, CAST(NULL AS INT), 3))

[1, 2, 3, null]

Showing all 1 rows.

Last refresh: Never

Command took 1.52 seconds

%md
#### array_intersect(array&lt;T&gt;, array&lt;T&gt;): array&lt;T&gt;
Returns an array of the elements in the intersection of the given two arrays, without duplicates.

array_intersect(array<T>, array<T>): array<T>

Returns an array of the elements in the intersection of the given two arrays, without duplicates.

Last refresh: Never

SELECT array_intersect(array(1, 2, 3), array(1, 3, 5));

array_intersect(array(1, 2, 3), array(1, 3, 5))

[1, 3]

Showing all 1 rows.

Last refresh: Never

Command took 1.37 seconds

%md
#### array_union(array&lt;T&gt;, array&lt;T&gt;): array&lt;T&gt;
Returns an array of the elements in the union of the given two arrays, without duplicates.

array_union(array<T>, array<T>): array<T>

Returns an array of the elements in the union of the given two arrays, without duplicates.

Last refresh: Never

SELECT array_union(array(1, 2, 3), array(1, 3, 5));

array_union(array(1, 2, 3), array(1, 3, 5))

[1, 2, 3, 5]

Showing all 1 rows.

Last refresh: Never

Command took 0.06 seconds

%md
#### array_except(array&lt;T&gt;, array&lt;T&gt;): array&lt;T&gt;
Returns an array of the elements in array1 but not in array2, without duplicates.

array_except(array<T>, array<T>): array<T>

Returns an array of the elements in array1 but not in array2, without duplicates.

Last refresh: Never

SELECT array_except(array(1, 2, 3), array(1, 3, 5));

array_except(array(1, 2, 3), array(1, 3, 5))

[2]

Showing all 1 rows.

Last refresh: Never

Command took 1.39 seconds

%md
#### array_join(array&lt;String&gt;, String[, String]): String
Concatenates the elements of the given array using the delimiter and an optional string to replace nulls. If no value is set for null replacement, any null value is filtered.

array_join(array<String>, String[, String]): String

Concatenates the elements of the given array using the delimiter and an optional string to replace nulls. If no value is set for null replacement, any null value is filtered.

Last refresh: Never

SELECT array_join(array('hello', 'world'), ' ');

array_join(array(hello, world), )

hello world

Showing all 1 rows.

Last refresh: Never

Command took 0.08 seconds

SELECT array_join(array('hello', null ,'world'), ' ');

array_join(array(hello, CAST(NULL AS STRING), world), )

hello world

Showing all 1 rows.

Last refresh: Never

Command took 0.08 seconds

SELECT array_join(array('hello', null ,'world'), ' ', ',');

array_join(array(hello, CAST(NULL AS STRING), world), , ,)

hello , world

Showing all 1 rows.

Last refresh: Never

Command took 0.05 seconds

%md
#### array_max(array&lt;T&gt;): T
Returns the maximum value in the given array. null elements are skipped.

array_max(array<T>): T

Returns the maximum value in the given array. null elements are skipped.

Last refresh: Never

SELECT array_max(array(1, 20, null, 3));

array_max(array(1, 20, CAST(NULL AS INT), 3))

Showing all 1 rows.

Last refresh: Never

Command took 0.07 seconds

%md
#### array_min(array&lt;T&gt;): T
Returns the minimum value in the given array. null elements are skipped.

array_min(array<T>): T

Returns the minimum value in the given array. null elements are skipped.

Last refresh: Never

SELECT array_min(array(1, 20, null, 3));

array_min(array(1, 20, CAST(NULL AS INT), 3))

Showing all 1 rows.

Last refresh: Never

Command took 0.10 seconds

%md
#### array_position(array&lt;T&gt;, T): Long
Returns the (1-based) index of the first element of the given array as long.

array_position(array<T>, T): Long

Returns the (1-based) index of the first element of the given array as long.

Last refresh: Never

SELECT array_position(array(3, 2, 1), 1);

array_position(array(3, 2, 1), 1)

Showing all 1 rows.

Last refresh: Never

Command took 1.38 seconds

%md
#### array_remove(array&lt;T&gt;, T): array&lt;T&gt;
Remove all elements that equal to the given element from the given array.

array_remove(array<T>, T): array<T>

Remove all elements that equal to the given element from the given array.

Last refresh: Never

SELECT array_remove(array(1, 2, 3, null, 3), 3);

array_remove(array(1, 2, 3, CAST(NULL AS INT), 3), 3)

[1, 2, null]

Showing all 1 rows.

Last refresh: Never

Command took 0.08 seconds

%md
#### arrays_overlap(array&lt;T&gt;, array&lt;T&gt;): array&lt;T&gt;
Returns true if array1 contains at least a non-null element present also in array2. If the arrays have no common element and they are both non-empty and either of them contains a null element null is returned, false otherwise.

arrays_overlap(array<T>, array<T>): array<T>

Returns true if array1 contains at least a non-null element present also in array2. If the arrays have no common element and they are both non-empty and either of them contains a null element null is returned, false otherwise.

Last refresh: Never

SELECT arrays_overlap(array(1, 2, 3), array(3, 4, 5));

arrays_overlap(array(1, 2, 3), array(3, 4, 5))

true

Showing all 1 rows.

Last refresh: Never

Command took 0.10 seconds

%md
#### array_sort(array&lt;T&gt;): array&lt;T&gt;
Sorts the input array in ascending order. The elements of the input array must be orderable. Null elements will be placed at the end of the returned array.

array_sort(array<T>): array<T>

Sorts the input array in ascending order. The elements of the input array must be orderable. Null elements will be placed at the end of the returned array.

Last refresh: Never

SELECT array_sort(array('b', 'd', null, 'c', 'a'));

array_sort(array(b, d, CAST(NULL AS STRING), c, a), lambdafunction((IF(((namedlambdavariable() IS NULL) AND (namedlambdavariable() IS NULL)), 0, (IF((namedlambdavariable() IS NULL), 1, (IF((namedlambdavariable() IS NULL), -1, (IF((namedlambdavariable() < namedlambdavariable()), -1, (IF((namedlambdavariable() > namedlambdavariable()), 1, 0)))))))))), namedlambdavariable(), namedlambdavariable()))

Showing all 1 rows.

Last refresh: Never

Command took 0.46 seconds

%md
#### concat(String, ...): String / concat(array&lt;T&gt;, ...): array&lt;T&gt;
Returns the concatenation of col1, col2, ..., colN.

This function works with strings, binary and compatible array columns.

concat(String, ...): String / concat(array<T>, ...): array<T>

Returns the concatenation of col1, col2, ..., colN.

This function works with strings, binary and compatible array columns.

Last refresh: Never

SELECT concat('Spark', 'SQL');

concat(Spark, SQL)

SparkSQL

Showing all 1 rows.

Last refresh: Never

Command took 0.04 seconds

SELECT concat(array(1, 2, 3), array(4, 5), array(6));

concat(array(1, 2, 3), array(4, 5), array(6))

[1, 2, 3, 4, 5, 6]

Showing all 1 rows.

Last refresh: Never

Command took 0.04 seconds

%md
#### flatten(array&lt;array&lt;T&gt;&gt;): array&lt;T&gt;
Transforms an array of arrays into a single array.

flatten(array<array<T>>): array<T>

Transforms an array of arrays into a single array.

Last refresh: Never

SELECT flatten(array(array(1, 2), array(3, 4)));

flatten(array(array(1, 2), array(3, 4)))

[1, 2, 3, 4]

Showing all 1 rows.

Last refresh: Never

Command took 0.04 seconds

%md
#### array_repeat(T, Int): array&lt;T&gt;
Returns the array containing element count times.

array_repeat(T, Int): array<T>

Returns the array containing element count times.

Last refresh: Never

SELECT array_repeat('123', 2);

array_repeat(123, 2)

["123", "123"]

Showing all 1 rows.

Last refresh: Never

Command took 0.10 seconds

%md
#### reverse(String): String / reverse(array&lt;T&gt;): array&lt;T&gt;
Returns a reversed string or an array with reverse order of elements.

reverse(String): String / reverse(array<T>): array<T>

Returns a reversed string or an array with reverse order of elements.

Last refresh: Never

SELECT reverse('Spark SQL');

reverse(Spark SQL)

LQS krapS

Showing all 1 rows.

Last refresh: Never

Command took 0.06 seconds

SELECT reverse(array(2, 1, 4, 3));

reverse(array(2, 1, 4, 3))

[3, 4, 1, 2]

Showing all 1 rows.

Last refresh: Never

Command took 0.06 seconds

%md
#### sequence(T, T[, T]): array&lt;T&gt;
Generates an array of elements from start to stop (inclusive), incrementing by step. The type of the returned elements is the same as the type of argument expressions.

sequence(T, T[, T]): array<T>

Generates an array of elements from start to stop (inclusive), incrementing by step. The type of the returned elements is the same as the type of argument expressions.

Last refresh: Never

SELECT sequence(1, 5);

sequence(1, 5)

[1, 2, 3, 4, 5]

Showing all 1 rows.

Last refresh: Never

Command took 0.06 seconds

SELECT sequence(5, 1);

sequence(5, 1)

[5, 4, 3, 2, 1]

Showing all 1 rows.

Last refresh: Never

Command took 0.06 seconds

SELECT sequence(to_date('2018-01-01'), to_date('2018-03-01'), interval 1 month);

sequence(to_date('2018-01-01'), to_date('2018-03-01'), INTERVAL '1 months')

["2018-01-01", "2018-02-01", "2018-03-01"]

Showing all 1 rows.

Last refresh: Never

Command took 0.10 seconds

%md
#### shuffle(array&lt;T&gt;): array&lt;T&gt;
Returns a random permutation of the given array.

shuffle(array<T>): array<T>

Returns a random permutation of the given array.

Last refresh: Never

SELECT shuffle(array(1, 20, 3, 5));

shuffle(array(1, 20, 3, 5))

[20, 1, 5, 3]

Showing all 1 rows.

Last refresh: Never

Command took 0.10 seconds

SELECT shuffle(array(1, 20, null, 3));

shuffle(array(1, 20, CAST(NULL AS INT), 3))

[null, 3, 20, 1]

Showing all 1 rows.

Last refresh: Never

Command took 0.11 seconds

%md
#### slice(array&lt;T&gt;, Int, Int): array&lt;T&gt;
Subsets the given array starting from index start (or starting from the end if start is negative) with the specified length.

slice(array<T>, Int, Int): array<T>

Subsets the given array starting from index start (or starting from the end if start is negative) with the specified length.

Last refresh: Never

SELECT slice(array(1, 2, 3, 4), 2, 2);

slice(array(1, 2, 3, 4), 2, 2)

[2, 3]

Showing all 1 rows.

Last refresh: Never

Command took 0.08 seconds

SELECT slice(array(1, 2, 3, 4), -2, 2);

slice(array(1, 2, 3, 4), -2, 2)

[3, 4]

Showing all 1 rows.

Last refresh: Never

Command took 0.06 seconds

%md
#### array_zip(array&lt;T&gt;, array&lt;U&gt;, ...): array&lt;struct&lt;T, U, ...&gt;&gt;
Returns a merged array of structs in which the N-th struct contains all N-th values of input arrays.

array_zip(array<T>, array<U>, ...): array<struct<T, U, ...>>

Returns a merged array of structs in which the N-th struct contains all N-th values of input arrays.

Last refresh: Never

SELECT arrays_zip(array(1, 2, 3), array(2, 3, 4));

arrays_zip(array(1, 2, 3), array(2, 3, 4))

[{"0": 1, "1": 2}, {"0": 2, "1": 3}, {"0": 3, "1": 4}]

Showing all 1 rows.

Last refresh: Never

Command took 0.15 seconds

SELECT arrays_zip(array(1, 2), array(2, 3), array(3, 4));

arrays_zip(array(1, 2), array(2, 3), array(3, 4))

[{"0": 1, "1": 2, "2": 3}, {"0": 2, "1": 3, "2": 4}]

Showing all 1 rows.

Last refresh: Never

Command took 0.13 seconds

%md
## For map types

For map types

Last refresh: Never

%md
#### map_form_arrays(array&lt;K&gt;, array&lt;V&gt;): map&lt;K, V&gt;
Creates a map with a pair of the given key/value arrays. All elements in keys should not be null.

map_form_arrays(array<K>, array<V>): map<K, V>

Creates a map with a pair of the given key/value arrays. All elements in keys should not be null.

Last refresh: Never

SELECT map_from_arrays(array(1.0, 3.0), array('2', '4'));

map_from_arrays(array(1.0, 3.0), array(2, 4))

{"1.0": "2", "3.0": "4"}

Showing all 1 rows.

Last refresh: Never

Command took 0.17 seconds

%md
#### map_from_entries(array&lt;struct&lt;K, V&gt;&gt;): map&lt;K, V&gt;
Returns a map created from the given array of entries.

map_from_entries(array<struct<K, V>>): map<K, V>

Returns a map created from the given array of entries.

Last refresh: Never

SELECT map_from_entries(array(struct(1, 'a'), struct(2, 'b')));

map_from_entries(array(named_struct(col1, 1, col2, a), named_struct(col1, 2, col2, b)))

{"1": "a", "2": "b"}

Showing all 1 rows.

Last refresh: Never

Command took 0.10 seconds

%md
#### map_concat(map&lt;K, V&gt;, ...): map&lt;K, V&gt;
Returns the union of all the given maps.

map_concat(map<K, V>, ...): map<K, V>

Returns the union of all the given maps.

Last refresh: Never

SELECT map_concat(map(1, 'a', 2, 'b'), map(3, 'c', 4, 'd'));

map_concat(map(1, a, 2, b), map(3, c, 4, d))

{"1": "a", "2": "b", "3": "c", "4": "d"}

Showing all 1 rows.

Last refresh: Never

Command took 0.18 seconds

%md
## For both array and map types

For both array and map types

Last refresh: Never

%md
#### element_at(array&lt;T&gt;, Int): T / element_at(map&lt;K, V&gt;, K): V
For arrays, returns an element of the given array at given (1-based) index. If index &lt; 0, accesses elements from the last to the first. Returns null if the index exceeds the length of the array.

For maps, returns a value for the given key, or null if the key is not contained in the map.

element_at(array<T>, Int): T / element_at(map<K, V>, K): V

For arrays, returns an element of the given array at given (1-based) index. If index < 0, accesses elements from the last to the first. Returns null if the index exceeds the length of the array.

For maps, returns a value for the given key, or null if the key is not contained in the map.

Last refresh: Never

SELECT element_at(array(1, 2, 3), 2);

element_at(array(1, 2, 3), 2)

Showing all 1 rows.

Last refresh: Never

Command took 0.09 seconds

SELECT element_at(map(1, 'a', 2, 'b'), 2);

element_at(map(1, a, 2, b), 2)

Showing all 1 rows.

Last refresh: Never

Command took 0.06 seconds

%md
#### cardinality(array&lt;T&gt;): Int / cardinality(map&lt;K, V&gt;): Int
An alias of size. Returns the size of the given array or a map. Returns -1 if null.

cardinality(array<T>): Int / cardinality(map<K, V>): Int

An alias of size. Returns the size of the given array or a map. Returns -1 if null.

Last refresh: Never

SELECT cardinality(array('b', 'd', 'c', 'a'));

cardinality(array(b, d, c, a))

Showing all 1 rows.

Last refresh: Never

Command took 0.06 seconds

%md
## Higher-order functions

Higher-order functions

Last refresh: Never

%md
#### transform(array&lt;T&gt;, function&lt;T, U&gt;): array&lt;U&gt; and transform(array&lt;T&gt;, function&lt;T, Int, U&gt;): array&lt;U&gt;
Transform elements in an array using the function.

If there are two arguments for the lambda function, the second argument means the index of the element.

transform(array<T>, function<T, U>): array<U> and transform(array<T>, function<T, Int, U>): array<U>

Transform elements in an array using the function.

If there are two arguments for the lambda function, the second argument means the index of the element.

Last refresh: Never

SELECT transform(array(1, 2, 3), x -> x + 1);

transform(array(1, 2, 3), lambdafunction((namedlambdavariable() + 1), namedlambdavariable()))

[2, 3, 4]

Showing all 1 rows.

Last refresh: Never

Command took 0.13 seconds