daft.Expression.agg_set

Expression.agg_set() -> Expression

Aggregates the values in the expression into a set (ignoring nulls).

Example

>>> import daft
>>> df = daft.from_pydict({"values": [1, 1, None, 2, 2, None]})
>>> df.agg(df["values"].agg_set().alias("unique_values")).show()
╭───────────────╮
│ unique_values │
│ ---           │
│ List[Int64]   │
╞═══════════════╡
│ [1, 2]        │
╰───────────────╯

(Showing first 1 of 1 rows)

Null values are ignored, so a column containing only nulls aggregates to an empty list:

>>> df = daft.from_pydict({"values": [None, None, None]})
>>> df.agg(df["values"].agg_set().alias("unique_values")).show()
╭───────────────╮
│ unique_values │
│ ---           │
│ List[Null]    │
╞═══════════════╡
│ []            │
╰───────────────╯

(Showing first 1 of 1 rows)
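
The behavior shown in the examples can be modeled in plain Python. This is a minimal sketch of the semantics only, not Daft's implementation: `agg_set_model` is a hypothetical helper introduced here for illustration. It keeps values in first-occurrence order, which matches the example output above, but this page does not state an ordering guarantee, so treat that as an assumption.

```python
from typing import Hashable, Iterable, List, Optional, TypeVar

T = TypeVar("T", bound=Hashable)


def agg_set_model(values: Iterable[Optional[T]]) -> List[T]:
    """Plain-Python model of agg_set (hypothetical helper, not part of Daft)."""
    seen = set()
    unique: List[T] = []
    for value in values:
        if value is None:
            # Nulls are ignored, as in the documented examples.
            continue
        if value not in seen:
            # Keep only the first occurrence of each distinct value
            # (ordering is an assumption of this sketch).
            seen.add(value)
            unique.append(value)
    return unique


print(agg_set_model([1, 1, None, 2, 2, None]))  # [1, 2]
print(agg_set_model([None, None, None]))        # []
```

The two calls mirror the inputs of the examples above: duplicates collapse to one entry each, and an all-null input yields an empty list rather than a null result.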

Returns:

A List expression containing the unique non-null values from the input.

Return type:

Expression