daft.Expression.agg_set
- Expression.agg_set() -> Expression
Aggregates the values in the expression into a set (ignoring nulls).
Example
>>> import daft
>>> df = daft.from_pydict({"values": [1, 1, None, 2, 2, None]})
>>> df.agg(df["values"].agg_set().alias("unique_values")).show()
╭───────────────╮
│ unique_values │
│ ---           │
│ List[Int64]   │
╞═══════════════╡
│ [1, 2]        │
╰───────────────╯
(Showing first 1 of 1 rows)
Null values are ignored, so a column containing only nulls aggregates to an empty list:
>>> df = daft.from_pydict({"values": [None, None, None]})
>>> df.agg(df["values"].agg_set().alias("unique_values")).show()
╭───────────────╮
│ unique_values │
│ ---           │
│ List[Null]    │
╞═══════════════╡
│ []            │
╰───────────────╯
(Showing first 1 of 1 rows)
- Returns:
A List expression containing the unique values from the input
- Return type:
Expression
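As a rough mental model of the semantics described above, the following pure-Python sketch collects the distinct non-null values of a sequence. It is not Daft's implementation: `agg_set_sketch` is a hypothetical helper introduced only for illustration, and keeping values in first-seen order is an assumption of the sketch rather than a guarantee documented here.

```python
from typing import Hashable, Iterable, List, Optional


def agg_set_sketch(values: Iterable[Optional[Hashable]]) -> List[Hashable]:
    """Hypothetical helper: return the distinct non-null values.

    Assumption: values are emitted in first-seen order. The real
    aggregation may not promise any particular ordering.
    """
    seen = set()
    unique = []
    for value in values:
        if value is None:
            # Nulls are skipped, mirroring "(ignoring nulls)" in the docstring.
            continue
        if value not in seen:
            seen.add(value)
            unique.append(value)
    return unique


# Same inputs as the two documented examples.
print(agg_set_sketch([1, 1, None, 2, 2, None]))
print(agg_set_sketch([None, None, None]))
```

With the inputs from the two examples, the sketch yields `[1, 2]` and an empty list respectively, matching the documented results.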