daft.DataFrame.write_parquet

DataFrame.write_parquet(root_dir: Union[str, Path], compression: str = 'snappy', partition_cols: Optional[List[Union[Expression, str]]] = None, io_config: Optional[IOConfig] = None) → DataFrame

Writes the DataFrame as Parquet files and returns a new DataFrame containing the paths of the files that were written.

Files will be written to <root_dir>/* with randomly generated UUIDs as the file names.

Note

This call is blocking and will execute the DataFrame when called.

Parameters:

root_dir (Union[str, Path]) – root file path to write parquet files to.
compression (str, optional) – compression algorithm. Defaults to “snappy”.
partition_cols (Optional[List[ColumnInputType]], optional) – columns (names or expressions) by which to subpartition each partition further. Defaults to None.
io_config (Optional[IOConfig], optional) – configurations to use when interacting with remote storage. Defaults to None.

Returns:

A DataFrame containing the paths of the files that were written, as strings.

Return type:

DataFrame
