DataFrame Creation#

In-Memory Data#

Python Objects#

from_pylist

Creates a DataFrame from a list of dictionaries.

from_pydict

Creates a DataFrame from a Python dictionary.

Arrow#

from_arrow

Creates a DataFrame from a pyarrow Table.

Pandas#

from_pandas

Creates a Daft DataFrame from a pandas DataFrame.

Files#

Parquet#

read_parquet

Creates a DataFrame from Parquet file(s).

CSV#

read_csv

Creates a DataFrame from CSV file(s).

JSON#

read_json

Creates a DataFrame from line-delimited JSON file(s).

File Paths#

from_glob_path

Creates a DataFrame of file paths and other metadata from a glob path.

Data Catalogs#

Apache Iceberg#

read_iceberg

Creates a DataFrame from an Iceberg table.

Delta Lake#

read_deltalake

Creates a DataFrame from a Delta Lake table.

Apache Hudi#

read_hudi

Creates a DataFrame from a Hudi table.

Integrations#

Ray Datasets#

from_ray_dataset

Creates a DataFrame from a Ray Dataset.

Dask#

from_dask_dataframe

Creates a Daft DataFrame from a Dask DataFrame.

Databases#

read_sql

Creates a DataFrame from the results of a SQL query.

read_lance

Creates a DataFrame from a LanceDB table.