ray.data.Dataset.to_daft

Dataset.to_daft() → daft.DataFrame

Convert this Dataset into a Daft DataFrame.

This converts all of the data in the Ray Dataset into a Daft DataFrame in a zero-copy way, using Arrow as the intermediate data format.

Note

This operation triggers execution of any lazy transformations applied to this dataset.

Time complexity: O(dataset size / parallelism)

Returns:

A Daft DataFrame created from this dataset.
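
Examples:

The following is a minimal sketch of one way to use the conversion, not an official example. The helper `to_daft_summary` and the function `demo` are hypothetical names introduced here for illustration. The sketch also assumes that Daft has to run on its Ray runner to read a Ray Dataset, which is why `demo` calls `daft.context.set_runner_ray()` first; check this against the Daft version you have installed.

```python
def to_daft_summary(ds):
    """Convert a Ray Dataset to a Daft DataFrame and report its shape.

    `ds` is expected to be a ray.data.Dataset; only its to_daft() method
    is used here. This helper is illustrative, not part of Ray or Daft.
    """
    # to_daft() triggers execution of any pending lazy transformations on ds.
    df = ds.to_daft()
    # column_names and count_rows() are Daft DataFrame APIs.
    return df, df.column_names, df.count_rows()


def demo():
    # Imports are local so the helper above can be imported on its own.
    import daft
    import ray

    # Assumption: Daft must be on its Ray runner to read a Ray Dataset.
    daft.context.set_runner_ray()

    # ray.data.range(8) yields rows of the form {"id": i}; map() is lazy.
    ds = ray.data.range(8).map(
        lambda row: {"id": row["id"], "square": row["id"] ** 2}
    )

    # The map() above is executed when the dataset is converted.
    df, columns, num_rows = to_daft_summary(ds)
    print(columns, num_rows)
    df.show()
```

Calling `demo()` requires both `ray` and `daft` to be installed. Because the conversion executes the dataset, any expensive upstream transformations run at the `to_daft()` call rather than when they were declared.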