ray.data.Dataset.to_spark
ray.data.Dataset.to_spark#
- Dataset.to_spark(spark: pyspark.sql.SparkSession) pyspark.sql.DataFrame [source]#
Convert this dataset into a Spark dataframe.
Note
This operation will trigger execution of the lazy transformations performed on this dataset.
Time complexity: O(dataset size / parallelism)
- Returns
A Spark dataframe created from this dataset.