ray.data.Datasink

class ray.data.Datasink

Interface for defining write-related logic.

To write data to a destination that isn’t built in, subclass this class and pass an instance of your subclass to Dataset.write_datasink().

DeveloperAPI: This API may change across minor Ray releases.
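A minimal sketch of such a subclass follows. It assumes that each block is a pyarrow Table or pandas DataFrame (so that len(block) gives its row count) and that write() receives an iterable of blocks plus a task context. RowCountingDatasink is a hypothetical name, and the ImportError fallback exists only so the class can be exercised without Ray installed.

```python
from typing import Any, Iterable

try:
    from ray.data import Datasink
except ImportError:
    # Illustration-only fallback: lets the sketch run where Ray is not installed.
    Datasink = object


class RowCountingDatasink(Datasink):
    """Hypothetical datasink that counts rows instead of persisting them."""

    def get_name(self) -> str:
        # Human-readable name for this datasink.
        return "RowCounting"

    def write(self, blocks: Iterable[Any], ctx: Any) -> int:
        # Assumption: every block reports its row count through len(),
        # which holds for pyarrow.Table and pandas.DataFrame.
        return sum(len(block) for block in blocks)
```

With Ray installed, an instance would be handed to the dataset, for example ray.data.range(100).write_datasink(RowCountingDatasink()).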

Methods

__init__

get_name

Return a human-readable name for this datasink.

on_write_complete

Callback for when a write job completes.

on_write_failed

Callback for when a write job fails.

on_write_start

Callback for when a write job starts.

write

Write blocks.
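The order in which these callbacks fire can be illustrated with a simplified stand-in for the driver-side flow. Nothing in this sketch is Ray code: RecordingSink and run_write_job are hypothetical, and a real write job runs write() inside distributed tasks rather than a local loop. The argument to on_write_complete is assumed here to be a list of per-task results; its exact type may differ between Ray releases.

```python
from typing import Any, Iterable, List


class RecordingSink:
    """Hypothetical sink that mirrors the Datasink method names and logs each call."""

    def __init__(self) -> None:
        self.events: List[str] = []

    def on_write_start(self) -> None:
        self.events.append("start")

    def write(self, blocks: Iterable[Any], ctx: Any) -> int:
        self.events.append("write")
        return sum(len(block) for block in blocks)

    def on_write_complete(self, write_results: List[Any]) -> None:
        self.events.append(f"complete:{sum(write_results)}")

    def on_write_failed(self, error: Exception) -> None:
        self.events.append(f"failed:{type(error).__name__}")


def run_write_job(sink: RecordingSink, tasks: List[List[Any]]) -> None:
    """Local stand-in for a write job: start, one write() per task, then complete or fail."""
    try:
        sink.on_write_start()
        results = [sink.write(blocks, ctx=None) for blocks in tasks]
        sink.on_write_complete(results)
    except Exception as error:
        # The failure callback runs before the error propagates to the caller.
        sink.on_write_failed(error)
        raise
```

Cleanup of partially written output would typically belong in on_write_failed, and any commit or finalize step in on_write_complete.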

Attributes

num_rows_per_write

The target number of rows to pass to each write() call.

supports_distributed_writes

If False, only launch write tasks on the driver's node.
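Both attributes can be overridden on a subclass. The sketch below assumes they are exposed as read-only properties; SingleNodeDatasink and the row target of 1000 are hypothetical, and the ImportError fallback is only there so the class can be inspected without Ray installed.

```python
from typing import Any, Iterable, Optional

try:
    from ray.data import Datasink
except ImportError:
    # Illustration-only fallback: lets the sketch run where Ray is not installed.
    Datasink = object


class SingleNodeDatasink(Datasink):
    """Hypothetical datasink for a destination reachable only from the driver's node."""

    def write(self, blocks: Iterable[Any], ctx: Any) -> None:
        # Placeholder: a real sink would persist each block here.
        for _ in blocks:
            pass

    @property
    def supports_distributed_writes(self) -> bool:
        # False keeps write tasks on the driver's node, e.g. for a local-only path.
        return False

    @property
    def num_rows_per_write(self) -> Optional[int]:
        # Target number of rows per write() call; 1000 is an arbitrary example value.
        return 1000
```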