ray.data.Datasource.get_read_tasks

Datasource.get_read_tasks(parallelism: int) -> List[ReadTask]

Execute the read and return read tasks.

Parameters:

parallelism – The requested read parallelism. The number of read tasks should equal this value if possible.

Returns:

A list of read tasks that can be executed to read blocks from the datasource in parallel.
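
The following is a minimal sketch of the planning step an implementation might perform, written without Ray so that it runs on its own. The function name `plan_read_tasks`, the `num_rows` parameter, and the use of plain zero-argument callables in place of `ReadTask` objects are all assumptions made for illustration; they are not part of the Ray API. It shows one way to honor the requested parallelism "if possible": the work is split into `parallelism` contiguous ranges, or fewer when there are not enough rows to give every task something to read.

```python
from typing import Callable, List


def plan_read_tasks(num_rows: int, parallelism: int) -> List[Callable[[], List[int]]]:
    """Hypothetical stand-in for get_read_tasks (not the Ray API).

    Splits `num_rows` rows into contiguous ranges and returns one
    zero-argument callable per range. Calling a task "reads" its rows.
    """
    if parallelism < 1:
        raise ValueError("parallelism must be at least 1")

    # Never create more tasks than rows, so that no task is empty.
    num_tasks = min(parallelism, num_rows)

    tasks = []
    for i in range(num_tasks):
        # Integer arithmetic spreads any remainder across the tasks.
        start = i * num_rows // num_tasks
        end = (i + 1) * num_rows // num_tasks
        # Bind start/end as defaults so each closure keeps its own range.
        tasks.append(lambda start=start, end=end: list(range(start, end)))
    return tasks


# Planning is cheap; the data is only produced when a task is called.
tasks = plan_read_tasks(num_rows=10, parallelism=4)
blocks = [task() for task in tasks]
print(len(tasks), blocks)
```

In this sketch the tasks are independent of one another, which is what allows them to be executed in parallel. A real implementation would return `ReadTask` objects rather than bare callables.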