ray.data.datasource.Reader.get_read_tasks#

Reader.get_read_tasks(parallelism: int) List[ray.data.datasource.datasource.ReadTask][source]#

Execute the read and return read tasks.

Parameters
  • parallelism – The requested read parallelism. The number of read tasks should equal to this value if possible.

  • read_args – Additional kwargs to pass to the datasource impl.

Returns

A list of read tasks that can be executed to read blocks from the datasource in parallel.