ray.data.datasource.Reader.get_read_tasks
ray.data.datasource.Reader.get_read_tasks#
- Reader.get_read_tasks(parallelism: int) List[ray.data.datasource.datasource.ReadTask] [source]#
Execute the read and return read tasks.
- Parameters
parallelism – The requested read parallelism. The number of read tasks should equal to this value if possible.
read_args – Additional kwargs to pass to the datasource impl.
- Returns
A list of read tasks that can be executed to read blocks from the datasource in parallel.