ray.data.Datasource.get_read_tasks#
- Datasource.get_read_tasks(parallelism: int) List[ReadTask] [source]#
Execute the read and return read tasks.
- Parameters:
parallelism – The requested read parallelism. The number of read tasks should equal to this value if possible.
- Returns:
A list of read tasks that can be executed to read blocks from the datasource in parallel.