ray.data.datasource.FilenameProvider.get_filename_for_block#
- FilenameProvider.get_filename_for_block(block: pyarrow.Table | pandas.DataFrame, write_uuid: str, task_index: int, block_index: int) str [source]#
Generate a filename for a block of data.
Note
Filenames must be unique and deterministic for a given write UUID, and task and block index.
A block consists of multiple rows and corresponds to a single output file. Each task might produce a different number of blocks.
- Parameters:
block – The block that will be written to a file.
write_uuid – The UUID of the write operation.
task_index – The index of the write task.
block_index – The index of the block within the write task.