ray.data.datasource.FilenameProvider.get_filename_for_row#

FilenameProvider.get_filename_for_row(row: Dict[str, Any], write_uuid: str, task_index: int, block_index: int, row_index: int) str[source]#

Generate a filename for a row.

Deprecated since version Implement: get_filename_for_task() instead. Row filenames are automatically derived by appending _{block_index:06}_{row_index:06} to the task filename.

Parameters:
  • row – The row that will be written to a file.

  • write_uuid – The UUID of the write operation.

  • task_index – The index of the write task.

  • block_index – The index of the block within the write task.

  • row_index – The index of the row within the block.

Warning

DEPRECATED: This API is deprecated and may be removed in future Ray releases. Implement get_filename_for_task() instead. Row filenames are automatically derived by appending _{block_index:06}_{row_index:06} to the task filename. All files from the same task must share the task filename as a prefix so that uncommitted data files can be identified and cleaned up during checkpoint recovery.