DataIterator API
DataIterator
- class ray.data.DataIterator
An iterator for reading records from a Dataset.

For Datasets, each iteration call represents a complete read of all items in the Dataset.
If using Ray Train, each trainer actor should get its own iterator by calling ray.train.get_dataset_shard("train").

Examples
>>> import ray
>>> ds = ray.data.range(5)
>>> ds
shape: (5, 1)
╭───────╮
│ id    │
│ ---   │
│ int64 │
╰───────╯
(Dataset isn't materialized)
>>> ds.iterator()
DataIterator(shape: (5, 1)
╭───────╮
│ id    │
│ ---   │
│ int64 │
╰───────╯
(Dataset isn't materialized))
iter_batches | Return a batched iterable over the dataset.
iter_rows | Return a local row iterable over the dataset.
iter_torch_batches | Return a batched iterable of Torch Tensors over the dataset.
materialize | Execute and materialize this data iterator into object store memory.
stats | Return a string containing execution timing information.
to_tf | Return a TF Dataset over this dataset.