DataIterator API#
DataIterator#
- class ray.data.DataIterator[source]#
An iterator for reading records from a
Dataset
.For Datasets, each iteration call represents a complete read of all items in the Dataset.
If using Ray Train, each trainer actor should get its own iterator by calling
ray.train.get_dataset_shard("train")
.Examples
>>> import ray >>> ds = ray.data.range(5) >>> ds Dataset(num_rows=5, schema={id: int64}) >>> ds.iterator() DataIterator(Dataset(num_rows=5, schema={id: int64}))
PublicAPI (beta): This API is in beta and may change before becoming stable.
Return a batched iterable over the dataset. |
|
Return a batched iterable of Torch Tensors over the dataset. |
|
Execute and materialize this data iterator into object store memory. |
|
Returns a string containing execution timing information. |
|
Return a TF Dataset over this dataset. |