ray.data.from_items#

ray.data.from_items(items: List[Any], *, parallelism: int = - 1) ray.data.dataset.Dataset[Any][source]#

Create a dataset from a list of local Python objects.

Examples

>>> import ray
>>> ds = ray.data.from_items([1, 2, 3, 4, 5]) 
>>> ds 
Dataset(num_blocks=5, num_rows=5, schema=<class 'int'>)
>>> ds.take(2) 
[1, 2]
Parameters
  • items – List of local Python objects.

  • parallelism – The amount of parallelism to use for the dataset. Parallelism may be limited by the number of items.

Returns

Dataset holding the items.

PublicAPI: This API is stable across Ray releases.