ray.data.Dataset.random_sample#
- Dataset.random_sample(fraction: float, *, seed: int | None = None) Dataset [source]#
Returns a new
Dataset
containing a random fraction of the rows.Note
This method returns roughly
fraction * total_rows
rows. An exact number of rows isn’t guaranteed.Examples
>>> import ray >>> ds = ray.data.range(100) >>> ds.random_sample(0.1).count() 10
- Parameters:
fraction – The fraction of elements to sample.
seed – Seeds the python random pRNG generator.
- Returns:
Returns a
Dataset
containing the sampled rows.