ray.data.Dataset.count#
- Dataset.count() int[source]#
- Count the number of rows in the dataset. - For Datasets which only read Parquet files (created with - read_parquet()), this method reads the file metadata to efficiently count the number of rows without reading in the entire data.- Note - If this dataset consists of more than a read, or if the row count can’t be determined from the metadata provided by the datasource, then this operation will trigger execution of the lazy transformations performed on this dataset. - Examples - >>> import ray >>> ds = ray.data.range(10) >>> ds.count() 10 - Returns:
- The number of records in the dataset.