Ray Data API# Input/Output Synthetic Data Python Objects Parquet CSV JSON Text Images Binary TFRecords Pandas NumPy Arrow MongoDB BigQuery SQL Databases Databricks Dask Spark Modin Mars Torch Hugging Face TensorFlow WebDataset Datasource API Datasink API Partitioning API MetadataProvider API Dataset API Constructor Basic Transformations Sorting, Shuffling, Repartitioning Splitting and Merging Datasets Grouped and Global Aggregations Consuming Data I/O and Conversion Inspecting Metadata Execution Serialization Internals DataIterator API DataIterator ray.data.DataIterator.iter_batches ray.data.DataIterator.iter_torch_batches ray.data.DataIterator.to_tf ray.data.DataIterator.stats ExecutionOptions API Constructor Resource Options GroupedData API Constructor Computations / Descriptive Stats Function Application Aggregate Function DataContext API Constructor Get DataContext RandomAccessDataset (experimental) Constructor Functions Utility ray.data.set_progress_bars Preprocessor Preprocessor Interface Generic Preprocessors Categorical Encoders Feature Scalers K-Bins Discretizers API Guide for Users from Other Data Libraries For Pandas Users For PyArrow Users For PyTorch Dataset & DataLoader Users