Ray Data API

- Input/Output
  - Synthetic Data
  - Python Objects
  - Parquet
  - CSV
  - JSON
  - Text
  - Avro
  - Images
  - Binary
  - TFRecords
  - Pandas
  - NumPy
  - Arrow
  - MongoDB
  - BigQuery
  - SQL Databases
  - Databricks
  - Delta Sharing
  - Iceberg
  - Lance
  - Dask
  - Spark
  - Modin
  - Mars
  - Torch
  - Hugging Face
  - TensorFlow
  - WebDataset
  - Datasource API
  - Datasink API
  - Partitioning API
  - MetadataProvider API
- Dataset API
  - Dataset
  - Basic Transformations
  - Consuming Data
  - Execution
  - Grouped and Global aggregations
  - I/O and Conversion
  - Inspecting Metadata
  - Sorting, Shuffling and Repartitioning
  - Splitting and Merging datasets
  - Schema
  - Developer API
  - Deprecated API
- DataIterator API
  - DataIterator
- ExecutionOptions API
  - Constructor
  - Resource Options
- GroupedData API
  - Computations or Descriptive Stats
  - Function Application
  - AggregateFn
- Global configuration
  - DataContext
- Preprocessor
  - Preprocessor Interface
  - Generic Preprocessors
  - Categorical Encoders
  - Feature Scalers
  - K-Bins Discretizers
- API Guide for Users from Other Data Libraries
  - For Pandas Users
  - For PyArrow Users
  - For PyTorch Dataset & DataLoader Users