Ray Data API# Input/Output Synthetic Data Python Objects Parquet CSV JSON Text Avro Images Binary TFRecords Pandas NumPy Arrow MongoDB BigQuery SQL Databases Databricks Dask Spark Modin Mars Torch Hugging Face TensorFlow WebDataset Datasource API Datasink API Partitioning API MetadataProvider API Dataset API Constructor Basic Transformations Sorting, Shuffling, Repartitioning Splitting and Merging Datasets Grouped and Global Aggregations Consuming Data I/O and Conversion Inspecting Metadata Execution Internals DataIterator API DataIterator ExecutionOptions API Constructor Resource Options GroupedData API Constructor Computations / Descriptive Stats Function Application Aggregate Function Global configuration DataContext Utility Preprocessor Preprocessor Interface Generic Preprocessors Categorical Encoders Feature Scalers K-Bins Discretizers API Guide for Users from Other Data Libraries For Pandas Users For PyArrow Users For PyTorch Dataset & DataLoader Users