The Ray Ecosystem#
This page lists libraries that have integrations with Ray for distributed execution in alphabetical order. It’s easy to add your own integration to this list. Simply open a pull request with a few lines of text, see the dropdown below for more information.
Adding Your Integration
To add an integration add an entry to this file, using the same
grid-item-card directive that the other examples use.
Daft is a high-performance multimodal data engine that provides simple and reliable data processing for any modality - from structured tables to images, audio, video, and embeddings. Built with Python and Rust for modern AI workflows, Daft offers seamless scaling from local to distributed clusters, enabling efficient batch inference, document processing, and multimodal ETL pipelines at scale.
RayDP (“Spark on Ray”) enables you to easily use Spark inside a Ray program. You can use Spark to read the input data, process the data using SQL, Spark DataFrame, or Pandas (via Koalas) API, extract and transform features using Spark MLLib, and use RayDP Estimator API for distributed training on the preprocessed dataset.