The Ray Ecosystem#

This page lists libraries that have integrations with Ray for distributed execution in alphabetical order. It’s easy to add your own integration to this list. Simply open a pull request with a few lines of text, see the dropdown below for more information.

Adding Your Integration

To add an integration add an entry to this file, using the same grid-item-card directive that the other examples use.

../_images/aibrix.jpeg
https://img.shields.io/github/stars/vllm-project/aibrix?style=social

AIBrix is a cloud-native LLM inference infrastructure platform that provides building blocks for deploying, scaling, and optimizing large language model serving with Ray-based hybrid orchestration.

../_images/areal.png
https://img.shields.io/github/stars/areal-project/AReaL?style=social

AReaL is an asynchronous reinforcement learning system for LLM agents developed by Ant Group. It decouples generation from training for efficient distributed post-training on Ray clusters.

../_images/cosmos_curate.png
https://img.shields.io/github/stars/NVIDIA/cosmos-curator?style=social

Cosmos Curate is a GPU-accelerated video and image data curation toolkit from NVIDIA. It provides scalable pipelines for filtering, deduplication, and quality scoring using Ray for multi-node, multi-GPU distributed processing.

../_images/daft.png
https://img.shields.io/github/stars/Eventual-Inc/Daft?style=social

Daft is a high-performance multimodal data engine that provides simple and reliable data processing for any modality - from structured tables to images, audio, video, and embeddings. Built with Python and Rust for modern AI workflows, Daft offers seamless scaling from local to distributed clusters, enabling efficient batch inference, document processing, and multimodal ETL pipelines at scale.

../_images/data_juicer.png
https://img.shields.io/github/stars/modelscope/data-juicer?style=social

Data-Juicer is a one-stop multimodal data processing system to make data higher-quality, juicier, and more digestible for foundation models. It integrates with Ray for distributed data processing on large-scale datasets with over 100 multimodal operators and supports TB-size dataset deduplication.

../_images/deltacat.png
https://img.shields.io/github/stars/ray-project/deltacat?style=social

DeltaCAT is a portable multimodal lakehouse powered by Ray for petabyte-scale data compaction, deduplication, and incremental table processing with ACID compliance.

../_images/modin.png
https://img.shields.io/github/stars/modin-project/modin?style=social

Scale your pandas workflows by changing one line of code. Modin transparently distributes the data and computation so that all you need to do is continue using the pandas API as you were before installing Modin.

../_images/nemo.png
https://img.shields.io/github/stars/NVIDIA-NeMo/Curator?style=social

NeMo Curator is a scalable data curation toolkit from NVIDIA for preparing high-quality datasets for large language model training. It uses Ray for distributed data processing including deduplication, filtering, and quality classification at scale.

../_images/nemo.png
https://img.shields.io/github/stars/NVIDIA-NeMo/RL?style=social

NeMo-RL is NVIDIA’s scalable post-training toolkit for large language models. It provides RLHF and alignment training built on Ray for distributed orchestration of training and inference workloads.

../_images/openrlhf.png
https://img.shields.io/github/stars/OpenRLHF/OpenRLHF?style=social

OpenRLHF is an easy-to-use, scalable RLHF training framework. It supports distributed PPO, DPO, rejection sampling, and other alignment methods using Ray for orchestrating training and generation across multiple GPUs and nodes.

../_images/intel.png
https://img.shields.io/github/stars/Intel-bigdata/oap-raydp?style=social

RayDP (“Spark on Ray”) enables you to easily use Spark inside a Ray program. You can use Spark to read the input data, process the data using SQL, Spark DataFrame, or Pandas (via Koalas) API, extract and transform features using Spark MLLib, and use RayDP Estimator API for distributed training on the preprocessed dataset.

../_images/roll.jpg
https://img.shields.io/github/stars/alibaba/ROLL?style=social

ROLL is Alibaba’s reinforcement learning scaling library for large language models. It provides efficient distributed RL training with flexible resource scheduling and heterogeneous task management built on Ray.

../_images/skyrl.png
https://img.shields.io/github/stars/NovaSky-AI/SkyRL?style=social

SkyRL is a modular reinforcement learning library for LLM agents from UC Berkeley. It enables training through multi-turn environment interactions using Ray for distributed rollout and training.

../_images/slime.jpg
https://img.shields.io/github/stars/THUDM/slime?style=social

SLIME is a post-training framework for large language models from Tsinghua University. It provides RL scaling with a service-oriented architecture built on Ray and Megatron-LM for distributed training orchestration.

../_images/syftr.jpg
https://img.shields.io/github/stars/datarobot/syftr?style=social

Syftr is an open-source agent workflow optimizer from DataRobot. It uses Ray and Ray Tune for scalable multi-objective optimization of agentic AI workflows across prompts, models, and tool selections.

../_images/verl.jpg
https://img.shields.io/github/stars/verl-project/verl?style=social

verl is a flexible and efficient reinforcement learning training library for large language models from ByteDance. It provides a Ray-native hybrid controller for scalable RLHF training with distributed orchestration of rollout, training, and reward computation.

../_images/vllm.png
https://img.shields.io/github/stars/vllm-project/vllm?style=social

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models. It uses Ray for distributed tensor parallelism and pipeline parallelism across multiple GPUs and nodes.