Ray Clusters Overview#

Ray enables seamless scaling of workloads from a laptop to a large cluster. While Ray works out of the box on single machines with just a call to ray.init, to run Ray applications on multiple nodes you must first deploy a Ray cluster.

A Ray cluster is a set of worker nodes connected to a common Ray head node. Ray clusters can be fixed-size, or they may autoscale up and down according to the resources requested by applications running on the cluster.

Where can I deploy Ray clusters?#

Ray provides native cluster deployment support on the following technology stacks:

On AWS and GCP. Community-supported Azure, Aliyun and vSphere integrations also exist.
On Kubernetes, via the officially supported KubeRay project.
On Anyscale, a fully managed Ray platform by the creators of Ray. You can either bring an existing AWS, GCP, Azure and Kubernetes clusters, or use the Anyscale hosted compute layer.

Advanced users may want to deploy Ray manually or onto platforms not listed here.

Note

Multi-node Ray clusters are only supported on Linux. At your own risk, you may deploy Windows and OSX clusters by setting the environment variable RAY_ENABLE_WINDOWS_OR_OSX_CLUSTER=1 during deployment.

What’s next?#

I want to learn key Ray cluster concepts

Understand the key concepts and main ways of interacting with a Ray cluster.

Learn Key Concepts

I want to run Ray on Kubernetes

Deploy a Ray application to a Kubernetes cluster. You can run the tutorial on a Kubernetes cluster or on your laptop via Kind.

Get Started with Ray on Kubernetes

I want to run Ray on a cloud provider

Take a sample application designed to run on a laptop and scale it up in the cloud. Access to an AWS or GCP account is required.

Get Started with Ray on VMs

I want to run my application on an existing Ray cluster

Guide to submitting applications as Jobs to existing Ray clusters.

Job Submission