Check your version!

Things can change quickly, and so does this contributor guide. To make sure you’ve got the most cutting edge version of this guide, go check out the latest version.

Getting Involved / Contributing#

Ray is more than a framework for distributed applications but also an active community of developers, researchers, and folks that love machine learning.

Tip

Ask questions on our forum! The community is extremely active in helping people succeed in building their Ray applications.

You can join (and Star!) us on on GitHub.

Contributing to Ray#

We welcome (and encourage!) all forms of contributions to Ray, including and not limited to:

  • Code reviewing of patches and PRs.

  • Pushing patches.

  • Documentation and examples.

  • Community participation in forums and issues.

  • Code readability and code comments to improve readability.

  • Test cases to make the codebase more robust.

  • Tutorials, blog posts, talks that promote the project.

  • Features and major changes via Ray Enhancement Proposals (REP): ray-project/enhancements

What can I work on?#

We use Github to track issues, feature requests, and bugs. Take a look at the ones labeled “good first issue” for a place to start.

Setting up your development environment#

To edit the Ray source code, you’ll want to checkout the repository and also build Ray from source. Follow these instructions for building a local copy of Ray to easily make changes.

Submitting and Merging a Contribution#

There are a couple steps to merge a contribution.

  1. First merge the most recent version of master into your development branch.

    git remote add upstream https://github.com/ray-project/ray.git
    git pull . upstream/master
    
  2. Make sure all existing tests and linters pass. Run setup_hooks.sh to create a git hook that will run the linter before you push your changes.

  3. If introducing a new feature or patching a bug, be sure to add new test cases in the relevant file in ray/python/ray/tests/.

  4. Document the code. Public functions need to be documented, and remember to provide an usage example if applicable. See doc/README.md for instructions on editing and building public documentation.

  5. Address comments on your PR. During the review process you may need to address merge conflicts with other changes. To resolve merge conflicts, run git pull . upstream/master on your branch (please do not use rebase, as it is less friendly to the GitHub review tool. All commits will be squashed on merge.)

  6. Reviewers will merge and approve the pull request; be sure to ping them if the pull request is getting stale.

PR Review Process#

For contributors who are in the ray-project organization:#

  • When you first create a PR, add an reviewer to the assignee section.

  • Assignees will review your PR and add the @author-action-required label if further actions are required.

  • Address their comments and remove the @author-action-required label from the PR.

  • Repeat this process until assignees approve your PR.

  • Once the PR is approved, the author is in charge of ensuring the PR passes the build. Add the test-ok label if the build succeeds.

  • Committers will merge the PR once the build is passing.

For contributors who are not in the ray-project organization:#

  • Your PRs will have assignees shortly. Assignees of PRs will be actively engaging with contributors to merge the PR.

  • Please actively ping assignees after you address your comments!

Testing#

Even though we have hooks to run unit tests automatically for each pull request, we recommend you to run unit tests locally beforehand to reduce reviewers’ burden and speedup review process.

If you are running tests for the first time, you can install the required dependencies with:

pip install -c python/requirements_compiled.txt -r python/requirements/test-requirements.txt

Testing for Python development#

The full suite of tests is too large to run on a single machine. However, you can run individual relevant Python test files. Suppose that one of the tests in a file of tests, e.g., python/ray/tests/test_basic.py, is failing. You can run just that test file locally as follows:

# Directly calling `pytest -v ...` may lose import paths.
python -m pytest -v -s python/ray/tests/test_basic.py

This will run all of the tests in the file. To run a specific test, use the following:

# Directly calling `pytest -v ...` may lose import paths.
python -m pytest -v -s test_file.py::name_of_the_test

Testing for C++ development#

To compile and run all C++ tests, you can run:

bazel test $(bazel query 'kind(cc_test, ...)')

Alternatively, you can also run one specific C++ test. You can use:

bazel test $(bazel query 'kind(cc_test, ...)') --test_filter=ClientConnectionTest --test_output=streamed

Code Style#

In general, we follow the Google style guide for C++ code and the Black code style for Python code. Python imports follow PEP8 style. However, it is more important for code to be in a locally consistent style than to strictly follow guidelines. Whenever in doubt, follow the local code style of the component.

For Python documentation, we follow a subset of the Google pydoc format. The following code snippets demonstrate the canonical Ray pydoc formatting:

def ray_canonical_doc_style(param1: int, param2: str) -> bool:
    """First sentence MUST be inline with the quotes and fit on one line.

    Additional explanatory text can be added in paragraphs such as this one.
    Do not introduce multi-line first sentences.

    Examples:
        .. doctest::

            >>> # Provide code examples for key use cases, as possible.
            >>> ray_canonical_doc_style(41, "hello")
            True

            >>> # A second example.
            >>> ray_canonical_doc_style(72, "goodbye")
            False

    Args:
        param1: The first parameter. Do not include the types in the
            docstring. They should be defined only in the signature.
            Multi-line parameter docs should be indented by four spaces.
        param2: The second parameter.

    Returns:
        The return value. Do not include types here.
    """
class RayClass:
    """The summary line for a class docstring should fit on one line.

    Additional explanatory text can be added in paragraphs such as this one.
    Do not introduce multi-line first sentences.

    The __init__ method is documented here in the class level docstring.

    All the public methods and attributes should have docstrings.

    Examples:
        .. testcode::

            obj = RayClass(12, "world")
            obj.increment_attr1()

    Args:
        param1: The first parameter. Do not include the types in the
            docstring. They should be defined only in the signature.
            Multi-line parameter docs should be indented by four spaces.
        param2: The second parameter.
    """

    def __init__(self, param1: int, param2: str):
        #: Public attribute is documented here.
        self.attr1 = param1
        #: Public attribute is documented here.
        self.attr2 = param2

    @property
    def attr3(self) -> str:
        """Public property of the class.

        Properties created with the @property decorator
        should be documented here.
        """
        return "hello"

    def increment_attr1(self) -> None:
        """Class methods are similar to regular functions.

        See above about how to document functions.
        """

        self.attr1 = self.attr1 + 1

See this for more details about how to write code snippets in docstrings.

Lint and Formatting#

We also have tests for code formatting and linting that need to pass before merge.

pip install -r python/requirements/lint-requirements.txt
  • If developing for C++, you will need clang-format version 12 (download this version of Clang from here)

You can run the following locally:

scripts/format.sh

An output like the following indicates failure:

WARNING: clang-format is not installed!  # This is harmless
From https://github.com/ray-project/ray
 * branch                master     -> FETCH_HEAD
python/ray/util/sgd/tf/tf_runner.py:4:1: F401 'numpy as np' imported but unused  # Below is the failure

In addition, there are other formatting and semantic checkers for components like the following (not included in scripts/format.sh):

  • Python README format:

cd python
python setup.py check --restructuredtext --strict --metadata
  • Python & Docs banned words check

./ci/lint/check-banned-words.sh
  • Bazel format:

./ci/lint/bazel-format.sh
  • clang-tidy for C++ lint, requires clang and clang-tidy version 12 to be installed:

./ci/lint/check-git-clang-tidy-output.sh

You can run setup_hooks.sh to create a git hook that will run the linter before you push your changes.

Understanding CI test jobs#

The Ray project automatically runs continuous integration (CI) tests once a PR is opened using Buildkite with multiple CI test jobs.

The CI test folder contains all integration test scripts and they invoke other test scripts via pytest, bazel-based test or other bash scripts. Some of the examples include:

  • Raylet integration tests commands:
    • bazel test //:core_worker_test

  • Bazel test command:
    • bazel test --build_tests_only //:all

  • Ray serving test commands:
    • pytest python/ray/serve/tests

    • python python/ray/serve/examples/echo_full.py

If a CI build exception doesn’t appear to be related to your change, please visit this link to check recent tests known to be flaky.

API compatibility style guide#

Ray provides stability guarantees for its public APIs in Ray core and libraries, which are described in the API Stability guide.

It’s hard to fully capture the semantics of API compatibility into a single annotation (for example, public APIs may have “experimental” arguments). For more granular stability contracts, those can be noted in the pydoc (e.g., “the random_shuffle option is experimental”). When possible, experimental arguments should also be prefixed by underscores in Python (e.g., _owner=).

Other recommendations:

In Python APIs, consider forcing the use of kwargs instead of positional arguments (with the * operator). Kwargs are easier to keep backwards compatible than positional arguments, e.g. imagine if you needed to deprecate “opt1” below, it’s easier with forced kwargs:

def foo_bar(file, *, opt1=x, opt2=y)
    pass

For callback APIs, consider adding a **kwargs placeholder as a “forward compatibility placeholder” in case more args need to be passed to the callback in the future, e.g.:

def tune_user_callback(model, score, **future_kwargs):
    pass

Becoming a Reviewer#

We identify reviewers from active contributors. Reviewers are individuals who not only actively contribute to the project and are also willing to participate in the code review of new contributions. A pull request to the project has to be reviewed by at least one reviewer in order to be merged. There is currently no formal process, but active contributors to Ray will be solicited by current reviewers.

More Resources for Getting Involved#

Ray is more than a framework for distributed applications but also an active community of developers, researchers, and folks that love machine learning. Here’s a list of tips for getting involved with the Ray community:

Note

These tips are based off of the TVM contributor guide.