ray.train.v2.api.data_parallel_trainer.DataParallelTrainer.fit
- DataParallelTrainer.fit() -> Result
Launches the Ray Train controller to run training on workers.
- Returns:
A Result object containing the training result.
- Raises:
ray.train.v2.api.exceptions.ControllerError – If a non-retryable error occurs in the Ray Train controller itself, or if the number of retries configured in FailureConfig is exhausted.
ray.train.v2.api.exceptions.WorkerGroupError – If one or more workers fail during training and the number of retries configured in FailureConfig is exhausted.
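
The following is a minimal sketch of calling fit() and handling the two documented exceptions. The helper name fit_or_none and the function run_example are hypothetical and not part of Ray. The sketch also assumes that ScalingConfig, RunConfig, and FailureConfig can be imported from ray.train.v2.api.config, and that FailureConfig accepts max_failures; check both against your installed Ray version.

```python
from typing import Any, Optional, Tuple, Type


def fit_or_none(
    trainer: Any, handled: Tuple[Type[BaseException], ...]
) -> Optional[Any]:
    """Hypothetical helper: call trainer.fit() and return its Result.

    Returns None if one of the `handled` exception types is raised.
    Any other exception propagates to the caller.
    """
    try:
        return trainer.fit()
    except handled as exc:
        print(f"Training failed with {type(exc).__name__}: {exc}")
        return None


def run_example() -> Optional[Any]:
    """Illustrative usage with Ray Train; requires Ray to be installed."""
    # Ray imports are local so the helper above can be imported without Ray.
    # Assumption: the config classes live in ray.train.v2.api.config.
    from ray.train.v2.api.config import FailureConfig, RunConfig, ScalingConfig
    from ray.train.v2.api.data_parallel_trainer import DataParallelTrainer
    from ray.train.v2.api.exceptions import ControllerError, WorkerGroupError

    def train_loop_per_worker(config):
        # Placeholder training loop; real code would train a model here.
        pass

    trainer = DataParallelTrainer(
        train_loop_per_worker,
        scaling_config=ScalingConfig(num_workers=2),
        # Assumption: allow one retry before fit() raises.
        run_config=RunConfig(failure_config=FailureConfig(max_failures=1)),
    )
    # Both exceptions documented under "Raises" are handled the same way here.
    return fit_or_none(trainer, (ControllerError, WorkerGroupError))
```

Catching the two exception types separately instead would let the caller tell controller failures apart from worker failures, for example to log them to different places.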