ray.rllib.algorithms.algorithm.Algorithm.train#
- Algorithm.train()#
Runs one logical iteration of training.
Calls
step()
internally. Subclasses should overridestep()
instead to return results. This method automatically fills the following fields in the result:done
(bool): training is terminated. Filled only if not provided.time_this_iter_s
(float): Time in seconds this iteration took to run. This may be overridden in order to override the system-computed time difference.time_total_s
(float): Accumulated time in seconds for this entire experiment.training_iteration
(int): The index of this training iteration, e.g. call to train(). This is incremented afterstep()
is called.pid
(str): The pid of the training process.date
(str): A formatted date of when the result was processed.timestamp
(str): A UNIX timestamp of when the result was processed. This may be overridden.hostname
(str): Hostname of the machine hosting the training process.node_ip
(str): Node ip of the machine hosting the training process.- Returns:
A dict that describes training progress.