ray.rllib.algorithms.algorithm.Algorithm.evaluate#
- Algorithm.evaluate(parallel_train_future: ThreadPoolExecutor | None = None) Dict[source]#
Evaluates current policy under
evaluation_configsettings.- Parameters:
parallel_train_future – In case, we are training and avaluating in parallel, this arg carries the currently running ThreadPoolExecutor object that runs the training iteration. Use
parallel_train_future.done()to check, whether the parallel training job has completed andparallel_train_future.result()to get its return values.- Returns:
A ResultDict only containing the evaluation results from the current iteration.