ray.rllib.evaluation.rollout_worker.RolloutWorker.foreach_policy_to_train
- RolloutWorker.foreach_policy_to_train(func: Callable[[ray.rllib.policy.policy.Policy, str, Optional[Any]], ray.rllib.utils.typing.T], **kwargs) -> List[ray.rllib.utils.typing.T]
Calls the given function with each (policy, policy_id) tuple.
The function is called only for those policies/IDs for which self.is_policy_to_train() returns True.
- Parameters
func – The function to call with each (policy, policy ID) tuple, restricted to those policies for which self.is_policy_to_train returns True.
- Keyword Arguments
kwargs – Additional kwargs to be passed to the call.
- Returns
The list of return values of all calls to func(policy, pid, **kwargs), one entry per policy to train.
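The filtering and call pattern described above can be sketched without a running Ray cluster. The block below is a minimal illustration, not RLlib's implementation: `SketchWorker`, its `policy_map` contents, the `trainable_ids` argument, and the `describe` callback are all hypothetical stand-ins, and the exact signature of `is_policy_to_train` is an assumption.

```python
from typing import Any, Callable, Dict, List, Optional, Set, TypeVar

T = TypeVar("T")


class SketchWorker:
    """Hypothetical stand-in for a RolloutWorker (NOT the RLlib class)."""

    def __init__(self, policy_map: Dict[str, Any], trainable_ids: Set[str]) -> None:
        # Maps policy ID -> policy object (plain strings here, for illustration).
        self.policy_map = policy_map
        self._trainable_ids = trainable_ids

    def is_policy_to_train(self, policy_id: str, batch: Optional[Any] = None) -> bool:
        # Assumed signature: returns True if the policy should be trained.
        return policy_id in self._trainable_ids

    def foreach_policy_to_train(self, func: Callable[..., T], **kwargs: Any) -> List[T]:
        # Call func(policy, policy_id, **kwargs) for trainable policies only
        # and collect the return values in a list.
        return [
            func(policy, pid, **kwargs)
            for pid, policy in self.policy_map.items()
            if self.is_policy_to_train(pid)
        ]


def describe(policy: Any, policy_id: str, prefix: str = "") -> str:
    # Hypothetical callback; `prefix` arrives through **kwargs.
    return f"{prefix}{policy_id}:{policy}"


worker = SketchWorker(
    policy_map={"p0": "policy-0", "p1": "policy-1", "p2": "policy-2"},
    trainable_ids={"p0", "p2"},
)
results = worker.foreach_policy_to_train(describe, prefix="train/")
print(results)  # ['train/p0:policy-0', 'train/p2:policy-2']
```

In this sketch "p1" is skipped because it is not marked as trainable, and the extra keyword argument `prefix` is forwarded unchanged to every call.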