ray.rllib.env.env_runner_group.EnvRunnerGroup.sync_weights#
- EnvRunnerGroup.sync_weights(policies: List[str] | None = None, from_worker_or_learner_group: EnvRunner | LearnerGroup | None = None, to_worker_indices: List[int] | None = None, global_vars: Dict[str, numpy.ndarray | jnp.ndarray | tf.Tensor | torch.Tensor] | None = None, timeout_seconds: float | None = 0.0, inference_only: bool | None = False) -> None [source]#
Syncs model weights from the given weight source to all remote workers.
The weight source can be either a (local) rollout worker or a LearnerGroup; it only needs to implement a get_weights method.
- Parameters:
policies – Optional list of PolicyIDs to sync weights for. If None (default), sync weights to/from all policies.
from_worker_or_learner_group – Optional (local) EnvRunner instance or LearnerGroup instance to sync from. If None (default), sync from this EnvRunnerGroup’s local worker.
to_worker_indices – Optional list of worker indices to sync the weights to. If None (default), sync to all remote workers.
global_vars – Optional dict of global variables to set on the workers. If None (default), do not update global_vars.
timeout_seconds – Timeout in seconds to wait for the sync weights calls to complete. Default is 0.0 (fire-and-forget, do not wait for any sync calls to finish). Setting this to 0.0 might significantly improve algorithm performance, depending on the algo's training_step logic.
inference_only – Sync weights with workers that keep inference-only modules. This is needed for algorithms on the new API stack that use inference-only modules; in that case, only a subset of the parameters is synced to the workers. Default is False.
DeveloperAPI: This API may change across minor Ray releases.
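As a minimal sketch of the contract described above (the source only needs a get_weights method), the following uses hypothetical stand-in classes rather than the real ray.rllib EnvRunner and LearnerGroup; the sync_weights helper here is illustrative, not RLlib's actual implementation:

```python
class FakeLearnerGroup:
    """Hypothetical stand-in for a LearnerGroup weight source."""

    def __init__(self):
        self._weights = {"default_policy": [1.0, 2.0, 3.0]}

    def get_weights(self, policies=None):
        # Return all policies' weights, or only the requested PolicyIDs.
        if policies is None:
            return dict(self._weights)
        return {pid: w for pid, w in self._weights.items() if pid in policies}


class FakeEnvRunner:
    """Hypothetical stand-in for a remote EnvRunner."""

    def __init__(self):
        self.weights = {}

    def set_weights(self, weights):
        self.weights.update(weights)


def sync_weights(source, runners, policies=None):
    # Mirrors the documented flow: pull weights from the source once,
    # then push them to every target worker.
    weights = source.get_weights(policies)
    for runner in runners:
        runner.set_weights(weights)


source = FakeLearnerGroup()
runners = [FakeEnvRunner() for _ in range(2)]
sync_weights(source, runners)
print(runners[0].weights)  # -> {'default_policy': [1.0, 2.0, 3.0]}
```

In the real API, this pull/push happens inside EnvRunnerGroup.sync_weights, with timeout_seconds=0.0 making the remote pushes fire-and-forget.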