ray.rllib.core.learner.learner.Learner.additional_update_for_module#

Learner.additional_update_for_module(*, module_id: str, config: AlgorithmConfig | None = None, timestep: int, hps=None, **kwargs) → Dict[str, Any][source]#

Apply additional non-gradient based updates for a single module.

See additional_update for more details.

Parameters:

module_id – The id of the module to update.
config – The AlgorithmConfig specific to the given module_id.
timestep – The current global timestep (to be used with schedulers).
**kwargs – Keyword arguments to use for the additional update.

Returns:

A dictionary of results from the update