ray.rllib.policy.torch_policy_v2.TorchPolicyV2.__init__#

TorchPolicyV2.__init__(observation_space: gymnasium.spaces.Space, action_space: gymnasium.spaces.Space, config: dict, *, max_seq_len: int = 20)[source]#

Initializes a TorchPolicy instance.

Parameters:
  • observation_space – Observation space of the policy.

  • action_space – Action space of the policy.

  • config – The Policy’s config dict.

  • max_seq_len – Max sequence length for LSTM training.