ray.rllib.policy.torch_policy_v2.TorchPolicyV2.__init__#

TorchPolicyV2.__init__(observation_space: <MagicMock name='mock.spaces.Space' id='140527168090832'>, action_space: <MagicMock name='mock.spaces.Space' id='140527168090832'>, config: dict, *, max_seq_len: int = 20)[source]#

Initializes a TorchPolicy instance.

Parameters
  • observation_space – Observation space of the policy.

  • action_space – Action space of the policy.

  • config – The Policy’s config dict.

  • max_seq_len – Max sequence length for LSTM training.