ray.rllib.policy.eager_tf_policy_v2.EagerTFPolicyV2.stats_fn#

EagerTFPolicyV2.stats_fn(train_batch: ray.rllib.policy.sample_batch.SampleBatch) Dict[str, Union[numpy.array, jnp.ndarray, tf.Tensor, torch.Tensor]][source]#

Stats function. Returns a dict of statistics.

Parameters

train_batch – The SampleBatch (already) used for training.

Returns

The stats dict.