Ray
The number of agent steps (there are >= 1 agent steps per env step).
The number of agent steps total in this batch.