ray.rllib.policy.sample_batch.SampleBatch.get_single_step_input_dict#

SampleBatch.get_single_step_input_dict(view_requirements: Dict[str, ViewRequirement], index: str | int = 'last') → SampleBatch[source]#

Creates single ts SampleBatch at given index from self.

For usage as input-dict for model (action or value function) calls.

Parameters:

view_requirements – A view requirements dict from the model for which to produce the input_dict.
index – An integer index value indicating the position in the trajectory for which to generate the compute_actions input dict. Set to “last” to generate the dict at the very end of the trajectory (e.g. for value estimation). Note that “last” is different from -1, as “last” will use the final NEXT_OBS as observation input.

Returns:

The (single-timestep) input dict for ModelV2 calls.