ray.rllib.policy.policy.Policy.load_batch_into_buffer

Policy.load_batch_into_buffer(batch: SampleBatch, buffer_index: int = 0) → int

Bulk-loads the given SampleBatch into the devices’ memories.

The data is split equally across all of the Policy’s devices. If the data is not evenly divisible by the batch size, implementations should discard the excess.

Parameters:
  • batch – The SampleBatch to load.

  • buffer_index – The index of the buffer (a MultiGPUTowerStack) to use on the devices. The number of buffers on each device depends on the value of the num_multi_gpu_tower_stacks config key.

Returns:

The number of tuples loaded per device.
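
The split-and-discard rule described above can be illustrated with a little arithmetic. The sketch below does not call RLlib; `rows_loaded_per_device` is a hypothetical helper, and it assumes one plausible reading of the rule: the rows are first divided equally among the devices, and each device's share is then truncated to a whole number of per-device batches. Actual Policy implementations may round differently.

```python
def rows_loaded_per_device(
    num_rows: int, num_devices: int, per_device_batch_size: int
) -> int:
    """Hypothetical helper (not part of RLlib): rows each device keeps.

    Assumes an equal split across devices followed by truncation of each
    share to a multiple of the per-device batch size.
    """
    if num_devices <= 0 or per_device_batch_size <= 0:
        raise ValueError("num_devices and per_device_batch_size must be positive")
    # Equal split: every device receives the same number of rows.
    share = num_rows // num_devices
    # Drop the remainder so each device holds only complete batches.
    return (share // per_device_batch_size) * per_device_batch_size


if __name__ == "__main__":
    loaded = rows_loaded_per_device(
        num_rows=1000, num_devices=2, per_device_batch_size=128
    )
    discarded = 1000 - 2 * loaded
    print(f"loaded per device: {loaded}, discarded in total: {discarded}")
```

Under these assumptions, 1000 rows on 2 devices with a per-device batch size of 128 leaves 384 rows on each device and discards 232 rows in total. The per-device figure corresponds to the value the method is documented to return.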