ray.rllib.policy.Policy.learn\_on\_batch ======================================== .. currentmodule:: ray.rllib.policy .. automethod:: Policy.learn_on_batch