Offline RL

InputReaders

The InputReader API is used by an individual RolloutWorker to produce batches of experiences either from a simulator/environment or from an offline source (e.g., a file).

Here, we introduce the generic API and its child classes used for reading offline data (for offline RL). For details on RLlib’s Sampler implementations for collecting data from simulators/environments, see the Sampler documentation.

class ray.rllib.offline.input_reader.InputReader

API for collecting and returning experiences during policy evaluation.

abstract next() → Union[SampleBatch, MultiAgentBatch]

Returns the next batch of read experiences.

Returns

The experience read (SampleBatch or MultiAgentBatch).

tf_input_ops(queue_size: int = 1) → Dict[str, Union[numpy.array, tf.Tensor, torch.Tensor]]

Returns TensorFlow queue ops for reading inputs from this reader.

The main use of these ops is for integration into custom model losses. For example, you can use tf_input_ops() to read from files of external experiences to add an imitation learning loss to your model.

This method creates a queue runner thread that will call next() on this reader repeatedly to feed the TensorFlow queue.

Parameters

queue_size – Max elements to allow in the TF queue.

Example

>>> from ray.rllib.models.modelv2 import ModelV2
>>> from ray.rllib.offline.json_reader import JsonReader
>>> imitation_loss = ... # doctest: +SKIP
>>> class MyModel(ModelV2): # doctest: +SKIP
...     def custom_loss(self, policy_loss, loss_inputs):
...         reader = JsonReader(...)
...         input_ops = reader.tf_input_ops()
...         logits, _ = self._build_layers_v2(
...             {"obs": input_ops["obs"]},
...             self.num_outputs, self.options)
...         il_loss = imitation_loss(logits, input_ops["action"])
...         return policy_loss + il_loss

You can find a runnable version of this in examples/custom_loss.py.

Returns

Dict of Tensors, one for each column of the read SampleBatch.

JsonReader (ray.rllib.offline.json_reader.JsonReader)

For reading data from offline files (for example, when no simulator/environment is available), you can use the built-in JsonReader class.

You will have to change the input config value from “sampler” (default) to a JSON file name (str), a list of JSON files, or a path (str) pointing to a directory that contains JSON files. Alternatively, you can specify a callable that takes an IOContext object as its only argument and returns a new InputReader instance, for example:

config = {
    "input": lambda io_ctx: MyReader([arg1], [arg2], [io_ctx]),
}
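
The simpler, non-callable forms of the input setting described above look like this (a minimal sketch; all file paths are hypothetical):

config = {
    "input": "/tmp/experiences/output-file.json",  # a single JSON file
}
config = {
    "input": ["/tmp/1.json", "/tmp/2.json"],  # a list of JSON files
}
config = {
    "input": "/tmp/experiences",  # a directory containing JSON files
}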

For details on the IOContext class, see below.

class ray.rllib.offline.json_reader.JsonReader(inputs: Union[str, List[str]], ioctx: Optional[ray.rllib.offline.io_context.IOContext] = None)

Reader object that loads experiences from JSON file chunks.

The input files will be read in random order.

__init__(inputs: Union[str, List[str]], ioctx: Optional[ray.rllib.offline.io_context.IOContext] = None)

Initializes a JsonReader instance.

Parameters
  • inputs – Either a glob expression for files, e.g., /tmp/**/*.json, or a list of single file paths or URIs, e.g., ["s3://bucket/file.json", "s3://bucket/file2.json"].

  • ioctx – Current IO context object or None.
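
As a minimal usage sketch (assuming matching JSON files actually exist at the hypothetical glob path):

from ray.rllib.offline.json_reader import JsonReader

reader = JsonReader("/tmp/experiences/*.json")
batch = reader.next()  # one SampleBatch (or MultiAgentBatch) of logged data
print(batch.count)  # number of timesteps in the batch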

next() → Union[ray.rllib.policy.sample_batch.SampleBatch, ray.rllib.policy.sample_batch.MultiAgentBatch]

Returns the next batch of read experiences.

Returns

The experience read (SampleBatch or MultiAgentBatch).

read_all_files() → Union[ray.rllib.policy.sample_batch.SampleBatch, ray.rllib.policy.sample_batch.MultiAgentBatch]

Reads through all files and yields one SampleBatchType per line.

When it reaches the end of the last file, it starts over from the beginning.

Yields

One SampleBatch or MultiAgentBatch per line in all input files.
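
Because the generator restarts at the first file after the last one, a consumer should bound the iteration itself. A short sketch (the glob path is hypothetical):

from itertools import islice

from ray.rllib.offline.json_reader import JsonReader

reader = JsonReader("/tmp/experiences/*.json")
# Take at most 100 batches; iterating read_all_files() alone may never stop.
for batch in islice(reader.read_all_files(), 100):
    print(batch.count)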

MixedInput (ray.rllib.offline.mixed_input.MixedInput)

To mix several input readers at custom ratios, you can use the MixedInput reader. This reader is chosen automatically by RLlib when you provide a dict under the input config key that maps input reader specifiers to probabilities, e.g.:

"input": {
   "sampler": 0.4,  # 40% of samples will come from environment
   "/tmp/experiences/*.json": 0.4,  # the rest from different JSON files
   "s3://bucket/expert.json": 0.2,
}
class ray.rllib.offline.mixed_input.MixedInput(dist: Dict[ray.rllib.offline.json_reader.JsonReader, float], ioctx: ray.rllib.offline.io_context.IOContext)

Mixes input from a number of other input sources.

Examples

>>> from ray.rllib.offline.io_context import IOContext
>>> from ray.rllib.offline.mixed_input import MixedInput
>>> ioctx = IOContext(...) # doctest: +SKIP
>>> MixedInput({ # doctest: +SKIP
...    "sampler": 0.4,
...    "/tmp/experiences/*.json": 0.4,
...    "s3://bucket/expert.json": 0.2,
... }, ioctx)

__init__(dist: Dict[ray.rllib.offline.json_reader.JsonReader, float], ioctx: ray.rllib.offline.io_context.IOContext)

Initializes a MixedInput instance.

Parameters
  • dist – Dict mapping JsonReader paths or “sampler” to probabilities. The probabilities must sum to 1.0.

  • ioctx – current IO context object.

next() → Union[SampleBatch, MultiAgentBatch]

Returns the next batch of read experiences.

Returns

The experience read (SampleBatch or MultiAgentBatch).

D4RLReader (ray.rllib.offline.d4rl_reader.D4RLReader)

class ray.rllib.offline.d4rl_reader.D4RLReader(inputs: str, ioctx: Optional[ray.rllib.offline.io_context.IOContext] = None)

Reader object that loads experiences from a D4RL dataset.

__init__(inputs: str, ioctx: Optional[ray.rllib.offline.io_context.IOContext] = None)

Initializes a D4RLReader instance.

Parameters
  • inputs – String corresponding to the D4RL environment name.

  • ioctx – Current IO context object.
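
A short usage sketch (this assumes the d4rl package is installed; “halfcheetah-medium-v0” is one of the standard D4RL dataset names):

from ray.rllib.offline.d4rl_reader import D4RLReader

reader = D4RLReader("halfcheetah-medium-v0")
batch = reader.next()  # SampleBatch converted from the D4RL dataset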

next() → Union[SampleBatch, MultiAgentBatch]

Returns the next batch of read experiences.

Returns

The experience read (SampleBatch or MultiAgentBatch).

IOContext

IOContext instances are used in every InputReader and OutputWriter class. They serve as simple containers for the properties log_dir, config, worker_index, and worker.

class ray.rllib.offline.io_context.IOContext(log_dir: Optional[str] = None, config: Optional[dict] = None, worker_index: int = 0, worker: Optional[RolloutWorker] = None)

Class containing attributes to pass to input/output class constructors.

RLlib auto-sets these attributes when constructing input/output classes, such as InputReaders and OutputWriters.

__init__(log_dir: Optional[str] = None, config: Optional[dict] = None, worker_index: int = 0, worker: Optional[RolloutWorker] = None)

Initializes an IOContext object.

Parameters
  • log_dir – The logging directory to read from/write to.

  • config – The Algorithm’s main config dict.

  • worker_index – When there are multiple workers created, this uniquely identifies the current worker. 0 for the local worker, >0 for any of the remote workers.

  • worker – The RolloutWorker object reference.
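
To illustrate how these attributes are typically consumed, here is a hedged sketch of a custom InputReader whose constructor receives the auto-built IOContext (MyReader and its attribute choices are illustrative only, not an official recipe):

from ray.rllib.offline.input_reader import InputReader

class MyReader(InputReader):
    def __init__(self, ioctx):
        # RLlib fills in these attributes when it builds the IOContext.
        self.worker_index = ioctx.worker_index
        self.log_dir = ioctx.log_dir

    def next(self):
        # Return a SampleBatch read from your own storage here.
        raise NotImplementedError

config = {
    "input": lambda ioctx: MyReader(ioctx),
}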

default_sampler_input() → Optional[SamplerInput]

Returns the RolloutWorker’s SamplerInput object, if any.

Returns None if the RolloutWorker has no SamplerInput. Note that, by default, a local worker does not create a SamplerInput object when one or more remote workers exist.

Returns

The RolloutWorker’s SamplerInput object, or None if none exists.