ray.train.lightning.RayFSDPStrategy.lightning_module_state_dict#

RayFSDPStrategy.lightning_module_state_dict() Dict[str, Any][source]#

Gathers the full state dict to rank 0 on CPU.

FSDP checkpointing is broken in Lightning 2.0.x. This subclass patches the behavior to perform a full state dict checkpointing, gathering the checkpoint shards on rank 0 CPU. Upgrade to lightning>=2.1 to do sharded state dict checkpointing.

See the note in the class docstring for more details.