ray.serve.llm.LLMServer.sleep#

async LLMServer.sleep(**kwargs: Any) None[source]#

Put the engine to sleep.

Parameters:

**kwargs – Engine-specific sleep options. Passed through to the engine.