ray.serve.llm.LLMServer.is_sleeping

async LLMServer.is_sleeping() -> bool

Check whether the engine is currently sleeping.

Returns:

True if the engine is sleeping, False otherwise.
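Because is_sleeping() is a coroutine, its result has to be awaited before it can be used as a bool. The sketch below illustrates that calling pattern only. FakeLLMServer and describe_engine_state are hypothetical names introduced for this example; the stub stands in for a real LLMServer so the snippet runs without a Ray cluster, and it does not reflect how the real class tracks engine state.

```python
import asyncio


class FakeLLMServer:
    """Hypothetical stand-in for LLMServer; it only mimics the is_sleeping() coroutine."""

    def __init__(self, sleeping: bool = False) -> None:
        self._sleeping = sleeping

    async def is_sleeping(self) -> bool:
        return self._sleeping


async def describe_engine_state(server) -> str:
    # is_sleeping() is async, so await it; the bare coroutine object is always truthy.
    if await server.is_sleeping():
        return "sleeping"
    return "awake"


print(asyncio.run(describe_engine_state(FakeLLMServer(sleeping=True))))  # prints: sleeping
```

Forgetting the await is the usual pitfall here: `if server.is_sleeping():` tests the coroutine object rather than the returned value, so the branch would always be taken.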