ray.serve.llm.LLMServer.reset_prefix_cache#

async LLMServer.reset_prefix_cache() None[source]#

Reset the prefix cache of the underlying engine