reset_prefix_cache

async LLMServer.reset_prefix_cache() → None

Reset the KV prefix cache on the engine.

Clears cached key-value pairs from previous requests.
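The effect described above can be illustrated with a minimal, self-contained sketch. It does not use Ray: `StubLLMServer`, its `generate` method, and the dictionary-based cache are hypothetical stand-ins invented for this illustration. Only the method name and its signature (async, no arguments, returns `None`) are taken from this page. The sketch assumes that a prefix cache lets a repeated prompt prefix reuse earlier work, and that resetting it makes the next request start cold.

```python
import asyncio


class StubLLMServer:
    """Hypothetical stand-in for LLMServer; NOT Ray's implementation."""

    def __init__(self) -> None:
        # Toy "prefix cache": maps a prompt prefix to a placeholder
        # representing the cached key-value entries for that prefix.
        self._prefix_cache = {}

    async def generate(self, prompt: str) -> bool:
        """Return True if the prompt's prefix was already cached (a hit)."""
        prefix = prompt[:16]
        hit = prefix in self._prefix_cache
        self._prefix_cache[prefix] = "kv-entries"
        return hit

    async def reset_prefix_cache(self) -> None:
        """Same shape as the documented method: async, no args, returns None."""
        self._prefix_cache.clear()


async def main() -> list:
    server = StubLLMServer()
    prompt = "You are a helpful assistant. Summarize the text."
    first = await server.generate(prompt)   # cold cache: miss
    second = await server.generate(prompt)  # same prefix: hit
    await server.reset_prefix_cache()       # coroutine, so it must be awaited
    third = await server.generate(prompt)   # cache was cleared: miss again
    return [first, second, third]


results = asyncio.run(main())
print(results)  # [False, True, False]
```

Because the method is a coroutine, calling it without `await` would only create a coroutine object and leave the cache untouched, which is why the sketch awaits it inside an async function.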

© Copyright 2026, The Ray Team.
