ray.serve.handle.DeploymentResponse#
- class ray.serve.handle.DeploymentResponse[source]#
A future-like object wrapping the result of a unary deployment handle call.
From inside a deployment, a
DeploymentResponse
can be awaited to retrieve the output of the call without blocking the asyncio event loop.From outside a deployment,
.result()
can be used to retrieve the output in a blocking manner.Example:
from ray import serve from ray.serve.handle import DeploymentHandle @serve.deployment class Downstream: def say_hi(self, message: str) -> str: return f"Hello {message}!" @serve.deployment class Caller: def __init__(self, handle: DeploymentHandle): self._downstream_handle = handle async def __call__(self, message: str) -> str: # Inside a deployment: `await` the result to enable concurrency. response = self._downstream_handle.say_hi.remote(message) return await response app = Caller.bind(Downstream.bind()) handle: DeploymentHandle = serve.run(app) # Outside a deployment: call `.result()` to get output. response = handle.remote("world") assert response.result() == "Hello world!"
A
DeploymentResponse
can be passed directly to anotherDeploymentHandle
call without fetching the result to enable composing multiple deployments together.Example:
from ray import serve from ray.serve.handle import DeploymentHandle @serve.deployment class Adder: def add(self, val: int) -> int: return val + 1 @serve.deployment class Caller: def __init__(self, handle: DeploymentHandle): self._adder_handle = handle async def __call__(self, start: int) -> int: return await self._adder_handle.add.remote( # Pass the response directly to another handle call without awaiting. self._adder_handle.add.remote(start) ) app = Caller.bind(Adder.bind()) handle: DeploymentHandle = serve.run(app) assert handle.remote(0).result() == 2
- result(*, timeout_s: float | None = None, _skip_asyncio_check: bool = False) Any [source]#
Fetch the result of the handle call synchronously.
This should not be used from within a deployment as it runs in an asyncio event loop. For model composition,
await
the response instead.If
timeout_s
is provided and the result is not available before the timeout, aTimeoutError
is raised.