ray.serve.request_router.RunningReplica#
- class ray.serve.request_router.RunningReplica(replica_info: RunningReplicaInfo)[source]#
Contains info on a running replica. Also defines the interface for a request router to talk to a replica.
PublicAPI (alpha): This API is in alpha and may change before becoming stable.
Methods
Returns current queue len for the replica.
When on proxy, push proxy's self handle to replica
Send request to this replica.
Attributes
Actor ID of this replica.
Availability zone of the node this replica is running on.
Whether this replica is cross-language (Java).
Max concurrent requests that can be sent to this replica.
Set of model IDs on this replica.
Node ID of the node this replica is running on.
ID of this replica.
Dictionary of routing stats.