ray.serve.request_router.MultiplexMixin.rank_replicas_via_multiplex
- MultiplexMixin.rank_replicas_via_multiplex(replicas: List[RunningReplica], multiplexed_model_id: str) → List[List[RunningReplica]]
Rank the replicas based on the multiplexed model ID and return them as a list of replica lists indexed by rank. Rank 0 is the list of replicas that have the multiplexed model ID. Rank 1 is the list of replicas that have the fewest multiplexed models. Rank 2 is the list of all other replicas.
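The three-tier ranking described above can be illustrated with a small standalone sketch. It does not use Ray: `FakeReplica` and `rank_by_multiplex` are hypothetical names introduced only for illustration, and the sketch assumes that "fewest multiplexed models" is measured across the replicas passed in and that a replica matching rank 0 is not repeated in rank 1. The actual `MultiplexMixin` implementation may track this state differently.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class FakeReplica:
    """Hypothetical stand-in for RunningReplica (not a Ray class)."""
    replica_id: str
    model_ids: Set[str] = field(default_factory=set)


def rank_by_multiplex(
    replicas: List[FakeReplica], multiplexed_model_id: str
) -> List[List[FakeReplica]]:
    """Illustrative three-tier ranking; an assumption, not Ray's code."""
    ranked: List[List[FakeReplica]] = [[], [], []]
    if not replicas:
        return ranked

    # Assumption: "fewest" is the minimum model count among the given replicas.
    fewest = min(len(r.model_ids) for r in replicas)

    for r in replicas:
        if multiplexed_model_id in r.model_ids:
            ranked[0].append(r)  # rank 0: has the requested model ID
        elif len(r.model_ids) == fewest:
            ranked[1].append(r)  # rank 1: has the fewest models
        else:
            ranked[2].append(r)  # rank 2: every other replica
    return ranked


replicas = [
    FakeReplica("a", {"m1", "m2"}),
    FakeReplica("b", {"m3"}),
    FakeReplica("c", {"m2", "m3", "m4"}),
    FakeReplica("d"),
]
ranked = rank_by_multiplex(replicas, "m1")
ranked_ids = [[r.replica_id for r in tier] for tier in ranked]
print(ranked_ids)  # [['a'], ['d'], ['b', 'c']]
```

Under these assumptions, a caller would try the rank 0 replicas first, since they already have the requested model ID, and fall back to rank 1 and then rank 2 only when the earlier tiers are empty or unavailable.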