ray.serve.schema.ControllerHealthMetrics#
- pydantic model ray.serve.schema.ControllerHealthMetrics[source]#
Health metrics for the Ray Serve controller.
These metrics help diagnose controller performance issues, especially as cluster size increases.
PublicAPI (alpha): This API is in alpha and may change before becoming stable.
- field application_state_update_duration_s: DurationStats | None = None#
Application state update duration statistics over a rolling window.
- field deployment_state_update_duration_s: DurationStats | None = None#
Deployment state update duration statistics over a rolling window.
- field event_loop_delay_s: float = 0.0#
Difference between actual and expected sleep duration. Positive values indicate an overloaded event loop.
- field handle_metrics_delay_ms: DurationStats | None = None#
Delay between when handle metrics are generated and when they reach the controller (rolling window, milliseconds).
- field loop_duration_s: DurationStats | None = None#
Control loop duration statistics over a rolling window.
- field node_update_duration_s: DurationStats | None = None#
Node update duration statistics over a rolling window.
- field proxy_state_update_duration_s: DurationStats | None = None#
Proxy state update duration statistics over a rolling window.
- field replica_metrics_delay_ms: DurationStats | None = None#
Delay between when replica metrics are generated and when they reach the controller (rolling window, milliseconds).