ray.serve.metrics.Histogram#
- class ray.serve.metrics.Histogram(name: str, description: str = '', boundaries: List[float] = None, tag_keys: Tuple[str] | None = None)[source]#
- Bases: - Histogram- Tracks the size and number of events in buckets. - Histograms allow you to calculate aggregate quantiles such as 25, 50, 95, 99 percentile latency for an RPC. - This corresponds to Prometheus’ histogram metric: https://prometheus.io/docs/concepts/metric_types/#histogram - Serve-related tags (“deployment”, “replica”, “application”, “route”) are added automatically if not provided. - @serve.deployment class MyDeployment: def __init__(self): self.my_histogram = Histogram( "my_histogram", description=("Histogram of the __call__ method running time."), boundaries=[1,2,4,8,16,32,64], tag_keys=("model",), ) self.my_histogram.set_default_tags({"model": "123"}) def __call__(self): start = time.time() self.my_histogram.observe(time.time() - start) - Parameters:
- name – Name of the metric. 
- description – Description of the metric. 
- boundaries – Boundaries of histogram buckets. 
- tag_keys – Tag keys of the metric. 
 
 - PublicAPI (beta): This API is in beta and may change before becoming stable. - Methods - Observe the given value, add serve context tag values to the tags - Attributes - Return information about histogram metric.