ray.serve.config.AutoscalingConfig#
- class ray.serve.config.AutoscalingConfig(*, min_replicas: NonNegativeInt = 1, initial_replicas: NonNegativeInt | None = None, max_replicas: PositiveInt = 1, target_ongoing_requests: PositiveFloat | None = 2, metrics_interval_s: PositiveFloat = 10.0, look_back_period_s: PositiveFloat = 30.0, smoothing_factor: PositiveFloat = 1.0, upscale_smoothing_factor: PositiveFloat | None = None, downscale_smoothing_factor: PositiveFloat | None = None, upscaling_factor: PositiveFloat | None = None, downscaling_factor: PositiveFloat | None = None, downscale_delay_s: NonNegativeFloat = 600.0, downscale_to_zero_delay_s: NonNegativeFloat | None = None, upscale_delay_s: NonNegativeFloat = 30.0, aggregation_function: str | AggregationFunction = AggregationFunction.MEAN, policy: AutoscalingPolicy = None)[source]#
- Bases: - BaseModel- Config for the Serve Autoscaler. - Methods - Create a new model by parsing and validating input data from keyword arguments. - Creates a new model setting __dict__ and __fields_set__ from trusted or pre-validated data. - Duplicate a model, optionally choose which fields to include, exclude and change. - Generate a dictionary representation of the model, optionally specifying which fields to include or exclude. - Generate a JSON representation of the model, - includeand- excludearguments as per- dict().- Try to update ForwardRefs on fields based on this Model, globalns and localns. - Attributes