ray.serve.schema.DeploymentSchema
- class ray.serve.schema.DeploymentSchema(*, name: str, num_replicas: PositiveInt | str | None = DEFAULT.VALUE, max_ongoing_requests: ConstrainedIntValue = DEFAULT.VALUE, max_queued_requests: StrictInt = DEFAULT.VALUE, user_config: Dict | None = DEFAULT.VALUE, autoscaling_config: Dict | None = DEFAULT.VALUE, graceful_shutdown_wait_loop_s: ConstrainedFloatValue = DEFAULT.VALUE, graceful_shutdown_timeout_s: ConstrainedFloatValue = DEFAULT.VALUE, health_check_period_s: ConstrainedFloatValue = DEFAULT.VALUE, health_check_timeout_s: ConstrainedFloatValue = DEFAULT.VALUE, ray_actor_options: RayActorOptionsSchema = DEFAULT.VALUE, placement_group_bundles: List[Dict[str, float]] = DEFAULT.VALUE, placement_group_strategy: str = DEFAULT.VALUE, max_replicas_per_node: int = DEFAULT.VALUE, logging_config: LoggingConfig = DEFAULT.VALUE)
Bases: BaseModel
Specifies options for one deployment within a Serve application. For each deployment, this can optionally be included in ServeApplicationSchema to override deployment options specified in code.
Methods
__init__: Create a new model by parsing and validating input data from keyword arguments.
construct: Creates a new model setting __dict__ and __fields_set__ from trusted or pre-validated data.
copy: Duplicate a model, optionally choose which fields to include, exclude and change.
dict: Generate a dictionary representation of the model, optionally specifying which fields to include or exclude.
json: Generate a JSON representation of the model, include and exclude arguments as per dict().
update_forward_refs: Try to update ForwardRefs on fields based on this Model, globalns and localns.
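The override behavior described in the class summary can be illustrated with plain dictionaries. The following is a minimal sketch that uses only the standard library: the field names come from the class signature above, but the deployment name, the option values, and the apply_overrides helper are hypothetical, and the merge shown is an assumption about the general idea, not Serve's actual implementation.

```python
from typing import Any, Dict

# Options as they might be declared in code for one deployment.
# The values are illustrative, not Serve's defaults.
code_options: Dict[str, Any] = {
    "name": "Translator",
    "num_replicas": 1,
    "max_ongoing_requests": 5,
    "health_check_period_s": 10.0,
}

# A config entry shaped like DeploymentSchema: `name` is required; every
# other field is optional and is simply left out when it is not overridden.
config_entry: Dict[str, Any] = {
    "name": "Translator",
    "num_replicas": 3,
    "ray_actor_options": {"num_cpus": 0.5},
}


def apply_overrides(code_opts: Dict[str, Any], entry: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical helper: fields present in the config entry win over
    the values declared in code; fields absent from the entry are kept."""
    if code_opts["name"] != entry["name"]:
        raise ValueError("config entry does not refer to this deployment")
    merged = dict(code_opts)  # copy so the code-level options stay untouched
    merged.update(entry)      # explicitly set config fields take precedence
    return merged


effective = apply_overrides(code_options, config_entry)
```

In this sketch only num_replicas and ray_actor_options change; max_ongoing_requests and health_check_period_s keep their code-level values because the config entry leaves them unset.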
Attributes