ray.serve.llm.deployments.VLLMService.__init__#
- async VLLMService.__init__(llm_config: LLMConfig, *, engine_cls: Type[VLLMEngine] | None = None, image_retriever_cls: Type[ImageRetriever] | None = None, model_downloader: LoraModelLoader | None = None)[source]#
Constructor of VLLMDeployment.
Only the llm_config is public api, the other arguments are private and used for testing.
- Parameters:
llm_config – LLMConfig for the model.
- Keyword Arguments:
engine_cls – Dependency injection for the vllm engine class. Defaults to
VLLMEngine
.image_retriever_cls – Dependency injection for the image retriever class. Defaults to
ImageRetriever
.model_downloader – Dependency injection for the model downloader object. Defaults to be initialized with
LoraModelLoader
.