Reference documentation and code samples for the Google Cloud Gke Recommender V1 Client class ModelServerInfo.
Model server information gives. Valid model server info combinations can be found using GkeInferenceQuickstart.FetchProfiles .
Generated from protobuf message google.cloud.gkerecommender.v1.ModelServerInfo
Namespace
Google \ Cloud \ GkeRecommender \ V1Methods
__construct
Constructor.
data
array
Optional. Data for populating the Message object.
↳ model
string
Required. The model. Open-source models follow the Huggingface Hub owner/model_name
format. Use GkeInferenceQuickstart.FetchModels
to find available models.
↳ model_server
string
Required. The model server. Open-source model servers use simplified, lowercase names (e.g., vllm
). Use GkeInferenceQuickstart.FetchModelServers
to find available servers.
↳ model_server_version
string
Optional. The model server version. Use GkeInferenceQuickstart.FetchModelServerVersions to find available versions. If not provided, the latest available version is used.
getModel
Required. The model. Open-source models follow the Huggingface Hub owner/model_name
format. Use GkeInferenceQuickstart.FetchModels
to find available models.
string
setModel
Required. The model. Open-source models follow the Huggingface Hub owner/model_name
format. Use GkeInferenceQuickstart.FetchModels
to find available models.
var
string
$this
getModelServer
Required. The model server. Open-source model servers use simplified,
lowercase names (e.g., vllm
). Use GkeInferenceQuickstart.FetchModelServers
to find available servers.
string
setModelServer
Required. The model server. Open-source model servers use simplified,
lowercase names (e.g., vllm
). Use GkeInferenceQuickstart.FetchModelServers
to find available servers.
var
string
$this
getModelServerVersion
Optional. The model server version. Use GkeInferenceQuickstart.FetchModelServerVersions to find available versions. If not provided, the latest available version is used.
string
setModelServerVersion
Optional. The model server version. Use GkeInferenceQuickstart.FetchModelServerVersions to find available versions. If not provided, the latest available version is used.
var
string
$this

