Problem
When you use the databricks-agents library to update an existing model serving endpoint, calling the agents.deploy() method with the workload_size parameter set to any value other than SMALL still results in a deployment with workload size SMALL.
Example
The following code snippet is intended to redeploy a new model version on the existing serving endpoint with the MEDIUM size. However, due to this issue, the endpoint is deployed with the default size, SMALL.
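from databricks import agents
from databricks.sdk.service.serving import ServedModelInputWorkloadSize

# Redeploy a new model version to the existing endpoint, requesting MEDIUM.
agents.deploy(
    <your-uc-model-name>,
    <your-new-model-version>,
    workload_size=ServedModelInputWorkloadSize.MEDIUM,
)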
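To confirm which size an endpoint actually received after such a deployment, one option is to read the endpoint configuration back through the Databricks SDK for Python. The following is a minimal sketch, not part of the original article: the helper names served_workload_sizes and fetch_endpoint are hypothetical, and it assumes the databricks-sdk package is installed and authenticated in your environment.

```python
from types import SimpleNamespace


def served_workload_sizes(endpoint) -> dict:
    """Map each served entity name to its workload size as a plain string."""
    config = getattr(endpoint, "config", None)
    entities = getattr(config, "served_entities", None) or []
    sizes = {}
    for entity in entities:
        size = getattr(entity, "workload_size", None)
        # Depending on the SDK version, the size may be an enum or a string.
        sizes[entity.name] = getattr(size, "value", size)
    return sizes


def fetch_endpoint(endpoint_name: str):
    """Fetch endpoint details from the workspace (requires databricks-sdk)."""
    # Imported lazily so served_workload_sizes works without the SDK installed.
    from databricks.sdk import WorkspaceClient

    return WorkspaceClient().serving_endpoints.get(name=endpoint_name)


# Offline illustration with a stand-in object; no workspace is contacted.
demo_endpoint = SimpleNamespace(
    config=SimpleNamespace(
        served_entities=[SimpleNamespace(name="agent-v2", workload_size="Small")]
    )
)
demo_sizes = served_workload_sizes(demo_endpoint)
```

In a notebook, served_workload_sizes(fetch_endpoint("<your-endpoint-name>")) would return the sizes for a real endpoint. If a served entity reports Small after you requested MEDIUM, the deployment was affected by this issue.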
Cause
The specified workload_size parameter is not correctly applied when deploying a new agent version to an existing model serving endpoint.
Solution
Update the databricks-agents library to version 0.17.0 or later. This version fixes the issue, so the workload_size parameter is respected during deployments to existing model serving endpoints.
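Before redeploying, it can help to check which version of databricks-agents is installed in the current environment. The sketch below uses only the Python standard library. The helper names (parse_release, needs_upgrade, installed_agents_version) are hypothetical, and the version comparison is deliberately simple: it looks only at the numeric major.minor.patch components.

```python
from importlib.metadata import PackageNotFoundError, version

MIN_FIXED_VERSION = "0.17.0"  # first release with the fix, per this article


def parse_release(v: str) -> tuple:
    """Turn '0.17.0' into (0, 17, 0), ignoring suffixes such as 'rc1'."""
    parts = []
    for piece in v.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits or 0))
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def needs_upgrade(installed: str, minimum: str = MIN_FIXED_VERSION) -> bool:
    """Return True when the installed version is older than the fixed release."""
    return parse_release(installed) < parse_release(minimum)


def installed_agents_version():
    """Return the installed databricks-agents version, or None if absent."""
    try:
        return version("databricks-agents")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    current = installed_agents_version()
    if current is None:
        print("databricks-agents is not installed in this environment.")
    elif needs_upgrade(current):
        print(f"databricks-agents {current} is older than {MIN_FIXED_VERSION}; upgrade it.")
    else:
        print(f"databricks-agents {current} already includes the fix.")
```

If the check reports an older version, upgrading in a Databricks notebook is typically done with %pip install --upgrade "databricks-agents>=0.17.0", followed by a Python restart so the new version is loaded.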