Skip to content

Conversation

@copybara-service
Copy link
Contributor

feat: Add autoscaling_target_dcgm_fi_dev_gpu_util, autoscaling_target_vllm_gpu_cache_usage_perc, autoscaling_target_vllm_num_requests_waiting options in model deployment on Endpoint & Model classes.

@product-auto-label product-auto-label bot added size: xl Pull request size is extra large. api: vertex-ai Issues related to the googleapis/python-aiplatform API. labels Jan 20, 2026
@copybara-service copybara-service bot force-pushed the copybara_857352519 branch 2 times, most recently from 75ecf16 to 84abe32 Compare January 21, 2026 02:56
@product-auto-label product-auto-label bot added size: l Pull request size is large. and removed size: xl Pull request size is extra large. labels Jan 21, 2026
@copybara-service copybara-service bot force-pushed the copybara_857352519 branch 3 times, most recently from 7e3cc2a to cbb5370 Compare January 21, 2026 19:38
…_vllm_gpu_cache_usage_perc, autoscaling_target_vllm_num_requests_waiting options in model deployment on Endpoint & Model classes.

PiperOrigin-RevId: 857352519
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

api: vertex-ai Issues related to the googleapis/python-aiplatform API. google-contributor size: l Pull request size is large.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant