GCP 统一人工智能平台 - AutoScaling

GCP AI Platform Unified - AutoScaling

GCP AI Platform Unified 的文档中说:

AI Platform scales your nodes based on CPU usage even if you have configured your prediction nodes to use GPUs; therefore if your prediction throughput is causing high GPU usage, but not high CPU usage, your nodes might not scale as you expect

我们如何根据 GPU 使用率进行扩展?

  1. 似乎 AI Platform legacy 能够做到这一点 [1],但它也在预览中,看起来此功能尚未添加到 AI Platform Unified。
  2. 您可以查看 AI Platform Unified 发行说明更新 [2] 以检查有关此功能的更新

[1]https://cloud.google.com/ai-platform/prediction/docs/machine-types-online-prediction#specifying_gpus

[2]https://cloud.google.com/ai-platform-unified/docs/resources/release-notes