GCP 统一人工智能平台 - AutoScaling
GCP AI Platform Unified - AutoScaling
在 GCP AI Platform Unified 的文档中说:
AI Platform scales your nodes based on CPU usage even if you have configured your prediction nodes to use GPUs; therefore if your prediction throughput is causing high GPU usage, but not high CPU usage, your nodes might not scale as you expect
我们如何根据 GPU 使用率进行扩展?
- 似乎 AI Platform legacy 能够做到这一点 [1],但它也在预览中,看起来此功能尚未添加到 AI Platform Unified。
- 您可以查看 AI Platform Unified 发行说明更新 [2] 以检查有关此功能的更新
[1]https://cloud.google.com/ai-platform/prediction/docs/machine-types-online-prediction#specifying_gpus
[2]https://cloud.google.com/ai-platform-unified/docs/resources/release-notes
在 GCP AI Platform Unified 的文档中说:
AI Platform scales your nodes based on CPU usage even if you have configured your prediction nodes to use GPUs; therefore if your prediction throughput is causing high GPU usage, but not high CPU usage, your nodes might not scale as you expect
我们如何根据 GPU 使用率进行扩展?
- 似乎 AI Platform legacy 能够做到这一点 [1],但它也在预览中,看起来此功能尚未添加到 AI Platform Unified。
- 您可以查看 AI Platform Unified 发行说明更新 [2] 以检查有关此功能的更新
[1]https://cloud.google.com/ai-platform/prediction/docs/machine-types-online-prediction#specifying_gpus
[2]https://cloud.google.com/ai-platform-unified/docs/resources/release-notes