从 S3 保存和加载模型

Question

有什么方法可以让 H2O 集群 save/load 直接连接到 S3？

model.save('s3n://my-domain/gbm-from-the-future')
model.load('s3n://my-domain/gbm-from-the-future')

从历史上看，我通过以下方式实现了这一目标： - 保存到集群外的文件系统 - 与 S3 同步 - 从 S3 下载 - 从文件系统加载

显然，集群本身必须有更好的方法。

Answer 1

根据 Python docs for h2o.save_model() this is already supported (you did not mention which of the APIs you are using, so I am using Python as an example). Have you tried putting an S3 address in the file location argument of the standard model save and load functions? If you find that this is not working, please file a bug report on the H2O JIRA.

从 S3 保存和加载模型

Save and Load models from S3

amazon-s3

h2o