使用 H2O GBM 模型的 MOJO 文件重现预测

Question

我使用 H2O 版本 3.26.0.5 在二元问题中训练 GBM 模型，以预测正概率 class。我将模型文件保存为 MOJO 并使用此文件在新数据中生成预测：

## first, restart R session ##

# load the model
library(h2o)

h2o.init(nthreads = -1)

model <- h2o.import_mojo("path_to_mojo_file")

# load the new data input
input <- read_csv("path_to_new_data")

input_h2o <- as.h2o(input)

# predictions
predictions <- predict(model, input_h2o)

当我运行在我的计算机中执行此操作时，我得到的预测与我在生产环境中使用相同的 MOJO 文件进行预测时得到的预测不同。

MOJO 文件会发生这种情况吗？我相信一旦模型以 MOJO 格式保存，您就可以在任何环境中进行预测并获得相同的结果。有谁知道为什么会这样？

Answer 1

When I run this in my computer I get different predictions than when I use the same MOJO file to predict in a production environment.

生产环境运行是完全一样的R脚本吗？

Answer 2

最后发现是生产环境的脚本出错了。修复后，预测变得非常接近。

使用 H2O GBM 模型的 MOJO 文件重现预测

Reproduce predictions with MOJO file of a H2O GBM model

r

machine-learning

prediction

gbm

h2o