H2O 中的 GLRM - 性能指标 return NaN

Question

我正在使用广义低秩估计器来推断数据集中有关传感器读数的缺失值。我正在使用 H2O 创建和训练模型：

glrm = H2OGeneralizedLowRankEstimator(k=10,
                                      loss="quadratic",
                                      gamma_x=0.5,
                                      gamma_y=0.5,
                                      max_iterations=2000,
                                      recover_svd=True,
                                      init="SVD",
                                      transform="standardize")
glrm.train(training_frame=train)

训练模型后，提供的有关性能指标（MSE 和 RMSE）的信息都是 return NaN。有人知道为什么吗？首先我认为它可能与数据集中的 NaN 条目有关，但我已经尝试过一个完整的，并且出现了同样的问题。我需要此信息来对某些模型参数执行网格搜索，以 select 找到最佳参数。

非常感谢，

路易莎·诺盖拉

Answer 1

下面是在 docs 中找到的示例。期望得到 MSE 为 NaN。最好将其从输出中排除。检查您是否得到 Sum of Squared Error (Numeric) 或使用您定义为 "quadratic".

的损失函数 (objective)

import h2o
from h2o.estimators import H2OGeneralizedLowRankEstimator
h2o.init()

# Import the USArrests dataset into H2O:
arrestsH2O = h2o.import_file("https://s3.amazonaws.com/h2o-public-test-data/smalldata/pca_test/USArrests.csv")

# Split the dataset into a train and valid set:
train, valid = arrestsH2O.split_frame(ratios=[.8], seed=1234)

# Build and train the model:
glrm_model = H2OGeneralizedLowRankEstimator(k=4,
                                            loss="quadratic",
                                            gamma_x=0.5,
                                            gamma_y=0.5,
                                            max_iterations=700,
                                            recover_svd=True,
                                            init="SVD",
                                            transform="standardize")
glrm_model.train(training_frame=train)

Returns MSE、RMSE 和 NaN：

Model Details ============= H2OGeneralizedLowRankEstimator : Generalized Low Rank Modeling Model Key: GLRM_model_python_1617769810268_1

Model Summary: number_of_iterations final_step_size final_objective_value 0 58.0 0.00005 8.250804e-31

ModelMetricsGLRM: glrm ** Reported on train data. **

MSE: NaN RMSE: NaN
Sum of Squared Error (Numeric): 1.9833472629189004e-13
Misclassification Error (Categorical): 0.0

H2O 中的 GLRM - 性能指标 return NaN

GLRM in H2O - Performance Metrics return NaN

python

h2o

h2o.ai