RandomizedSearchCV 的 best_params 未按预期显示输出

Question

我试图改进我的随机森林分类器参数，但我得到的输出看起来不像我在查看其他人的一些示例后预期的输出。

我使用的代码：

train_x, test_x, train_y, test_y = train_test_split(df, avalanche, shuffle=False)

# Create the random forest
rf = RandomForestClassifier()
rf_random = RandomizedSearchCV(estimator=rf, param_distributions=random_grid, n_iter=100, cv=3, verbose=2, random_state=42, n_jobs=-1)

# Train the model
rf_random.fit(train_x, train_y)

print(rf_random.best_params_)

我得到的输出（这只是几行，但它给了我几百行）：

Fitting 3 folds for each of 100 candidates, totalling 300 fits
[CV] END bootstrap=True, max_depth=30, max_features=sqrt, min_samples_leaf=1, min_samples_split=5, n_estimators=400; total time=   1.3s
[CV] END bootstrap=True, max_depth=30, max_features=sqrt, min_samples_leaf=1, min_samples_split=5, n_estimators=400; total time=   1.3s
[CV] END bootstrap=True, max_depth=30, max_features=sqrt, min_samples_leaf=1, min_samples_split=5, n_estimators=400; total time=   1.4s
[CV] END bootstrap=False, max_depth=10, max_features=sqrt, min_samples_leaf=2, min_samples_split=5, n_estimators=1200; total time=   3.8

我期待的输出：

{'bootstrap': True,
 'max_depth': 70,
 'max_features': 'auto',
 'min_samples_leaf': 4,
 'min_samples_split': 10,
 'n_estimators': 400}

来自 this 网站。

有谁知道我做错了什么或者我应该更改什么以使输出成为我预期的样子？

Answer 1

你得到那个输出是因为 verbose=2。它的值越高，打印的文本就越多。这些文本提示不是结果。他们只是告诉您搜索当前适合数据的模型。

这对于查看搜索的当前进度很有用（有时可能需要几天时间，所以很高兴知道搜索当前处于流程的哪个部分）。如果您不想显示此文本，请设置 verbose=0.

您还没有得到预期的结果，因为 rf_random 仍在为数据拟合模型。

搜索完成后，使用 rf_random.best_params_ 获取所需的输出。

RandomizedSearchCV 的 best_params 未按预期显示输出

RandomizedSearchCV's best_params does not show output as expected

python

parameters

random-forest

scikit-learn

cross-validation