如何检查 Google AutoML Tables 模型的结构？

Question

我使用 Google AutoML Tables 训练了一个模型。我想检查训练模型的一般结构（使用了哪些算法，如果有的话进行了哪些预处理等）

在文档 here 的 "Viewing model architecture with Cloud Logging" 部分，我看到：

If more than one model was used to create the final model, the hyperparameters for each model are returned as an entry in the modelParameters array, indexed by position (0, 1, 2, and so on)

我的modelParameters数组如下图所示（展开了第一个和最后一个元素）。我只是在做 AutoML Tables quickstart，它使用 Bank marketing 开源数据集，所以我很惊讶它 return 如此复杂的模型（25 stacked/ensembled 模型？）。我认为仅模型 0（具有 300 棵树且最大深度为 15 的单个梯度提升决策树）就足够了。

此外，“25”是一个可疑的整数。我们是否确定文档是正确的并且这个列表实际上不是最好的 25 个，按准确度分数排名？有没有更好的方法来理解 Google AutoML Tables 正在生成的端到端模型（包括预处理）？

  modelParameters: [
   0: {
    hyperparameters: {
     Center Bias: "False"      
     Max tree depth: 15      
     Model type: "GBDT"      
     Number of trees: 300      
     Tree L1 regularization: 0      
     Tree L2 regularization: 0.10000000149011612      
     Tree complexity: 3      
    }
   }
   1: {…}    
   2: {…}    
   3: {…}    
   4: {…}    
   5: {…}    
   6: {…}    
   7: {…}    
   8: {…}    
   9: {…}    
   10: {…}    
   11: {…}    
   12: {…}    
   13: {…}    
   14: {…}    
   15: {…}    
   16: {…}    
   17: {…}    
   18: {…}    
   19: {…}    
   20: {…}    
   21: {…}    
   22: {…}    
   23: {…}    
   24: {
    hyperparameters: {
     Center Bias: "False"      
     Max tree depth: 9      
     Model type: "GBDT"      
     Number of trees: 500      
     Tree L1 regularization: 0      
     Tree L2 regularization: 0      
     Tree complexity: 0.10000000149011612      
    }
   }
  ]
 }

Answer 1

文档和您对结果的原始解释是正确的。在本例中，AutoML Tables 创建了一个包含 25 个模型的集合。

如果您单击 "Trials" link，它还会提供在搜索过程中尝试过的各个模型的完整列表。那应该比 25 大得多。

如何检查 Google AutoML Tables 模型的结构？

How to inspect the structure of a Google AutoML Tables model?

google-cloud-ml

automl

google-cloud-automl