How to prepare a CSV file for AutoML entity extraction on GCP?

Question

I have created the JSONL files in the format Google specifies and uploaded them to Cloud Storage.

I prepared a CSV file whose first column holds the path to the JSONL file (gs://*example/file.jsonl) and whose second column holds 'TRAIN', 'VALIDATE', or 'TEST'.

I get the error message 'Cannot find the referenced file: TRAIN in request.'

How should I prepare the CSV file?

Answer 1

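It sounds like your columns are in the wrong order. The "ML Use" column should come first, followed by the GCS URI, which is why the importer tried to read 'TRAIN' as a file reference. See the sample CSV file from the quickstart: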

https://cloud.google.com/natural-language/automl/entity-analysis/docs/quickstart

gs://cloud-ml-data/NL-entity/dataset.csv

https://console.cloud.google.com/storage/browser/cloud-ml-data/NL-entity/?_ga=2.132412110.-1530629862.1558449111

$ cat Downloads/NL-entity_dataset.csv
TRAIN,gs://cloud-ml-data/NL-entity/train.jsonl
TEST,gs://cloud-ml-data/NL-entity/test.jsonl
VALIDATION,gs://cloud-ml-data/NL-entity/validation.jsonl
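If you generate the import CSV from a script, a small check on the column order can catch this mistake before upload. The following is only a minimal sketch: the helper name build_import_csv is hypothetical, and the set of allowed values is taken from the sample file above (note that it spells the third value VALIDATION, whereas the question mentions 'VALIDATE'), so treat both as assumptions rather than the service's full specification.

```python
import csv
import io

# Assumption: allowed "ML Use" values as they appear in the sample dataset.csv.
ML_USES = ("TRAIN", "VALIDATION", "TEST")


def build_import_csv(rows):
    """Return CSV text with one 'ML_USE,gs://...' line per (ml_use, gcs_uri) pair."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    for ml_use, gcs_uri in rows:
        # The ML Use label goes in the first column.
        if ml_use not in ML_USES:
            raise ValueError(f"first column must be one of {ML_USES}, got {ml_use!r}")
        # The Cloud Storage path goes in the second column.
        if not gcs_uri.startswith("gs://"):
            raise ValueError(f"second column must be a gs:// URI, got {gcs_uri!r}")
        writer.writerow([ml_use, gcs_uri])
    return buf.getvalue()


csv_text = build_import_csv([
    ("TRAIN", "gs://cloud-ml-data/NL-entity/train.jsonl"),
    ("TEST", "gs://cloud-ml-data/NL-entity/test.jsonl"),
    ("VALIDATION", "gs://cloud-ml-data/NL-entity/validation.jsonl"),
])
print(csv_text, end="")
```

Passing the pairs in the order used in the question (URI first, label second) raises a ValueError here instead of failing later during import.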


google-cloud-storage

google-cloud-platform

google-natural-language

google-cloud-automl