从分区的 ORC 加载分区的 BigQuery table

Question

我想从分区的 ORC 创建一个按 mydate 列分区 table 的 BigQuery。

GCS 中的文件：

mydate=2021-04-01/*.orc
...
mydate=2021-04-30/*.orc

命令 bq:

bq load --source_format=ORC --time_partitioning_field mydate --time_partitioning_type DAY mydataset.mytable gs://mydata/*.orc

当我运行这个命令时我有这个错误：The field specified for partitioning cannot be found in the schema 因为 mydate 不在 ORC 文件中。

我该如何管理？

感谢您的帮助，祝您有愉快的一天。

Answer 1

我认为我们可以通过提供通过 source_uri_prefix 字段编码的自定义分区键模式来做到这一点。

Load partitioned BigQuery table from partitioned ORC