Streaming BigQuery API

According to the Google BigQuery documentation:

To help ensure data consistency, you can supply insertId for each inserted row. BigQuery remembers this ID for at least one minute. If you try to stream the same set of rows within that time period and the insertId property is set, BigQuery uses the insertId property to de-duplicate your data on a best effort basis.

I'm trying to achieve the same thing with the Ruby client, but I can't seem to find a way to set the insertId in the streaming request (reference attached here).

You can find insert_id on this class: Google::Apis::BigqueryV2::InsertAllTableDataRequest::Row

If you look at the Row definition in the source, insert_id is defined there.

You can also find more documentation on RubyInfo.
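As a rough sketch of how this might look in practice (not taken from the original answer): the helpers below derive a stable insert ID from each record's content, so a retried row carries the same ID, and then wrap the result in Google::Apis::BigqueryV2::InsertAllTableDataRequest::Row objects. The helper names (insert_id_for, row_attributes, build_insert_request) and the hashing scheme are assumptions made for illustration; only the Row and InsertAllTableDataRequest classes come from the google-api-client gem, which must be installed for the last helper to work.

```ruby
require 'digest'
require 'json'

# Hypothetical helper: derive a deterministic insert ID from the record's
# content, so streaming the same record again yields the same ID.
def insert_id_for(record)
  # Sort keys so that key order does not change the digest.
  canonical = record.sort_by { |key, _| key.to_s }.to_h
  Digest::SHA256.hexdigest(JSON.generate(canonical))
end

# Hypothetical helper: build the attributes each Row expects
# (insert_id for de-duplication, json for the row data).
def row_attributes(records)
  records.map { |record| { insert_id: insert_id_for(record), json: record } }
end

# Wraps the attributes in the gem's request classes.
# Assumes the google-api-client gem is installed; it is loaded lazily so
# the helpers above can be used without it.
def build_insert_request(records)
  require 'google/apis/bigquery_v2'
  rows = row_attributes(records).map do |attrs|
    Google::Apis::BigqueryV2::InsertAllTableDataRequest::Row.new(**attrs)
  end
  Google::Apis::BigqueryV2::InsertAllTableDataRequest.new(rows: rows)
end
```

The resulting request object would then be passed to the BigQuery service's insert_all_table_data call along with the project, dataset, and table IDs. Hashing the content is only one option; any ID that stays the same across retries of the same row (for example, a primary key from the source system) serves the same purpose.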