如何将我的 Bigquery 查询转换为 AWS Athena 查询?

How do I translate my Bigquery query to AWS Athena query?

我目前在翻译这段 Bigquery 代码时遇到问题:

WITH
  dates AS (
  SELECT
    REPLACE('2020-09-10','-','') AS req_date)
SELECT
  hits.type,
  hits.hitNumber
FROM
  my_data_table_from_GA,
  UNNEST(hits) hits
WHERE
  _TABLE_SUFFIX = (
  SELECT
    req_date
  FROM
    dates)
  AND ClientId = '1234.5678910'
  AND visitstarttime = 1600000000;

My output looks like this in Bigquery

同样,我试图将查询转换为 Athena,如下所示:

SELECT 
  hits[1].type,
  hits[1].hitNumber
  
FROM 
  my_data_table_from_GA,
  UNNEST(hits) hits
WHERE 
 (from_iso8601_date(CAST((substring(date,1,4) || '-' || substring(date,5,2) || '-' || substring(date,7,2)) AS varchar))) = date('2020-09-10') 
 AND ClientId = '1234.5678910' AND visitstarttime = 1600000000;

然而,输出看起来不同as shown here. hitnumber 自始至终不应该是 1,并且 PAGE 与 EVENT 的比例看起来与上面 Bigquery 的输出不同。

我的源数据来自 Google Analytics,我最大的问题是我不确定如何在 Athena 中查询 'hits' 以使其在 Bigquery 中产生相同的输出。我尝试了 hits.type,但它抛出错误提示“Expression hits is not of type ROW”。这就是我放置 'hits1.type' 的原因,但我认为这有问题。如果有人可以帮助我,我将不胜感激!

我设法解决了我的问题。

这对我有用:

SELECT 
  hit.type,
  hit.hitNumber
  
FROM 
  my_data_table_from_GA,
  UNNEST(hits) WITH ORDINALITY t(hit, index)
WHERE 
 (from_iso8601_date(CAST((substring(date,1,4) || '-' || substring(date,5,2) || '-' || substring(date,7,2)) AS varchar))) = date('2020-09-10') 
 AND ClientId = '1234.5678910' AND visitstarttime = 1600000000;