PostgreSQL 中的时间 window

Question

我是 PostgreSQL 的新手（具体来说，我使用 Timescale 数据库）并且有一个关于时间的问题 windows。

数据：

date      |customerid|names   
2014-01-01|1         |Andrew 
2014-01-02|2         |Pete   
2014-01-03|2         |Andrew 
2014-01-04|2         |Steve  
2014-01-05|2         |Stef   
2014-01-06|3         |Stef  
2014-01-07|1         |Jason 
2014-01-08|1         |Jason

问题是：回到过去 x 天（从每一行来看），有多少个共享相同 ID 的不同名称？

对于 x=2 天，结果应如下所示：

date      |customerid|names  |count 
2014-01-01|1         |Andrew |1 
2014-01-02|2         |Pete   |1 
2014-01-03|2         |Andrew |2 
2014-01-04|2         |Steve  |3 
2014-01-05|2         |Stef   |3 
2014-01-06|3         |Stef   |1
2014-01-07|1         |Jason  |1
2014-01-08|1         |Jason  |1

这在 PostgreSQL 中是否可以在不对每一行使用循环的情况下实现？

补充说明：数据的时间间隔实际上并不是等距的。

非常感谢！

Answer 1

如果能使用window函数就好了：

select t.*,
       count(distinct name) over (partition by id
                                  order by date
                                  range between interval 'x day' preceding and current row
                                 ) as cnt_x
from t;

唉，那是不可能的。所以你可以使用横向连接：

select t.*, tt.cnt_x
from t left join lateral
     (select count(distinct t2.name) as cnt_x
      from t t2
      where t2.id = t.id and
             t2.date >= t.date - interval 'x day' and t2.date <= t.date
     ) tt
     on true;

为了性能，您需要 (id, date, name) 上的索引。

PostgreSQL 中的时间 window

Time window in PostgreSQL

sql

postgresql

timescaledb