带有嵌套 SQL 查询的 Postgres 视图或如何找到最后一个 INET

Postgres view with nested SQL query or how to find the last INET

我有这样的查询:

SELECT DISTINCT(orders.email), uuid_nil() AS customer_id, 'Order' AS customer_type, orders.first_name, orders.last_name, MAX(orders.paid_at) AS last_order_at, 1 AS order_count, SUM(orders.total_price_cents) AS total_spent_pennies
FROM orders
WHERE orders.state = 'paid' AND orders.customer_id IS null
GROUP BY orders.email, customer_id, orders.first_name, orders.last_name
UNION
SELECT DISTINCT(customers.email), customers.id AS customer_id, 'Customer' AS customer_type, customers.first_name, customers.last_name, MAX(orders.paid_at) AS last_order_at, COUNT(orders.*) AS order_count, SUM(orders.total_price_cents) AS total_spent_pennies
FROM customers
JOIN orders ON customers.id = orders.customer_id
GROUP BY customers.email, customers.id, customers.first_name, customers.last_name

看起来像:

+-------------------------------+--------------------------------------+---------------+------------+--------------+-------------------------+-------------+---------------------+
| email                         | customer_id                          | customer_type | first_name | last_name    | last_order_at           | order_count | total_spent_pennies |
+-------------------------------+--------------------------------------+---------------+------------+--------------+-------------------------+-------------+---------------------+
| blah@gmail.com                | 00000000-0000-0000-0000-000000000000 | Order         | Richard    | Doe          | 2015-12-18 14:45:22 UTC | 1           | 2000                |
| paul@blah.com                 | 00000000-0000-0000-0000-000000000000 | Order         | Paul       | Doe          | 2016-04-05 09:04:57 UTC | 1           | 5000                |
+-------------------------------+--------------------------------------+---------------+------------+--------------+-------------------------+-------------+---------------------+

我的问题是如何也包括他们的最后一个 IP 地址(INET 列)。一个日期我可以简单地使用 MAX 聚合函数但是 IP 地址显然没有。

基本上我如何结合上面的这个查询给我一个新的列,他们的 last_ip 地址是这样的:

SELECT browser_ip FROM orders
WHERE email = 'blah@gmail.com'
ORDER BY paid_at DESC
LIMIT 1

Cast to varchar, and use string_agg - 类似于:

SELECT email, paid_at, string_agg(browser_ip::varchar, ',') as ips 
WHERE email = 'blah@gmail.com'
GROUP BY email, paid_at
ORDER BY email, paid_at DESC
LIMIT 1

应该可以正常工作。

几个选项:

  1. 使用 LATERAL 子查询。请注意,这将强制进行嵌套循环连接。
  2. 编写一个函数来检索最新的 IP 地址并调用它。它还将强制嵌套循环。
  3. 使用 window 函数和过滤器。这通常会表现得更糟,因为您必须在加入之前扫描整个 table。

在你的情况下,因为联合,我可能会做第二个并做这样的事情:

CREATE OR REPLACE FUNCTION latest_ip(in_email text)
RETURNS inet LANGUAGE SQL AS
$$
SELECT paid_at, string_agg(browser_ip::varchar, ',') as ips 
WHERE email = in_email
GROUP BY paid_at
ORDER BY paid_at DESC
LIMIT 1
$$;

然后您只需在列列表中调用 latest_ip(orders.email)

另一个需要将上述内容复制为子查询,跟在联合的两个分支上的 LATERAL 语句之后。值得了解,但在这种情况下可能是维护问题。

您可能可以像这样使用一个简单的子查询,您只需要命名您的查询以便在它们之间进行引用。

http://www.techonthenet.com/postgresql/subqueries.php

例如,您的查询的第一部分类似于:

SELECT DISTINCT(c1.email), c1.id AS customer_id, 'Customer' AS customer_type, 
c1.first_name, c1.last_name, MAX(orders.paid_at) AS last_order_at, 
COUNT(orders.*) AS order_count, SUM(orders.total_price_cents) AS total_spent_pennies, 
(SELECT browser_ip FROM orders WHERE c1.email = orders.email 
ORDER BY paid_at DESC LIMIT 1) last_ip
FROM customers c1
JOIN orders ON c1.id = orders.customer_id
GROUP BY c1.email, c1.id, c1.first_name, c1.last_name