如何在 Postgres 9.4 中忽略没有唯一约束的重复项？

Question

我目前在我们的旧数据库 (postgres 9.4) table 中遇到一个问题，其中包含一些重复的行。我想确保不再生成重复行。

但我还想保留已经生成的重复行。因此，我无法对这些列（多列）应用唯一约束。

我创建了一个触发器，它会检查该行是否已存在并相应地引发异常。但是在处理并发事务时它也会失败。

示例：

TAB1

col1   |  col2  |  col3  |
------------------------------------
1      |  A     |  B     |   -- 
2      |  A     |  B     |   -- already present duplicates for column col2 and col3(allowed)
3      |  C     |  D     |

INSERT INTO TAB1 VALUES(4 , 'A' , 'B') ; -- This insert statement will not be allowed.

注意：由于数据库版本较旧，我无法使用冲突。

Answer 1

您可以尝试创建一个 partial index 以仅对 table 行的子集具有唯一约束：

例如：

create unique index on t(x) where (d > '2020-01-01');

Answer 2

据推测，您不希望新行重复历史行。如果是这样，您可以这样做，但它需要修改 table 并添加一个新列。

alter table t add duplicate_seq int default 1;

然后更新此列以识别现有的重复项：

update t
    set duplicate_seq = seqnum
    from (select t.*, row_number() over (partition by col order by col) as seqnum
          from t
         ) tt
    where t.<primary key> = tt.<primary key>;

现在，创建唯一索引或约束：

alter table t add constraint unq_t_col_seq on t(col, duplicate_seq);

当您插入行时，不要为 duplicate_seq 提供值。默认值为 1。这将与任何现有值或最近输入的重复项发生冲突。历史重复将被允许。

如何在 Postgres 9.4 中忽略没有唯一约束的重复项？

How to ignore duplicates without unique constraint in Postgres 9.4?

sql

postgresql

duplicates

postgresql-9.4