在 INTARRAY 列中进行更快的搜索

Question

我有 table 大约 300 000 行 INT[] 列类型

每个数组包含大约 2000 个元素

我为这个数组列创建了索引

create index index_name ON table_name USING GIN (column_name)

然后运行查询：

SELECT COUNT(*)
FROM table_name 
WHERE
column_name@> ARRAY[1777]

此查询运行非常慢 Execution time: 66886.132 ms 并且如 EXPLAIN ANALYZE 所示，未使用 GIN 索引，仅使用 Seq Scan 索引。

为什么不使用 Postgres GIN 索引和主要目的地：如何尽可能快地运行以上查询？

编辑

这是 explain (analyze, verbose) 以上查询的结果

Aggregate  (cost=10000024724.75..10000024724.76 rows=1 width=0) (actual time=61087.513..61087.513 rows=1 loops=1)
  Output: count(*)
  ->  Seq Scan on public.users  (cost=10000000000.00..10000024724.00 rows=300 width=0) (actual time=12104.651..61087.500 rows=5 loops=1)
        Output: id, email, pass, nick, reg_dt, reg_ip, gender, curr_location, about, followed_tag_ids, avatar_img_ext, rep_tag_ids, rep_tag_id_scores, stats, status
        Filter: (users.rep_tag_ids @> '{1777}'::integer[])
        Rows Removed by Filter: 299995
Planning time: 0.110 ms
Execution time: 61087.564 ms

这是table和索引定义

CREATE TABLE users
(
  id serial PRIMARY KEY,
  rep_tag_ids integer[] DEFAULT '{}'
  -- other columns here
);

create index users_rep_tag_ids_idx ON users USING GIN (rep_tag_ids);

Answer 1

你应该帮助查询优化器使用索引。如果您还没有安装 PostgreSQL 的 intarray 扩展，然后使用 gin__int_ops 运算符 class.

重新创建索引

DROP INDEX users_rep_tag_ids_idx;
CREATE INDEX users_rep_tag_ids_idx ON users USING gin (rep_tag_ids gin__int_ops);

在 INTARRAY 列中进行更快的搜索

Faster search in INTARRAY column

postgresql

postgresql-9.4