连接派生表的慢查询

Slow query with joined derived tables

我对我的应用程序 "custom dashboard" 有一些查询,其中一个需要 10-12 秒才能执行。使用 EXPLAIN 我可以明白为什么它很慢,但我不知道该怎么办。这是查询:

SELECT person.PersonID,FullName,Furigana,qualdate FROM person
  INNER JOIN (
    SELECT pq.PersonID,MAX(ContactDate) AS qualdate FROM person pq
    INNER JOIN contact cq ON pq.PersonID=cq.PersonID
    WHERE cq.ContactTypeID IN (22,26,45) GROUP BY pq.PersonID
  ) qual ON person.PersonID=qual.PersonID
  LEFT OUTER JOIN (
    SELECT pe.personID,MAX(ContactDate) AS elimdate FROM person pe
    INNER JOIN contact ce ON pe.PersonID=ce.PersonID WHERE ce.ContactTypeID IN (25,31,30,41,23,42,2,33,35,29,12)
    GROUP BY pe.PersonID
  ) elim ON qual.PersonID=elim.PersonID
  LEFT OUTER JOIN (
    SELECT po.personID FROM person po
    INNER JOIN percat pc ON po.PersonID=pc.PersonID WHERE pc.CategoryID=38
  ) overseas ON qual.PersonID=overseas.PersonID
  WHERE (elimdate IS NULL OR qualdate > elimdate)
  AND qualdate < CURDATE()-INTERVAL 7 DAY
  AND overseas.PersonID IS NULL
  ORDER BY qualdate

这是 EXPLAIN 结果:

id  select_type  table        type    possible_keys           key         key_len  ref                      rows  Extra
1   PRIMARY      <derived2>   ALL     NULL                    NULL        NULL     NULL                     5447  Using where; Using temporary; Using filesort
1   PRIMARY      <derived3>   ALL     NULL                    NULL        NULL     NULL                     5565  Using where
1   PRIMARY      <derived4>   ALL     NULL                    NULL        NULL     NULL                     9     Using where; Not exists
1   PRIMARY      person       eq_ref  PRIMARY                 PRIMARY     4        qual.PersonID            1     
4   DERIVED      pc           ref     PRIMARY,CategoryID      CategoryID  4                                 8     
4   DERIVED      po           eq_ref  PRIMARY                 PRIMARY     4        kizuna_misa.pc.PersonID  1     Using index
3   DERIVED      pe           index   PRIMARY                 PRIMARY     4        NULL                     5964  Using index
3   DERIVED      ce           ref     PersonID,ContactTypeID  PersonID    4        kizuna_misa.pe.PersonID  1     Using where
2   DERIVED      pq           index   PRIMARY                 PRIMARY     4        NULL                     5964  Using index
2   DERIVED      cq           ref     PersonID,ContactTypeID  PersonID    4        kizuna_misa.pq.PersonID  1     Using where

我确定 EXPLAIN 的第一行揭示了问题(与类似的查询相比,第二行似乎并不太慢),但我不知道如何解决它。我已经在连接中出现的每一列上都有索引,但由于表是 <derived2> 等,我想索引是无关紧要的。

objective(因为对于不熟悉我的应用程序和架构的人来说它可能并不明显)是一个后续提示列表 - 如果发生了#22/26/45 联系人之一但未完成任何操作作为回应(其他几个联系人中的一个或通过类别分配指定此人在海外),然后此人应在等待一周后出现在后续跟进列表中。子查询比这些混乱的连接更容易编写和理解,但我无法检查日期的顺序(而且子查询通常也很慢)。

编辑(回应 Rick James):

MySQL 版本是 5.0.95(是的,我知道...)。这里是涉及的三个表的 SHOW CREATE TABLE,即使 person 中的大多数字段都是无关紧要的:

CREATE TABLE `contact` (
 `ContactID` int(11) unsigned NOT NULL auto_increment,
 `PersonID` int(11) unsigned NOT NULL default '0',
 `ContactTypeID` int(11) unsigned NOT NULL default '0',
 `ContactDate` date NOT NULL default '0000-00-00',
 `Description` text,
 PRIMARY KEY  (`ContactID`),
 KEY `ContactDate` (`ContactDate`),
 KEY `PersonID` (`PersonID`),
 KEY `ContactTypeID` (`ContactTypeID`)
) ENGINE=MyISAM AUTO_INCREMENT=16901 DEFAULT CHARSET=utf8

CREATE TABLE `percat` (
 `PersonID` int(11) unsigned NOT NULL default '0',
 `CategoryID` int(11) unsigned NOT NULL default '0',
 PRIMARY KEY  (`PersonID`,`CategoryID`),
 KEY `CategoryID` (`CategoryID`)
) ENGINE=MyISAM DEFAULT CHARSET=utf8

CREATE TABLE `person` (
 `PersonID` int(11) unsigned NOT NULL auto_increment,
 `FullName` varchar(100) NOT NULL default '',
 `Furigana` varchar(100) NOT NULL default '',
 `Sex` enum('','M','F') character set ascii NOT NULL default '',
 `HouseholdID` int(11) unsigned NOT NULL default '0',
 `Relation` varchar(6) character set ascii NOT NULL default '',
 `Title` varchar(6) NOT NULL default '',
 `CellPhone` varchar(30) character set ascii NOT NULL default '',
 `Email` varchar(70) character set ascii NOT NULL default '',
 `Birthdate` date NOT NULL default '0000-00-00',
 `Country` varchar(30) NOT NULL default '',
 `URL` varchar(150) NOT NULL default '',
 `Organization` tinyint(1) NOT NULL default '0',
 `Remarks` text NOT NULL,
 `Photo` tinyint(1) NOT NULL default '0',
 `UpdDate` date NOT NULL default '0000-00-00',
 PRIMARY KEY  (`PersonID`),
 KEY `Furigana` (`Furigana`),
 KEY `FullName` (`FullName`),
 KEY `Email` (`Email`),
 KEY `Organization` (`Organization`,`Furigana`)
) ENGINE=MyISAM AUTO_INCREMENT=6063 DEFAULT CHARSET=utf8

已尝试的建议:

我尝试实现 Rick James 将子选择放在字段列表中的建议(我什至不知道这是可能的),如下所示:

SELECT
  p.PersonID,
  FullName,
  Furigana,
  (SELECT MAX(ContactDate) FROM contact cq
    WHERE cq.PersonID=p.PersonID
    AND  cq.ContactTypeID IN (22,26,45))
  AS qualdate,
  (SELECT MAX(ContactDate) FROM contact ce
    WHERE ce.PersonID=p.PersonID
    AND ce.ContactTypeID IN (25,31,30,41,23,42,2,33,35,29,12))
  AS elimdate
FROM person p
WHERE (elimdate IS NULL OR qualdate > elimdate)
AND qualdate < CURDATE()-INTERVAL 7 DAY
AND NOT EXISTS (SELECT * FROM percat WHERE CategoryID=38 AND percat.PersonID=p.PersonID)
ORDER BY qualdate

但它抱怨:#1054 - Unknown column 'elimdate' in 'where clause'根据文档,WHERE 子句在字段列表之前被解释,所以这种方法行不通。

您有一个有趣的查询。我不确定最好的解决方案是什么。这里有两个猜测:

计划A

INDEX(qualdate)

可能会有帮助。请提供 SHOW CREATE TABLE.

此构造优化不佳:

FROM ( SELECT ... )
JOIN ( SELECT ... )

在你的情况下,overseas 可能应该变成 JOIN,而不是子选择。另外两个可能应该变成不同风格的依赖子查询:

SELECT  ..., 
        ( SELECT MAX(...) ... )  AS qualdate,
        ( SELECT MAX(...) ... )  AS elimdate
    FROM ...

你 运行 是 MySQL 的哪个版本?

B计划

如果可行,将它们折叠到子查询中,以便它们生成更少的行,从而减少外部查询的工作量。 (每个子查询一个)

elimdate IS NOT NULL
qualdate < CURDATE()-INTERVAL 7 DAY
overseas.PersonID IS NOT NULL

也许 NULL 测试适用于 LEFT 而此建议可能不适用。