如何在多个全文索引的任何列中搜索所有单词?

How do I search for ALL words within ANY columns of multiple Full Text indexes?

如果我在 ContactsCompanies 等表上有两个全文索引,我该如何编写查询以确保搜索短语的所有词都存在于 中两个索引中的

例如,如果我正在搜索 all 关键字存在于联系人记录或公司中的联系人,我将如何编写查询?

我已经尝试在联系人和公司表上执行 CONTAINSTABLE,然后将这些表连接在一起,但是如果我将搜索短语作为 '"searchTerm1*' AND '"searchTerm2*"' 传递给每个表,那么它仅在以下情况下匹配全部 搜索词在两个 索引和returns 太少的记录上。如果我像 '"searchTerm1*' OR '"searchTerm2*"' 一样传递它,那么它匹配搜索词 any(而不是 all)在 [=30] 中的位置=]索引和returns记录太多。

我还尝试创建一个索引视图,将联系人与公司联系起来,这样我就可以一次搜索所有列,但不幸的是,一个联系人可能属于多个公司,所以我要使用的 ContactKey因为视图的键不再是唯一的,所以创建失败。

看来我可能需要将短语分开并分别查询每个单词,然后将结果重新组合在一起以确保所有单词都匹配,但我想不出我该怎么做会写那个查询。

以下是该模型的示例:

Contact           CompanyContact    Company
--------------    --------------    ------------
ContactKey        ContactKey        CompanyKey
FirstName         CompanyKey        CompanyName
LastName

我在 FirstName、LastName 和 CompanyName 上有一个全文索引。

此答案已重建以解决您的问题,因此多个字符串必须存在于各个字段中。请注意 CompanyContactLink 链接中的单个键 table:

CREATE FULLTEXT CATALOG CompanyContact WITH ACCENT_SENSITIVITY = OFF
GO

CREATE TABLE Contact ( ContactKey INT IDENTITY, FirstName VARCHAR(20) NOT NULL, LastName VARCHAR(20) NOT NULL )
ALTER TABLE Contact ADD CONSTRAINT PK_Contact PRIMARY KEY NONCLUSTERED ( ContactKey )

CREATE TABLE Company ( CompanyKey INT IDENTITY, CompanyName VARCHAR(50) NOT NULL )
ALTER TABLE Company ADD CONSTRAINT PK_Company PRIMARY KEY NONCLUSTERED ( CompanyKey )
GO

CREATE TABLE CompanyContactLink ( CompanyContactKey INT IDENTITY NOT NULL, CompanyKey INT NOT NULL, ContactKey INT NOT NULL )
GO

INSERT INTO Contact ( FirstName, LastName ) VALUES ( 'Dipper', 'Pines' )
INSERT INTO Contact ( FirstName, LastName ) VALUES ( 'Mabel', 'Pines' )
INSERT INTO Contact ( FirstName, LastName ) VALUES ( 'Stanley', 'Pines' )
INSERT INTO Contact ( FirstName, LastName ) VALUES ( 'Soos', 'Ramirez' )
INSERT INTO Contact ( FirstName, LastName ) VALUES ( 'Wendy', 'Corduroy' )
INSERT INTO Contact ( FirstName, LastName ) VALUES ( 'Sheriff', 'Blubs' )
INSERT INTO Contact ( FirstName, LastName ) VALUES ( 'Bill', 'Cipher' )
INSERT INTO Contact ( FirstName, LastName ) VALUES ( 'Pine Dip', 'Nobody' )
INSERT INTO Contact ( FirstNAme, LastName ) VALUES ( 'Nobody', 'Pine Dip' )

INSERT INTO Company ( CompanyName ) VALUES ( 'Mystery Shack' )
INSERT INTO Company ( CompanyName ) VALUES ( 'Greesy Diner' )
INSERT INTO Company ( CompanyName ) VALUES ( 'Watertower' )
INSERT INTO Company ( CompanyName ) VALUES ( 'Manotaur Cave' )
INSERT INTO Company ( CompanyName ) VALUES ( 'Big Dipper Watering Hole' )
INSERT INTO Company ( CompanyName ) VALUES ( 'Lost Pines Dipping Pool' )
GO

INSERT INTO CompanyContactLink Values (3, 5), (1, 1), (1, 2), (1, 3), (1, 4), (1,5), (5,1), (3,1), (4,1)
GO

CREATE FULLTEXT INDEX ON Contact (LastName, FirstName)
KEY INDEX PK_Contact
ON CompanyContact
WITH STOPLIST = SYSTEM

CREATE FULLTEXT INDEX ON Company (CompanyName)
KEY INDEX PK_Company
ON CompanyContact
WITH STOPLIST = SYSTEM
GO

CREATE VIEW CompanyContactView
WITH SCHEMABINDING
AS
  SELECT
    CompanyContactKey,
    CompanyName,
    FirstName,
    LastName
  FROM
    dbo.CompanyContactLink
    INNER JOIN dbo.Company ON Company.CompanyKey = CompanyContactLink.CompanyKey
    INNER JOIN dbo.Contact ON Contact.ContactKey = CompanyContactLink.ContactKey
GO

CREATE UNIQUE CLUSTERED INDEX idx_CompanyContactView ON CompanyContactView (CompanyContactKey);
GO

CREATE FULLTEXT INDEX ON CompanyContactView (CompanyName, LastName, FirstName)
KEY INDEX idx_CompanyContactView
ON CompanyContact
WITH STOPLIST = SYSTEM
GO

-- Wait a few moments for the FULLTEXT INDEXing to take place.
-- Check to see how the index is doing ... repeat the following line until you get a zero back.

DECLARE @ReadyStatus INT
SET @ReadyStatus = 1
WHILE (@ReadyStatus != 0)
BEGIN
  SELECT @ReadyStatus = FULLTEXTCATALOGPROPERTY('CompanyContact', 'PopulateStatus')
END

SELECT
  CompanyContactView.*
FROM
  CompanyContactView
WHERE
  FREETEXT((FirstName,LastName,CompanyName), 'Dipper') AND
  FREETEXT((FirstName,LastName,CompanyName), 'Shack')
GO

为了你在 Watertower 与 Wendy 的榜样:

SELECT
  CompanyContactView.*
FROM
  CompanyContactView
WHERE
  FREETEXT((FirstName,LastName,CompanyName), 'Wendy') AND
  FREETEXT((FirstName,LastName,CompanyName), 'Watertower')
GO

我创建了一种适用于任意数量的全文索引和列的方法。使用这种方法,可以很容易地添加额外的方面进行搜索。

  1. 将搜索词组拆分成临时行 table
  2. 加入此临时 table 以在每个适用的全文索引上使用 CONTAINSTABLE 搜索每个搜索词。
  3. 将结果合并在一起,得到找到的搜索词的非重复计数。
  4. 过滤掉指定的搜索词数量与找到的搜索词数量不匹配的结果。

示例:

DECLARE @SearchPhrase nvarchar(255) = 'John Doe'
DECLARE @Matches Table(
    MentionedKey int,
    CoreType char(1),
    Label nvarchar(1000),
    Ranking int
)

-- Split the search phrase into separate words.
DECLARE @SearchTerms TABLE (Term NVARCHAR(100), Position INT)
INSERT INTO @SearchTerms (Term, Position)
SELECT dbo.ScrubSearchTerm(Term)-- Removes invalid characters and convert the words into search tokens for Full Text searching such as '"word*"'.
FROM dbo.SplitSearchTerms(@SearchPhrase)

-- Count the search words.
DECLARE @numSearchTerms int = (SELECT COUNT(*) FROM @SearchTerms)

-- Find the matching contacts.
;WITH MatchingContacts AS
(
    SELECT
        [ContactKey] = sc.[KEY],
        [Ranking] = sc.[RANK],
        [Term] = st.Term
    FROM @SearchTerms st
    CROSS APPLY dbo.SearchContacts(st.Term) sc -- I wrap my CONTAINSTABLE query in a Sql Function for convenience
)
-- Find the matching companies
,MatchingContactCompanies AS
(
    SELECT
        c.ContactKey,
        Ranking = sc.[RANK],
        st.Term
    FROM @SearchTerms st
    CROSS APPLY dbo.SearchCompanies(st.Term) sc
    JOIN dbo.CompanyContact cc ON sc.CompanyKey = cc.CompanyKey
    JOIN dbo.Contact c ON c.ContactKey = cc.ContactKey
)
-- Find the matches where ALL search words were found.
,ContactsWithAllTerms AS
(
    SELECT
        c.ContactKey,
        Ranking = SUM(x.Ranking)
    FROM (
        SELECT ContactKey, Ranking, Term FROM MatchingContacts  UNION ALL
        SELECT ContactKey, Ranking, Term FROM MatchingContactCompanies
    ) x
    GROUP BY c.ContactKey
    HAVING COUNT(DISTINCT x.Term) = @numSearchTerms
)
SELECT
    *
FROM ContactsWithAllTerms c

更新 根据评论,这是我的 SearchContacts 函数的示例。它只是一个简单的包装函数,因为我在多个过程中使用它。

CREATE FUNCTION [dbo].[SearchContacts]
(
    @contactsKeyword nvarchar(4000)
)
RETURNS @returntable TABLE
(
    [KEY] int,
    [RANK] int
)
AS
BEGIN
    INSERT @returntable
    SELECT [KEY],[RANK] FROM CONTAINSTABLE(dbo.Contact, ([FullName],[LastName],[FirstName]), @contactsKeyword)
    RETURN
END
GO