Fastest way to remove duplicates from a list using each entry's instance variable

TL;DR

Is there a faster way to do the following?

listOfClasses = [fooA, fooB, fooC]
setOfStrings = {c.string for c in listOfClasses}
newListOfClasses = []
for c in listOfClasses:
  if c.string in setOfStrings:
    newListOfClasses.append(c)
    setOfStrings.remove(c.string)

Restrictions/Caveats:


Suppose I have a class

class Foo(object):
  def __init__(self,string):
    self.string = string

and a list of instances of that class, from which I want to remove every instance whose "string" instance variable duplicates another's

fooA = Foo("alice")
fooB = Foo("alice")
fooC = Foo("His Royal Highness The Prince Philip, Duke of Edinburgh, Earl of Merioneth, Baron Greenwich, Royal Knight of the Most Noble Order of the Garter, Extra Knight of the Most Ancient and Most Noble Order of the Thistle, Member of the Order of Merit, Grand Master and First and Principal Knight Grand Cross of the Most Excellent Order of the British Empire, Knight of the Order of Australia, Additional Member of the Order of New Zealand, Extra Companion of the Queen’s Service Order, Royal Chief of the Order of Logohu, Extraordinary Companion of the Order of Canada, Extraordinary Commander of the Order of Military Merit, Lord of Her Majesty’s Most Honourable Privy Council, Privy Councillor of the Queen’s Privy Council for Canada, Personal Aide-de-Camp to Her Majesty, Lord High Admiral of the United Kingdom.")

listOfClasses = [fooA, fooB, fooC]

Here I want to remove either fooA or fooB (it doesn't matter which), so that I'm left with

listOfClasses = [fooB, fooC] # for example

So far I have the following:

setOfStrings = {c.string for c in listOfClasses}
newListOfClasses = []
for c in listOfClasses:
  if c.string in setOfStrings:
    newListOfClasses.append(c)
    setOfStrings.remove(c.string)

For the above I get the following timings:

# len(listOfClasses) = 3
2.22 ms ± 24.5 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
# len(listOfClasses) = 20
2.29 ms ± 119 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
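For anyone who wants to reproduce measurements of this kind outside an interactive shell, here is a minimal sketch using the standard-library timeit module. The helper name dedupe_with_set and the generated test data are assumptions for illustration, not part of the original question, and the numbers it prints will differ from the ones above.

```python
import timeit


class Foo(object):
    def __init__(self, string):
        self.string = string


def dedupe_with_set(items):
    # Same logic as the loop in the question: keep an instance the first
    # time its string is seen, then drop that string from the set.
    remaining = {c.string for c in items}
    result = []
    for c in items:
        if c.string in remaining:
            result.append(c)
            remaining.remove(c.string)
    return result


# Hypothetical test data: 20 instances sharing 10 distinct strings.
items = [Foo("name%d" % (i % 10)) for i in range(20)]

# 7 runs of 100 loops each, mirroring the run/loop counts quoted above.
runs = timeit.repeat(lambda: dedupe_with_set(items), repeat=7, number=100)
best_per_loop = min(runs) / 100
print("best of 7 runs: %.3f microseconds per loop" % (best_per_loop * 1e6))
```

Timing only the deduplication call, with the list built beforehand, keeps object construction out of the measurement.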

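Relying on the uniqueness of dictionary keys in a dict comprehension should be very fast. Later entries overwrite earlier ones with the same key, so exactly one instance per distinct string survives: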

list({cls.string: cls for cls in listOfClasses}.values())

Full example:

class Foo(object):
    def __init__(self, string):
        self.string = string

fooA = Foo("alice")
fooB = Foo("alice")
fooC = Foo("His Royal Highness")

listOfClasses = [fooA, fooB, fooC]

print(list({cls.string: cls for cls in listOfClasses}.values()))
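Since the question says it doesn't matter which duplicate is kept, the comprehension is enough as it stands. If it ever did matter, the sketch below shows one possible way to choose between keeping the last or the first instance per string. The function names dedupe_keep_last and dedupe_keep_first are made up for illustration, and the ordering of the results assumes Python 3.7+, where dicts preserve insertion order.

```python
class Foo(object):
    def __init__(self, string):
        self.string = string


def dedupe_keep_last(items):
    # A later item with the same string overwrites the earlier value,
    # so the last instance per string survives.
    return list({item.string: item for item in items}.values())


def dedupe_keep_first(items):
    # setdefault stores a value only when the key is new,
    # so the first instance per string survives.
    unique = {}
    for item in items:
        unique.setdefault(item.string, item)
    return list(unique.values())


fooA = Foo("alice")
fooB = Foo("alice")
fooC = Foo("His Royal Highness")
listOfClasses = [fooA, fooB, fooC]

kept_last = dedupe_keep_last(listOfClasses)
kept_first = dedupe_keep_first(listOfClasses)

print(kept_last[0] is fooB)
print(kept_first[0] is fooA)
```

Both variants make a single pass over the list and do one hash lookup per item.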