从目录中删除没有特定单词的文本文件

Delete text files that don't have specific words in them from directory

我有一个包含约 2200 个文本文件的目录。我需要删除任何不包含我定义的特定词的文本文件。有人可以看看这段代码并就如何让它工作提出建议吗?现在,当我 运行 这个它说找不到目录 "C".

此外,我想确保此 运行 用于该目录中的每个文件。我需要包含下一个功能吗?

import os

path = r'C:\Users\user\Desktop\AFL codes to test'
words = ['buy', 'sell']

for root, dirs, files in os.walk(path):
    for file in path:
        if not any(words in file for words in words):
            os.remove(file)

另外,这里是完整的回溯:

runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
Traceback (most recent call last):

  File "<ipython-input-23-dbc80e182b2b>", line 1, in <module>
    runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')

  File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 705, in runfile
    execfile(filename, namespace)

  File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 102, in execfile
    exec(compile(f.read(), filename, 'exec'), namespace)

  File "C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py", line 9, in <module>
    os.remove(file)

FileNotFoundError: [WinError 2] The system cannot find the file specified: 'C'

This is the error after trying shutil.rmtree

runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
Traceback (most recent call last):

  File "<ipython-input-16-dbc80e182b2b>", line 1, in <module>
    runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')

  File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 705, in runfile
    execfile(filename, namespace)

  File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 102, in execfile
    exec(compile(f.read(), filename, 'exec'), namespace)

  File "C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py", line 12, in <module>
    shutil.rmtree(full_path)

  File "C:\Users\user\Anaconda31\lib\shutil.py", line 494, in rmtree
    return _rmtree_unsafe(path, onerror)

  File "C:\Users\user\Anaconda31\lib\shutil.py", line 376, in _rmtree_unsafe
    onerror(os.listdir, path, sys.exc_info())

  File "C:\Users\user\Anaconda31\lib\shutil.py", line 374, in _rmtree_unsafe
    names = os.listdir(path)

NotADirectoryError: [WinError 267] The directory name is invalid: 'C:/Users/user/Desktop/AFL codes to test/newfile1.txt'

您应该将反斜杠替换为正斜杠。

path = r'C:\Users\user\Desktop\AFL codes to test'

应该是

path = 'C:/Users/user/Desktop/AFL codes to test'

编辑:这里是完整的代码,可以让你继续:

import os

path = 'C:/Users/user/Desktop/AFL codes to test'
words = ['buy', 'sell']

files = os.listdir(path)
for each_file in files:
    full_path = "%s/%s" % (path, each_file)
    each_file_content = open(full_path, 'r', encoding="utf-8").read()
    if not any(word in each_file_content for word in words):
       os.unlink(full_path)