从目录中删除没有特定单词的文本文件
Delete text files that don't have specific words in them from directory
我有一个包含约 2200 个文本文件的目录。我需要删除任何不包含我定义的特定词的文本文件。有人可以看看这段代码并就如何让它工作提出建议吗?现在,当我 运行 这个它说找不到目录 "C".
此外,我想确保此 运行 用于该目录中的每个文件。我需要包含下一个功能吗?
import os
path = r'C:\Users\user\Desktop\AFL codes to test'
words = ['buy', 'sell']
for root, dirs, files in os.walk(path):
for file in path:
if not any(words in file for words in words):
os.remove(file)
另外,这里是完整的回溯:
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
Traceback (most recent call last):
File "<ipython-input-23-dbc80e182b2b>", line 1, in <module>
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 705, in runfile
execfile(filename, namespace)
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 102, in execfile
exec(compile(f.read(), filename, 'exec'), namespace)
File "C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py", line 9, in <module>
os.remove(file)
FileNotFoundError: [WinError 2] The system cannot find the file specified: 'C'
This is the error after trying shutil.rmtree
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
Traceback (most recent call last):
File "<ipython-input-16-dbc80e182b2b>", line 1, in <module>
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 705, in runfile
execfile(filename, namespace)
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 102, in execfile
exec(compile(f.read(), filename, 'exec'), namespace)
File "C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py", line 12, in <module>
shutil.rmtree(full_path)
File "C:\Users\user\Anaconda31\lib\shutil.py", line 494, in rmtree
return _rmtree_unsafe(path, onerror)
File "C:\Users\user\Anaconda31\lib\shutil.py", line 376, in _rmtree_unsafe
onerror(os.listdir, path, sys.exc_info())
File "C:\Users\user\Anaconda31\lib\shutil.py", line 374, in _rmtree_unsafe
names = os.listdir(path)
NotADirectoryError: [WinError 267] The directory name is invalid: 'C:/Users/user/Desktop/AFL codes to test/newfile1.txt'
您应该将反斜杠替换为正斜杠。
path = r'C:\Users\user\Desktop\AFL codes to test'
应该是
path = 'C:/Users/user/Desktop/AFL codes to test'
编辑:这里是完整的代码,可以让你继续:
import os
path = 'C:/Users/user/Desktop/AFL codes to test'
words = ['buy', 'sell']
files = os.listdir(path)
for each_file in files:
full_path = "%s/%s" % (path, each_file)
each_file_content = open(full_path, 'r', encoding="utf-8").read()
if not any(word in each_file_content for word in words):
os.unlink(full_path)
我有一个包含约 2200 个文本文件的目录。我需要删除任何不包含我定义的特定词的文本文件。有人可以看看这段代码并就如何让它工作提出建议吗?现在,当我 运行 这个它说找不到目录 "C".
此外,我想确保此 运行 用于该目录中的每个文件。我需要包含下一个功能吗?
import os
path = r'C:\Users\user\Desktop\AFL codes to test'
words = ['buy', 'sell']
for root, dirs, files in os.walk(path):
for file in path:
if not any(words in file for words in words):
os.remove(file)
另外,这里是完整的回溯:
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
Traceback (most recent call last):
File "<ipython-input-23-dbc80e182b2b>", line 1, in <module>
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 705, in runfile
execfile(filename, namespace)
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 102, in execfile
exec(compile(f.read(), filename, 'exec'), namespace)
File "C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py", line 9, in <module>
os.remove(file)
FileNotFoundError: [WinError 2] The system cannot find the file specified: 'C'
This is the error after trying shutil.rmtree
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
Traceback (most recent call last):
File "<ipython-input-16-dbc80e182b2b>", line 1, in <module>
runfile('C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py', wdir='C:/Users/user/.spyder-py3')
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 705, in runfile
execfile(filename, namespace)
File "C:\Users\user\Anaconda31\lib\site-packages\spyder\utils\site\sitecustomize.py", line 102, in execfile
exec(compile(f.read(), filename, 'exec'), namespace)
File "C:/Users/user/.spyder-py3/DELETE FILES THAT DONT CONTAIN CERTAIN WORDS.py", line 12, in <module>
shutil.rmtree(full_path)
File "C:\Users\user\Anaconda31\lib\shutil.py", line 494, in rmtree
return _rmtree_unsafe(path, onerror)
File "C:\Users\user\Anaconda31\lib\shutil.py", line 376, in _rmtree_unsafe
onerror(os.listdir, path, sys.exc_info())
File "C:\Users\user\Anaconda31\lib\shutil.py", line 374, in _rmtree_unsafe
names = os.listdir(path)
NotADirectoryError: [WinError 267] The directory name is invalid: 'C:/Users/user/Desktop/AFL codes to test/newfile1.txt'
您应该将反斜杠替换为正斜杠。
path = r'C:\Users\user\Desktop\AFL codes to test'
应该是
path = 'C:/Users/user/Desktop/AFL codes to test'
编辑:这里是完整的代码,可以让你继续:
import os
path = 'C:/Users/user/Desktop/AFL codes to test'
words = ['buy', 'sell']
files = os.listdir(path)
for each_file in files:
full_path = "%s/%s" % (path, each_file)
each_file_content = open(full_path, 'r', encoding="utf-8").read()
if not any(word in each_file_content for word in words):
os.unlink(full_path)