如何单独浏览子文件夹？

Question

实际上我有一个文件夹（下图中的数据），其中包含2个子文件夹，每个子文件夹都包含一些.png文件。我需要遍历每个子文件夹并对该子文件夹中的每个图像文件进行一些编码并保存结果。我使用了 os.walk()、os.listdir() 和 glob.glob()，但其中 none 有效。我尝试过的许多代码之一与以下相同：

path1 = Path('./data')
path2 = os.listdir(path1)

# loop through main folder to read each subfolder
for i in path2:
    if not i.startswith('.'):
       path3 = Path(os.path.join(path1,i))
       path4 = os.listdir(path3)

    #loop through each subfolder to read each file
       for j in path4:
           #some coding

enter image description here

如有任何建议，我们将不胜感激。

Answer 1

我建议使用 pathlib 库。该库是一个“面向对象的文件系统路径”模块，它结合了 Python 的文件系统模块的优点，如 os、os.path 和 glob。

from pathlib import Path

path1 = Path('./data')
files = [item.as_posix() for item in path1 .glob('**/*.png') if item.is_file()]

这将为您提供数据子文件夹中所有 .png 路径的列表。

Answer 2

你可以这样使用listdir()：

# pathname of root dir
images_path = "./data"

# filtered file extension
suffix = ".png"

# For each image,
for i in os.listdir(images_path):
    file = os.path.basename(i)
    fileName, fileExtension = os.path.splitext(file)
    # is it an image file with 'suffix' extension ?
    if os.path.isfile(images_path+'/'+i) and fileExtension == suffix:
        # do some coding

Answer 3

与 os.walk 类似：

import os
for root, dirs, files in os.walk(path_to_data_folder):
#    if not root.endswith(good_folder_name):
#        continue
    for fname in files:
        if fname_meets_my_criteria:
            fpath = os.path.join(root, fname)
            with open(fpath, 'r') as f, open(new_file_path, 'w') as newfile:
                data = f.read()
                # process file data
                new_data = func_that_processes_data(data)
                newfile.write(new_data)

有一点伪代码：

fname_meets_my_criteria 是比较的替代品，如果您想过滤要处理的文件，则需要它 - 它可能类似于 fname.edswith('.txt') 或 not fname.endswith('.cfg')
new_file_path 是处理数据将写入的新文件的路径和名称。

如果您打算在文件处理后覆盖它们，请改用此方法：

for root, dirs, files in os.walk(path_to_data_folder):
#    if not root.endswith(good_folder_name):
#        continue
    for fname in files:
        if fname_meets_my_criteria:
            fpath = os.path.join(root, fname)
            with open(fpath, 'r') as f:
                data = f.read()
            # process file data
            new_data = func_that_processes_data(data)
            with open(fpath, 'w') as f:
                f.write(new_data)

在我的两个示例中，文件都是作为文本文件打开的。如果您需要处理字节而不是 test/strings 用 mode arguments of 'rb' or 'wb'

打开文件

Answer 4

我能找到我的答案！这很简单，但我在命名时犯了一个错误。所以，下面写的代码可能会帮助其他人遇到同样的问题：

path = "./data/"

for subfolder in os.listdir(path):
    subfolder_name = path + subfolder

    for imgs in os.listdir(subfolder_name):
        imagename = subfolder_name + '/' + imgs

        # do some coding

如何单独浏览子文件夹？

How to walk through subfolders individually?

python

glob

os.walk

subdirectory

listdir