有效地从文件中读取主要版本号

Question

我正在编写一个脚本，它将通过读取文件来检查主要版本

在任何一行存储版本号[=22=]

像这样：

VERSION = 23.5.1

为了阅读这个数字 23，我正在这样做

filePath = os.path.join(os.getcwd(), 'Makefile')
with open(filePath, 'r') as mkfh:
    for line in mkfh:
        if line.startswith('VERSION'):
            print line.replace(' ','').split('=')[-1].split('.')[0]
            break

他们获取主版本的方法比使用替换和拆分两次更有效吗？

Answer 1

您不要使用 replace

print line.split('=')[-1].split('.')[0].strip()

lstrip比较合适

print line.split('=')[-1].split('.')[0].lstrip()

Answer 2

使用正则表达式：

import re

pattern = re.compile(r'VERSION\s*=\s*(\d+)')  # \s: space, \d: digits

with open('Makefile') as mkfh:
    for line in mkfh:
        matched = pattern.match(line)
        if matched:
            print matched.group(1)
            break

顺便说一句，如果您正在访问当前工作目录中的文件，则不需要使用 os.path.join。

Answer 3

我愿意 line.split(' = ')[1].split('.')[0]，但除此之外我觉得还不错。有些人可能会使用正则表达式解决方案，例如 re.search(r'VERSION = (\d+)', line).group(1).

Answer 4

如果效率是目标，那么对于像 makefile 这样的东西，您可能应该一次处理所有文件，而不是一次处理一行：

import os
import re

filePath = os.path.join(os.getcwd(), 'Makefile')
with open(filePath, 'rb') as mkfh:
    data = mkfh.read()

pattern = '^VERSION *= *(\d+)'
search = re.compile(pattern, re.MULTILINE).search

print(search(data).group(1))

有效地从文件中读取主要版本号

efficiently read major version number from a file

python

list

slice