从文件中提取特定范围的文本

Question

我正试图帮助我的妻子审阅工作文件 - 笔记的段落有不同的类别，我试图将它们提取为单独的字符串以保存到不同的文本文件中，以便我可以做其他事情稍后给他们。示例段落是：

客户行为观察：总体干扰行为数据趋势如下：这是所需的文本。观察客户对技能的反应 acquisition：整体技能获取数据趋势....

我试图只提取“总体干扰行为数据趋势如下：”到“观察客户对技能的反应”之前的文本获取：

我已经尝试使用正则表达式但没有成功，任何指导方面的帮助将不胜感激，谢谢！

Answer 1

参考自此 post Regular expression to return all characters between two special characters

import re

file = open("filename.txt", "r") # Insert the file name here

pat = r'.*?Overall interfering behavior data trends are as followed:(.*)Observations of Client\'s response to skill acquisition:.*'
match = re.search(pat, line)

for line in file:
    print(match.group(1).strip())

给出输出

'THIS IS THE DESIRED TEXT.'

从文件中提取特定范围的文本

Extracting specific range of text from a file

python

text-extraction