如何剪切第一个字符直到找到特殊字符?

How to cut first characters until a special character found?

我有 100 行,每行包含一些 sentences.Each 行,以一些字符开头,后跟 -rrb-

如何格式化文本,使每一行实际上都在 -rrb- 之后开始。

行示例:

Lorem ipsum -lrb- dolor -rrb- sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.

我想要这一行和每一行:

sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.

所以我想从 -rrb- 之后的单词开始,每 100 行开始( "sit" 对于此示例)。

使用拆分:

text = line.split('-rrb-', 1)[-1]

只需索引即可:

print(s[s.index('-rrb-')+6:])

输出:

sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.

@NavaW,你可以试试下面的代码,得到1行-rrb-后面的句子。

最后你可以循环超过 100 行并处理每一行。

line = "Lorem ipsum -lrb- dolor -rrb- sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur."
new_line = line.split("-rrb-")[1].strip()
print(new_line)

"""
sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.
"""