如何剪切第一个字符直到找到特殊字符?
How to cut first characters until a special character found?
我有 100 行,每行包含一些 sentences.Each 行,以一些字符开头,后跟 -rrb-
。
如何格式化文本,使每一行实际上都在 -rrb-
之后开始。
行示例:
Lorem ipsum -lrb- dolor -rrb- sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.
我想要这一行和每一行:
sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.
所以我想从 -rrb-
之后的单词开始,每 100 行开始(
"sit" 对于此示例)。
使用拆分:
text = line.split('-rrb-', 1)[-1]
只需索引即可:
print(s[s.index('-rrb-')+6:])
输出:
sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.
@NavaW,你可以试试下面的代码,得到1行-rrb-后面的句子。
最后你可以循环超过 100 行并处理每一行。
line = "Lorem ipsum -lrb- dolor -rrb- sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur."
new_line = line.split("-rrb-")[1].strip()
print(new_line)
"""
sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.
"""
我有 100 行,每行包含一些 sentences.Each 行,以一些字符开头,后跟 -rrb-
。
如何格式化文本,使每一行实际上都在 -rrb-
之后开始。
行示例:
Lorem ipsum -lrb- dolor -rrb- sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.
我想要这一行和每一行:
sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.
所以我想从 -rrb-
之后的单词开始,每 100 行开始(
"sit" 对于此示例)。
使用拆分:
text = line.split('-rrb-', 1)[-1]
只需索引即可:
print(s[s.index('-rrb-')+6:])
输出:
sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.
@NavaW,你可以试试下面的代码,得到1行-rrb-后面的句子。
最后你可以循环超过 100 行并处理每一行。
line = "Lorem ipsum -lrb- dolor -rrb- sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur."
new_line = line.split("-rrb-")[1].strip()
print(new_line)
"""
sit amet, consectetur adipiscing elit. Nam quis dolor a sapien efficitur fermentum egestas ut elit. Nulla porttitor diam id orci bibendum volutpat. Quisque imperdiet euismod neque in efficitur.
"""