提取 python 中的子字符串
Extract substrings in python
我想解析一个字符串以提取花括号中的所有子字符串:
'The value of x is {x}, and the list is {y} of len {}'
应该产生:
(x, y)
然后我想格式化字符串以打印具有以下值的初始字符串:
str.format('The value of x is {x}, and the list is {y} of len {}', x, y, len(y))
我该怎么做?
Example usage:
def somefunc():
x = 123
y = ['a', 'b']
MyFormat('The value of x is {x}, and the list is {y} of len {}',len(y))
output:
The value of x is 123, and the list is ['a', 'b'] of len 2
您可以使用 string.Formatter.parse:
Loop over the format_string and return an iterable of tuples (literal_text, field_name, format_spec, conversion). This is used by vformat() to break the string into either literal text, or replacement fields.
The values in the tuple conceptually represent a span of literal text followed by a single replacement field. If there is no literal text (which can happen if two replacement fields occur consecutively), then literal_text will be a zero-length string. If there is no replacement field, then the values of field_name, format_spec and conversion will be None.
from string import Formatter
s = 'The value of x is {x}, and the list is {y} of len {}'
print([t[1] for t in Formatter().parse(s) if t[1]])
['x', 'y']
不确定这对您尝试做的事情有何帮助,因为您可以在函数中将 x 和 y 传递给 str.format 或使用 **locals:
def somefunc():
x = 123
y = ['a', 'b']
print('The value of x is {x}, and the list is {y} of len {}'.format(len(y),**locals()))
如果您想打印命名参数,您可以添加格式化程序输出:
def somefunc():
x = 123
y = ['a', 'b']
print("The named args are {}".format( [t[1] for t in Formatter().parse(s) if t[1]]))
print('The value of x is {x}, and the list is {y} of len {}'.format(len(y), **locals()))
这将输出:
The named args are ['x', 'y']
The value of x is 123, and the list is ['a', 'b'] of len 2
您可以使用re.findall
>>> import re
>>> s = 'The value of x is {x}, and the list is {y} of len {}'
>>> re.findall(r'\{([^{}]+)\}', s)
['x', 'y']
>>> tuple(re.findall(r'\{([^{}]+)\}', s))
('x', 'y')
提取值后你在做什么?
import re
st = "The value of x is {x}, and the list is {y} of len {}"
exp = re.compile(r"\{(.+?)\}")
print(tuple(exp.findall(st)))
输出是
('x', 'y')
我想解析一个字符串以提取花括号中的所有子字符串:
'The value of x is {x}, and the list is {y} of len {}'
应该产生:
(x, y)
然后我想格式化字符串以打印具有以下值的初始字符串:
str.format('The value of x is {x}, and the list is {y} of len {}', x, y, len(y))
我该怎么做?
Example usage:
def somefunc():
x = 123
y = ['a', 'b']
MyFormat('The value of x is {x}, and the list is {y} of len {}',len(y))
output:
The value of x is 123, and the list is ['a', 'b'] of len 2
您可以使用 string.Formatter.parse:
Loop over the format_string and return an iterable of tuples (literal_text, field_name, format_spec, conversion). This is used by vformat() to break the string into either literal text, or replacement fields.
The values in the tuple conceptually represent a span of literal text followed by a single replacement field. If there is no literal text (which can happen if two replacement fields occur consecutively), then literal_text will be a zero-length string. If there is no replacement field, then the values of field_name, format_spec and conversion will be None.
from string import Formatter
s = 'The value of x is {x}, and the list is {y} of len {}'
print([t[1] for t in Formatter().parse(s) if t[1]])
['x', 'y']
不确定这对您尝试做的事情有何帮助,因为您可以在函数中将 x 和 y 传递给 str.format 或使用 **locals:
def somefunc():
x = 123
y = ['a', 'b']
print('The value of x is {x}, and the list is {y} of len {}'.format(len(y),**locals()))
如果您想打印命名参数,您可以添加格式化程序输出:
def somefunc():
x = 123
y = ['a', 'b']
print("The named args are {}".format( [t[1] for t in Formatter().parse(s) if t[1]]))
print('The value of x is {x}, and the list is {y} of len {}'.format(len(y), **locals()))
这将输出:
The named args are ['x', 'y']
The value of x is 123, and the list is ['a', 'b'] of len 2
您可以使用re.findall
>>> import re
>>> s = 'The value of x is {x}, and the list is {y} of len {}'
>>> re.findall(r'\{([^{}]+)\}', s)
['x', 'y']
>>> tuple(re.findall(r'\{([^{}]+)\}', s))
('x', 'y')
提取值后你在做什么?
import re
st = "The value of x is {x}, and the list is {y} of len {}"
exp = re.compile(r"\{(.+?)\}")
print(tuple(exp.findall(st)))
输出是
('x', 'y')