python randomizer - 在双嵌套级别的花括号之间获取随机文本

python randomizer - get random text between curly braces with double nesting level

嘿,我需要创建简单的 python 随机化器。输入示例:

{{hey|hello|hi}|{privet|zdravstvuy|kak dela}|{bonjour|salut}}, can {you|u} give me advice?

输出应该是:

hello, can you give me advice

我有一个脚本,可以做到这一点,但只能在一个嵌套级别

with open('text.txt', 'r') as text:
    matches = re.findall('([^{}]+)', text.read())
words = []
for match in matches:
    parts = match.split('|')
    if parts[0]:
        words.append(parts[random.randint(0, len(parts)-1)])
message = ''.join(words)

这对我来说还不够)

Python正则表达式不支持嵌套结构,因此您必须找到其他方法来解析字符串。

这是我的快速拼凑:

def randomize(text):
    start= text.find('{')
    if start==-1: #if there are no curly braces, there's nothing to randomize
        return text

    # parse the choices we have
    end= start
    word_start= start+1
    nesting_level= 0
    choices= [] # list of |-separated values
    while True:
        end+= 1
        try:
            char= text[end]
        except IndexError:
            break # if there's no matching closing brace, we'll pretend there is.
        if char=='{':
            nesting_level+= 1
        elif char=='}':
            if nesting_level==0: # matching closing brace found - stop parsing.
                break
            nesting_level-= 1
        elif char=='|' and nesting_level==0:
            # put all text up to this pipe into the list
            choices.append(text[word_start:end])
            word_start= end+1
    # there's no pipe character after the last choice, so we have to add it to the list now
    choices.append(text[word_start:end])
    # recursively call this function on each choice
    choices= [randomize(t) for t in choices]
    # return the text up to the opening brace, a randomly chosen string, and
    # don't forget to randomize the text after the closing brace 
    return text[:start] + random.choice(choices) + randomize(text[end+1:])

正如我上面所说,嵌套在这里基本上是无用的,但是如果你想保留你当前的语法,一种处理它的方法是在循环中替换大括号直到没有更多:

import re, random

msg = '{{hey|hello|hi}|{privet|zdravstvuy|kak dela}|{bonjour|salut}}, can {you|u} give me advice?'


while re.search(r'{.*}', msg):
    msg = re.sub(
        r'{([^{}]*)}', 
        lambda m: random.choice(m.group(1).split('|')), 
        msg)

print msg
# zdravstvuy, can u give me advice?