Lua 模式——我怎样才能让它发挥作用？

Question

我有一个文本文件要处理，示例内容如下：

[FCT-FCTVALUEXXXX-IA]
Name=value
Label = value
Zero or more lines of text
Abbr=
Zero or more lines of text
Field A=1
Field B=0
Zero or more lines of text
Hidden=N
[Text-FCT-FCTVALUEXXXX-IA-Note]
One or more note lines
[FCT-FCT-FCTVALUEZ-IE-DETAIL]
Zero or more lines of text
[FCT-FCT-FCTVALUEQ-IA-DETAIL]
Zero or more lines of text
[FCT-_FCTVALUEY-IA]
Name=value
Zero or more lines of text
Label=value
Zero or more lines of text
Field A=1
Abbr=value
Field A=1
Zero or more lines of text
Hidden=N

我需要找到这样的部分：

[FCT-FCTVALUEXXXX-IA]
Name=value
Label = value
Zero or more lines of text
Abbr=
Zero or more lines of text
Field A=1
Field B=0
Zero or more lines of text
Hidden=N

并提取FCT-FCTVALUEXXXX-AA、Name、Label、Abbr、Field A and B和Hidden，然后找到对应的section（如果存在）：

[Text-FCT-FCTVALUEXXXX-IA-Note]
One or more note lines

结束将注释行提取为单个字符串。

我不关心版块

[FCT-FCT-FCTVALUEZ-IE-DETAIL]
Zero or more lines of text

所有这三种部分都可以出现在文件中的任何位置，包括最后，并且部分之间的位置没有可预测的关系。

不能保证缩写和字段 A 和 B 的顺序，但它们总是出现在名称和标签之后和隐藏之前。

我目前拥有的：

strParse = "(%[FCT%-.-%-)([IF])([EA])%]%c+Name=(.-)%c.-Label=(.-)%c(.-)Hidden=(%a)%c" --cant pull everything out at once because the order of some fields is not predictable
for id, rt, ft, name, label, detail, hidden in strFacts:gmatch(strParse) do
    --extract details
    abbr=detail:match("Abbr=(.-)%c") --may be blank
    if abbr == nil then abbr = "" end
    FieldA = detail:match("Field A=(%d)")
    FieldB = detail:match("Field B=(%d)")               
    --need to sanitise id which could have a bunch of extraneous material tacked on the front and use it to get the Note
    ident=id:match(".*(%[FCT%-.-%-$)")..rt..ft
    Note = ParseAutonote(ident)  --this is a function to parse the note which I've yet to test so a dummy function returns ""                      
    tblResults[name]={ident, rt, ft, name, label, abbr, FieldA, FieldB, hidden, note}               
end

大部分工作正常（经过数小时的工作），但不工作的部分是：

(".*(%[FCT%-.-%-$)")

这应该会拉出最后出现的 FCT-sometext- 在字符串 id

我的逻辑：将搜索定位到字符串的末尾，并捕获字符串末尾以“[FCT-”开头并以“-”结尾的最短字符串。

给定值“[FCT-_ABCD-PDQR-”或 “[FCT-XYZ-DETAIL] 行文本 [FCT-_ABCD-PDQR-” 当我想要 return "FCT-_ABCD-PDQR-" 时它 return 为零。（注意 ABCD、PDQR 等可以是包含 Alpha、- 和 _ 的任意长度的文本）。

Answer 1

当你发现自己 (".*(%[FCT%-.-%-)$") 以你想要的方式工作时，其中 (".*(%[FCT%-.-%-$)") 没有。 $ 和 ^ 是锚点，必须出现在模式的末尾或开头，它们不能出现在捕获闭包内。

当锚字符出现在模式中的其他任何地方时，它们将成为您要查找的字符串的一部分，不包括 ^ 在集合中用于排除字符的情况，即：排除大写字符[^A-Z]

以下是使用示例字符串和问题中的模式进行模式匹配的示例。

print(string.match("[FCT-_ABCD-PDQR-", (".*(%[FCT%-.-%-$)")))  -- initial pattern
> nil
print(string.match("[FCT-_ABCD-PDQR-$", (".*(%[FCT%-.-%-$)"))) -- $ added to end of string
> [FCT-_ABCD-PDQR-$
print(string.match("[FCT-_ABCD-PDQR-", (".*(%[FCT%-.-%-)$")))  -- $ moved to end of pattern
> [FCT-_ABCD-PDQR-

Lua 模式——我怎样才能让它发挥作用？

Lua pattern -- how can I get this to work?

lua

lua-patterns