正则表达式从 PHP 7.4 开始失败,在 7.3 中工作

RegEx failing since PHP 7.4, working in 7.3

知道为什么这个 preg_match 可以达到 PHP7.2 但在 7.3+ 时失败了吗?

$word = 'umweltfreundilch'; //real life example :/
preg_match('/^(?U)(.*(?:[aeiouyäöü])(?:[^aeiouyäöü]))(?X)(.*)$/u', $word, $matches);
var_dump($matches);

Warning: preg_match(): Compilation failed: unrecognized character after (? or (?-

PHP 7.2及以下输出:

array(3) {
  [0]=>
  string(16) "umweltfreundilch"
  [1]=>
  string(2) "um"
  [2]=>
  string(14) "weltfreundilch"
}

RegEx 似乎没问题,不是吗?
https://regex101.com/r/LGdhaM/1

在 PHP 7.3 及更高版本中,Perl-Compatible 正则表达式 (PCRE) 扩展 was upgraded 到 PCRE2。

PCRE2 syntax documentation 未将 (?X) 列为可用的内联修饰符选项。以下是支持的选项:

  (?i)            caseless
  (?J)            allow duplicate named groups
  (?m)            multiline
  (?n)            no auto capture
  (?s)            single line (dotall)
  (?U)            default ungreedy (lazy)
  (?x)            extended: ignore white space except in classes
  (?xx)           as (?x) but also ignore space and tab in classes
  (?-...)         unset option(s)
  (?^)            unset imnsx options

但是,您实际上可以在尾随定界符后使用 X 标志:

preg_match('/^(?U)(.*[aeiouyäöü][^aeiouyäöü])(.*)$/Xu', $word, $matches)

PHP 7.4 demo

要取消 (?U) 效果,您可以使用以下两个选项之一:(?-U) 内联修饰符,如

preg_match('/^(?U)(.*[aeiouyäöü][^aeiouyäöü])(?-U)(.*)$/u', $word, $matches);
//                                           ^^^^^

或者,将受影响的模式包含在 (?U:...) 修饰符组中:

preg_match('/^(?U:(.*[aeiouyäöü][^aeiouyäöü]))(.*)$/u', $word, $matches);
//            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^        

preg_match(): Compilation failed: invalid range in character class at offset.

中查看有关 PHP 7.3+ 中正则表达式处理更改的更多信息