Regex for splitting apparel sizes
I have the following input (just an example; the real input contains far messier data):
$values = [
'32/34, 36/38, 40/42, 44/46',
'40/42/44/46/48',
'58/60',
'39-42',
'40-50-60',
'24-25,26,28,30',
'36 40,5 44',
];
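and want to split it on separators such as / or , while keeping paired values together. A pair should be kept only when its separator does not occur more than once, so the result should look like this: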
'32/34, 36/38, 40/42, 44/46'
=> [ '32/34', '36/38', '40/42', '44/46' ]
'40/42/44/46/48'
=> [ '40', '42', '44', '46', '48' ]
'58/60'
=> [ '58/60' ]
'39-42'
=> [ '39-42' ]
'40-50-60'
=> [ '40', '50', '60' ]
'24-25,26,28,30'
=> [ '24-25', '26', '28', '30' ]
'36 40,5 44'
=> [ '36', '40,5', '44' ]
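For reference, the rule above can also be written out procedurally, which may be handy as an oracle when testing a candidate pattern. This is only a sketch under assumptions: the function name `splitSizesProcedurally` is hypothetical, a comma is taken to be a decimal mark only when it is followed by a single 0 or 5, and, unlike a digit-only regex, non-numeric tokens are passed through unchanged.

```php
<?php
// Hypothetical helper, not part of the original post.
function splitSizesProcedurally(string $value): array
{
    // Step 1: split on whitespace, or on a comma that is NOT a decimal mark.
    // Assumption: ",0" / ",5" followed by a non-digit is a decimal (e.g. "40,5").
    $tokens = preg_split('/\s+|,\s*(?![05](?!\d))/', trim($value), -1, PREG_SPLIT_NO_EMPTY);

    $result = [];
    foreach ($tokens as $token) {
        // Step 2: look at the "/" or "-" separators inside one token.
        $parts = preg_split('~[/-]~', $token, -1, PREG_SPLIT_NO_EMPTY);
        if (count($parts) === 2) {
            // Exactly one separator: keep the pair together.
            $result[] = $token;
        } else {
            // No separator, or a longer chain: return each number on its own.
            $result = array_merge($result, $parts);
        }
    }
    return $result;
}

print_r(splitSizesProcedurally('24-25,26,28,30'));
```

Splitting in two passes keeps the "pair versus chain" decision out of the pattern itself, at the cost of no longer being a single `preg_match_all` call.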
What I have so far is:
$separator = '^|$|[\s,\/-]';
$decimals = '\d+(?:[,.][05])?';
foreach ($values as $value) {
preg_match_all('/' .
'(?<=' . $separator . ')' .
'(?:' .
'(?P<var1>(' . $decimals . ')[\/-](?-1)|(?-1))' .
')(?=' . $separator . ')' .
'/ui', $value, $matches);
print_r($matches);
}
But this fails for 40/42/44/46/48, which returns:
[var1] => Array
(
[0] => 40/42
[1] => 44/46
[2] => 48
)
But every number should be returned on its own. Changing the regex to '(?P<var1>(' . $decimals . ')([\/-])(?-2)|(?-2))(?!)' works better, but still returns a wrong result:
[var1] => Array
(
[0] => 40
[1] => 42
[2] => 44
[3] => 46/48
)
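What would the correct regex look like?
As mentioned in the comments above, I know a 100% match is impossible because this is user input. But I found a regex that fits most of my use cases: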
(?<=^|$|[\s,\/-])(?:(?P<var1>(?<![\/-])(?!(?:(\d+(?:[,.][05])?)[\/-]){2}(?-1))(\d+(?:[,.][05])?)[\/-](?-1)|(?-1)))(?=^|$|[\s,\/-])
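To make the pattern above easier to read and to try out, here is a sketch that rebuilds it from the `$separator` and `$decimals` fragments used in the question and runs it over the sample inputs. The wrapper name `splitSizes` is hypothetical, and the comments are my reading of what each part does, not the author's wording.

```php
<?php
// Hypothetical wrapper around the pattern from the post.
function splitSizes(string $value): array
{
    $separator = '^|$|[\s,\/-]';
    $decimals  = '\d+(?:[,.][05])?';

    $pattern = '/'
        . '(?<=' . $separator . ')'            // must start after a separator or at the start
        . '(?:(?P<var1>'
        . '(?<![\/-])'                         // pair branch: not directly after "/" or "-"
        . '(?!(?:(' . $decimals . ')[\/-]){2}(?-1))' // and not the start of a chain of 3+ numbers
        . '(' . $decimals . ')[\/-](?-1)'      // number, "/" or "-", number
        . '|(?-1)'                             // fallback branch: a single number
        . '))'
        . '(?=' . $separator . ')'             // must end before a separator or at the end
        . '/ui';

    preg_match_all($pattern, $value, $matches);
    return $matches['var1'];
}

$values = [
    '32/34, 36/38, 40/42, 44/46',
    '40/42/44/46/48',
    '58/60',
    '39-42',
    '40-50-60',
    '24-25,26,28,30',
    '36 40,5 44',
];

foreach ($values as $value) {
    echo $value, ' => ', implode(' | ', splitSizes($value)), PHP_EOL;
}
```

The two extra guards are what separate this from the first attempt: the negative lookahead rejects a pair at the head of a longer chain, and the negative lookbehind rejects a pair in the middle or at the tail of one, so every member of a chain falls through to the single-number branch.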