Replace "abc123def" with "abc 123 def" in a multibyte string

Posted on 2024-08-07 09:23:08

Normally I would just do this:

$str = preg_replace('#(\d+)#', ' $1 ', $str);

If I knew the input would always be UTF-8, I would add the lowercase "u" modifier to the pattern and I think that would be enough. But there are reports of UTF-8 text taking 2x, and in some cases 3x, the storage space it would need in its native character set, so I'm trying not to restrict the application to UTF-8.

Thus, I'm trying to stay away from my favorite preg_ functions.

Most things have been fairly simple so far, but I'm a little stuck on replacements where I would normally rely on preg_ character classes such as "\d".
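
For reference, here is a minimal sketch of one direction this could take, assuming the mbstring extension is loaded with its regex functions enabled. The helper name pad_digits and the sample strings are made up for illustration and are not part of my application. It uses mb_regex_encoding() to say how the string is encoded and mb_ereg_replace() to do the replacement, with [0-9] in place of "\d" so that only ASCII digits are matched.

```php
<?php
// Hypothetical helper (illustration only): surrounds each run of ASCII
// digits with spaces, treating $str as text in the given encoding.
function pad_digits(string $str, string $encoding): string
{
    $previous = mb_regex_encoding();   // remember the current regex encoding
    mb_regex_encoding($encoding);      // tell mbstring how $str is encoded

    // mb_ereg_replace() takes the pattern without delimiters;
    // \1 in the replacement refers to the first capture group.
    $result = mb_ereg_replace('([0-9]+)', ' \\1 ', $str);

    mb_regex_encoding($previous);      // restore the earlier setting

    // On failure mb_ereg_replace() does not return a string;
    // fall back to the unchanged input in that case.
    return is_string($result) ? $result : $str;
}

// Plain ASCII input behaves like the preg_replace() call above.
echo pad_digits('abc123def', 'UTF-8'), "\n";   // prints "abc 123 def"

// The same idea with text stored as Shift_JIS instead of UTF-8.
$sjis = mb_convert_encoding('あ123い', 'SJIS', 'UTF-8');
$padded = pad_digits($sjis, 'SJIS');
echo mb_convert_encoding($padded, 'UTF-8', 'SJIS'), "\n";
```

The encoding still has to be known and passed in for each string, so this only moves the problem from "always UTF-8" to "always labeled", and I have not checked which encodings mb_regex_encoding() accepts beyond the two used here.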
