正则表达式 - 任何文本到 URL 友好的表达式
PHP 正则表达式脚本可删除任何非字母或数字 0 到 9 的内容,并将空格替换为连字符 - 更改为小写字母,确保只有一个连字符 - 单词之间没有 -- 或 --- 等。
例如:
示例:敏捷的棕色狐狸跳得很快 结果:the-quick-brown-fox-jumped
示例:the Quick Brown Fox Jumped! 结果:the-quick-brown-fox-jumped
示例:the Quick Brown Fox - 跳了! 结果: the-quick-brown-fox-jumped
示例: 快速 ~`!@#$%^ &*()_+= ------- Brown {}|][ :"'; < >?.,/狐狸-跳了起来! 结果:the-quick-brown-fox-jumped
示例:The Quick 1234567890 ~`!@#$%^ &*()_+= ------- Brown {}|][ :"'; < ;>?.,/ 狐狸 - 跳了起来! 结果:the-quick-1234567890-brown-fox-jumped
有人知道正则表达式吗?
谢谢!
PHP regular expression script to remove anything that is not a alphabetical letter or number 0 to 9 and replace space to a hyphen - change to lowercase make sure there is only one hyphen - between words no -- or --- etc.
For example:
Example: The quick brown fox jumped
Result: the-quick-brown-fox-jumped
Example: The quick brown fox jumped!
Result: the-quick-brown-fox-jumped
Example: The quick brown fox - jumped!
Result: the-quick-brown-fox-jumped
Example: The quick ~`!@#$%^ &*()_+= ------- brown {}|][ :"'; <>?.,/ fox - jumped!
Result: the-quick-brown-fox-jumped
Example: The quick 1234567890 ~`!@#$%^ &*()_+= ------- brown {}|][ :"'; <>?.,/ fox - jumped!
Result: the-quick-1234567890-brown-fox-jumped
Anybody have idea for the regular expression?
Thanks!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(5)
由于您似乎希望所有非字母数字字符序列都被单个连字符替换,因此您可以使用这个:
但这可能会导致前导或尾随连字符可以用
trim
:要将结果转换为小写,请使用
strtolower
:所以全部放在一起:
或者用一个紧凑的单行:
Since you seem to want all sequences of non-alphanumeric characters being replaced by a single hyphen, you can use this:
But this can result in leading or trailing hyphens that can be removed with
trim
:And to convert the result into lowercase, use
strtolower
:So all together:
Or in a compact one-liner:
我正在处理类似的事情,我想出了这段小代码,它还考虑了拉丁字符的使用。
这是示例字符串:
$str = 'El veloz murciélagohindu comía fe&@#$%&!"#%&?¡?*liz cardillo y kiwi。La cigüeña ¡ ;.-|°Øtocaba el saxofón detrás del palmenque de paja';
首先,我将字符串转换为 htmlentities,以便稍后使用。
$friendURL = htmlentities($str, ENT_COMPAT, " UTF-8", false);
然后我用相应的 ascii 字符替换拉丁字符(
á
变为a
,Ü
变为U
等):$FriendlyURL = preg_replace('/&([az]{1,2})(?:acute|circ|lig|grave|ring|tilde |uml|cedil|caron);/i','\1',$FriendlyURL);
然后,我将字符串从 html 实体转换回符号,以便稍后使用
$FriendlyURL = html_entity_decode 。 ($FriendlyURL,ENT_COMPAT, "UTF-8");
接下来,我将所有非字母数字字符替换为连字符
$FriendlyURL = preg_replace('/[^a-z0-9-]+/i' , '-', $FriendlyURL);
我删除了字符串中多余的连字符:
$FriendlyURL = preg_replace('/-+/', '-', $FriendlyURL);
我删除了前导和尾随连字符:
$FriendlyURL = TRIM($FriendlyURL, '-');
最后全部转换为小写:
$FriendlyURL = strtolower($FriendlyURL);
全部一起:
测试:
结果:
我想Gumbo的答案更适合你的问题,而且代码更短,但我认为它对其他人有用。
干杯,
阿德里安娜
I was just working with something similar, and I came up with this little piece of code, it also contemplates the use of latin characters.
This is the sample string:
$str = 'El veloz murciélago hindú comía fe<!>&@#$%&!"#%&?¡?*liz cardillo y kiwi. La cigüeña ¨^;.-|°¬tocaba el saxofón detrás del palenque de paja';
First I convert the string to htmlentities just to make it easier to use later.
$friendlyURL = htmlentities($str, ENT_COMPAT, "UTF-8", false);
Then I replace latin characters with their corresponding ascii characters (
á
becomesa
,Ü
becomesU
, and so on):$friendlyURL = preg_replace('/&([a-z]{1,2})(?:acute|circ|lig|grave|ring|tilde|uml|cedil|caron);/i','\1',$friendlyURL);
Then I convert the string back from html entities to symbols, again for easier use later.
$friendlyURL = html_entity_decode($friendlyURL,ENT_COMPAT, "UTF-8");
Next I replace all non alphanumeric characters into hyphens.
$friendlyURL = preg_replace('/[^a-z0-9-]+/i', '-', $friendlyURL);
I remove extra hyphens inside the string:
$friendlyURL = preg_replace('/-+/', '-', $friendlyURL);
I remove leading and trailing hyphens:
$friendlyURL = trim($friendlyURL, '-');
And finally convert all into lowercase:
$friendlyURL = strtolower($friendlyURL);
All together:
Test:
Outcome:
I guess Gumbo's answer fits your problem better, and it's a shorter code, but I thought it would be useful for others.
Cheers,
Adriana
在函数中:
它的作用:
注意:
您可以通过传递数组来组合多个
preg_replace
。请参阅顶部的功能。例如:
编辑:在正则表达式末尾添加“/u”以使用 UTF8
编辑:考虑了重复和前导/尾随连字符,感谢@LuBre
In a function:
What it does:
Note:
You can combine multiple
preg_replace
by passing an array. See the function at the top.For example:
EDIT: added "/u" at the end of the regex to use UTF8
EDIT: accounted for duplicated and leading/trailing hyphens, thanks to @LuBre
如果您在 PHP 中使用它作为文件名,Gumbo 的答案将是
为文件名添加句点,它是
strtolower()
,而不是strtolowercase()
。If you're using this for filenames in PHP, the answer by Gumbo would be
Added a period for file names and it's
strtolower()
, notstrtolowercase()
.