How do I exclude newlines from a regular expression character class?

Posted on 2024-11-03 05:24:10, 647 characters, 1 view, 0 comments

Given this PCRE pattern:

/(<name>[^<>]*<\/name>[^<>]*<phone>[^<>]*<\/phone>)/

And this subject text:

<name>John Stevens</name>  <phone>888-555-1212</phone>
<name>Peter Wilson</name>  
<phone>888-555-2424</phone>

How can I get the regular expression to match the first name-phone pair but not the second? I don't want to match pairs that are separated by line breaks. I tried adding an end-of-line anchor to the negated character class, as in [^<>$]*, but nothing changed.

You can use the following online tools to test your expressions:
http://rubular.com/
http://www.regextester.com/
Thank you.

山川志 2024-11-10 05:24:10

I think this will do it:

/<name>[^<>]*<\/name>[^<>\r\n]*<phone>[^<>]*<\/phone>/

Whatever you put inside a character class [ ] must represent a single character. Inside a class, $ is interpreted as a literal $, probably because $ as an end-of-line anchor is zero-width and so cannot act as an anchor there. (Edited after a comment by ridgerunner.)

By the way, I removed the parentheses surrounding your regex, because whatever it matches is already available as the whole match.
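
To make the difference concrete, here is a minimal sketch in Python. It assumes Python's re module as a stand-in for PCRE (the two agree on the syntax used here), and the names SUBJECT, with_dollar, no_newline and find_pairs are made up for illustration.

```python
import re

# The subject text from the question.
SUBJECT = (
    "<name>John Stevens</name>  <phone>888-555-1212</phone>\n"
    "<name>Peter Wilson</name>  \n"
    "<phone>888-555-2424</phone>\n"
)

# Original attempt: "$" inside a class is a literal dollar sign, so
# newlines are still allowed between </name> and <phone>.
with_dollar = re.compile(r"<name>[^<>]*</name>[^<>$]*<phone>[^<>]*</phone>")

# Fix: list the newline characters themselves in the negated class.
no_newline = re.compile(r"<name>[^<>]*</name>[^<>\r\n]*<phone>[^<>]*</phone>")


def find_pairs(pattern, text=SUBJECT):
    """Return every full match of pattern in text."""
    return [m.group(0) for m in pattern.finditer(text)]


if __name__ == "__main__":
    print(len(find_pairs(with_dollar)))  # both pairs still match
    print(find_pairs(no_newline))        # only the pair on a single line
```

With the sample text, the first pattern still finds both pairs, while the second finds only the pair that sits on one line.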

别在捏我脸啦 2024-11-10 05:24:10

If you don't want to match pairs separated by line breaks, then the following regex will do the job:

/(<name>[^<>]*<\/name>.*?<phone>[^<>]*<\/phone>)/

This matches only the first name-phone pair, because the dot . does not match a newline by default, whereas [^<>] does.

Tested on http://rubular.com/r/amXvq20sl8
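
As a rough check of the dot behaviour, here is a small Python sketch. It assumes Python's re module rather than PCRE or Ruby, and the names SUBJECT, TAGGED, lazy_dot, lazy_dot_dotall and pairs are hypothetical.

```python
import re

# The subject text from the question.
SUBJECT = (
    "<name>John Stevens</name>  <phone>888-555-1212</phone>\n"
    "<name>Peter Wilson</name>  \n"
    "<phone>888-555-2424</phone>\n"
)

# A made-up line with an extra element between the name and the phone.
TAGGED = "<name>A</name><note>x</note><phone>1</phone>"

# By default the dot stops at a newline, so the gap cannot span lines.
lazy_dot = re.compile(r"(<name>[^<>]*</name>.*?<phone>[^<>]*</phone>)")

# Same pattern with DOTALL: the dot now matches newlines as well.
lazy_dot_dotall = re.compile(lazy_dot.pattern, re.DOTALL)


def pairs(pattern, text=SUBJECT):
    """Return the captured pair for every match of pattern in text."""
    return pattern.findall(text)


if __name__ == "__main__":
    print(len(pairs(lazy_dot)))         # single-line pair only
    print(len(pairs(lazy_dot_dotall)))  # both pairs
    print(bool(lazy_dot.search(TAGGED)))
```

One design point to keep in mind: unlike [^<>\r\n]*, the .*? gap also accepts angle brackets, so the TAGGED line matches even though another element sits between the name and the phone.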

剪不断理还乱 2024-11-10 05:24:10

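Those sites don't seem to support the full PCRE syntax. I used this site instead:
http://lumadis.be/regex/test_regex.php

This worked:

/^(<name>[^<>]*<\/name>[^<>$]*<phone>[^<>]*<\/phone>)/

but this is probably better:

/(?-s)(<name>[^<>]*<\/name>.*<phone>[^<>]*<\/phone>)/

Note that in the first pattern the $ inside the class is a literal dollar sign, so [^<>$]* still matches newlines; it rejects the second pair only because ^, without the multiline modifier, anchors the match to the very start of the subject. The second pattern uses (?-s) to make sure the dot does not match newlines.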
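
The two patterns can be compared with a short Python sketch. This assumes Python 3.6 or later and its re module, not PCRE: Python does not accept a bare (?-s), so the scoped form (?-s:...) is swapped in here. The names SUBJECT, anchored, no_dotall and count are hypothetical.

```python
import re

# The subject text from the question.
SUBJECT = (
    "<name>John Stevens</name>  <phone>888-555-1212</phone>\n"
    "<name>Peter Wilson</name>  \n"
    "<phone>888-555-2424</phone>\n"
)

# Pattern 1 from the answer. "$" in the class is a literal dollar sign.
anchored = r"^(<name>[^<>]*</name>[^<>$]*<phone>[^<>]*</phone>)"

# Pattern 2, with the dot-matches-newline flag switched off for the group.
# Python needs the scoped form (?-s:...) rather than a bare (?-s).
no_dotall = r"(?-s:(<name>[^<>]*</name>.*<phone>[^<>]*</phone>))"


def count(pattern, flags=0, text=SUBJECT):
    """Count the matches of pattern in text under the given flags."""
    return len(re.findall(pattern, text, flags))


if __name__ == "__main__":
    print(count(anchored))                # ^ anchors at the start of the subject only
    print(count(anchored, re.MULTILINE))  # ^ now also matches at each line start
    print(count(no_dotall, re.DOTALL))    # inner (?-s:...) overrides DOTALL
```

Under these assumptions the anchored pattern finds one pair without re.MULTILINE and two with it, which suggests the anchor, not the $ in the class, is doing the work. The second pattern finds one pair even when DOTALL is set globally.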

About the author: 路弥
