Improving my regex skills

Posted 2024-07-26 19:25:56

I've been wanting to improve my regex skills for quite some time, and "Mastering Regular Expressions" was recommended quite a few times, so I bought it and have been reading it over the past day or so.

I have created the following regular expression:

^(?:<b>)?(?:^<i>)?<a href="/site\.php\?id=([0-9]*)">(.*?) \(([ a-z0-9]{2,10})\)</a>(?:^</i>)?(?:</b>)?$

It matches the first two links below but ignores the two enclosed in an <i> tag.
It extracts the id, title and type.

<a href="/site.php?id=6321">site 1 title (type 1)</a>
<b><a href="/site.php?id=10254">site 2 title (type 2)</a></b>

<i><a href="/site.php?id=5479">site 3 title (type 3)</a></i>
<b><i><a href="/site.php?id=325">site 4 title (type 4)</a></i></b>

Although it works, it seems fairly long for something so simple. Could it be improved?
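For anyone who wants to reproduce this, here is a minimal Python sketch that runs the pattern against the four sample lines. The first pattern is copied from the question. The second, `SHORTER`, is only a hypothetical candidate that drops the two `<i>` groups; it is not a confirmed answer. The helper name `extract` and the choice of Python's `re` module are also assumptions made for illustration.

```python
import re

# Pattern copied verbatim from the question.
ORIGINAL = re.compile(
    r'^(?:<b>)?(?:^<i>)?<a href="/site\.php\?id=([0-9]*)">'
    r'(.*?) \(([ a-z0-9]{2,10})\)</a>(?:^</i>)?(?:</b>)?$'
)

# Hypothetical shorter candidate (an assumption, not from the question):
# identical except that the two optional <i> groups are removed.
SHORTER = re.compile(
    r'^(?:<b>)?<a href="/site\.php\?id=([0-9]*)">'
    r'(.*?) \(([ a-z0-9]{2,10})\)</a>(?:</b>)?$'
)

# The four sample lines from the question.
SAMPLES = [
    '<a href="/site.php?id=6321">site 1 title (type 1)</a>',
    '<b><a href="/site.php?id=10254">site 2 title (type 2)</a></b>',
    '<i><a href="/site.php?id=5479">site 3 title (type 3)</a></i>',
    '<b><i><a href="/site.php?id=325">site 4 title (type 4)</a></i></b>',
]


def extract(pattern, line):
    """Return (id, title, type) if the whole line matches, else None."""
    m = pattern.match(line)
    return m.groups() if m else None


for line in SAMPLES:
    # Print both results side by side so any disagreement is easy to spot.
    print(extract(ORIGINAL, line), extract(SHORTER, line))
```

On these four lines both patterns give the same results: the first two links are captured and the two `<i>` links are rejected. They are not equivalent in general, though. The original still accepts a line that starts with `<i>` but has no closing `</i>`, because `(?:^<i>)?` can match at the very start of the string, while `(?:^</i>)?` can never match after the link.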
