Anything except a subexpression

Posted 2024-11-05 07:38:24


I am trying to write a regex in PHP that identifies relative src paths. My idea was to use a lookahead, (?=, followed by a negation, ^, applied to the subexpression (http), but this doesn't work. The ^ negation works for a single character, but not for a subexpression. Is there an && operator or something similar?

 <img.*?src=[\'\"]\(?=^(http))

I need it to match the entire string http; otherwise images whose paths start with h, t, or p will be wrongly excluded. Any suggestions? Is this task too big for regex?

风苍溪 2024-11-12 07:38:24

You can use a negative lookahead, (?!...), instead of the positive lookahead (?=...). For your example (I'd put the anchor at the start):

^(?!http)

This reads: at the start of the string, assert that the next characters are not "http". The lookahead consumes no characters; it only checks what follows.
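To see that only the full text "http" is rejected, and not any path that merely begins with h, t, or p, here is a minimal sketch using PHP's preg_grep. The sample paths are made up for illustration and are not from the question.

```php
<?php
// Hypothetical sample paths, chosen only for illustration.
$paths = [
    'http://example.com/a.png',
    'https://example.com/b.png',
    'images/c.png',
    'help.png',
    'top.png',
];

// Keep only the entries that do NOT start with the literal text "http".
// preg_grep preserves the original keys, so array_values re-indexes the result.
$relative = array_values(preg_grep('/^(?!http)/', $paths));

print_r($relative);
```

With these inputs, help.png and top.png are kept even though they start with h and t, because the lookahead tests the whole sequence "http" rather than individual characters.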

Edit: since you updated the question with a fuller example:

<img [^>]*src=['"](?!http)([^'"]+)['"]

                          ^------^ - this capturing group captures the link
                                     which doesn't start with http
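As a rough sketch of how this pattern might be used from PHP, the snippet below runs it through preg_match_all. The HTML fragment and the variable names are assumptions made for the example; only the pattern itself comes from the answer.

```php
<?php
// Hypothetical HTML fragment, for illustration only.
$html = <<<'HTML'
<img src="http://example.com/logo.png" alt="absolute">
<img class="thumb" src="images/photo.jpg">
<img src='help.png'>
HTML;

// The pattern from the answer. '~' is used as the delimiter so that '/'
// in paths would not need escaping; \' is just a quote inside a PHP
// single-quoted string.
$pattern = '~<img [^>]*src=[\'"](?!http)([^\'"]+)[\'"]~';

preg_match_all($pattern, $html, $matches);

// $matches[1] holds what the capturing group matched: the src values
// that do not start with "http".
print_r($matches[1]);
```

Note that the check is purely textual: https URLs are excluded because they also start with "http", while a protocol-relative URL such as //cdn.example.com/x.png would still be captured.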

Of course, for proper parsing you should use a DOM parser ;)
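One possible way to follow that advice in PHP is the built-in DOMDocument class. This is only a sketch: the HTML string is a made-up example, and it assumes the dom extension is enabled.

```php
<?php
// Hypothetical HTML fragment, for illustration only.
$html = '<div><img src="http://example.com/logo.png">'
      . '<img class="thumb" src="images/photo.jpg">'
      . '<img src="help.png"></div>';

// Collect parser warnings internally instead of emitting them,
// since real-world markup is often not strictly valid.
libxml_use_internal_errors(true);
$doc = new DOMDocument();
$doc->loadHTML($html);
libxml_clear_errors();

$relative = [];
foreach ($doc->getElementsByTagName('img') as $img) {
    $src = $img->getAttribute('src');
    // Same test as the regex answer: keep a src that does not start with "http".
    if ($src !== '' && preg_match('/^(?!http)/', $src)) {
        $relative[] = $src;
    }
}

print_r($relative);
```

Here the parser takes care of attribute order and quoting, so the regex only has to test the attribute value itself.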
