如何使正则表达式变成非贪婪的？

发布于 2024-09-01 16:58:23 字数 1092 浏览 19 评论 0原文

我正在使用 jQuery。我有一个带有特殊字符块（开始和结束）的字符串。我想从该特殊字符块中获取文本。我使用正则表达式对象进行字符串内查找。但是，当有两个或更多特殊字符时，如何告诉 jQuery 查找多个结果呢？

我的 HTML：

<div id="container">
    <div id="textcontainer">
     Cuộc chiến pháp lý giữa [|cơ thử|nghiệm|] thị trường [|test2|đây là test lần 2|] chứng khoán [|Mỹ|day la nuoc my|] và ngân hàng đầu tư quyền lực nhất Phố Wall mới chỉ bắt đầu.
    </div>
</div>

和我的 JavaScript 代码：

$(document).ready(function() {
  var takedata = $("#textcontainer").text();
  var test = 'abcd adddb';
  var filterdata = takedata.match(/(\[.+\])/);

  alert(filterdata); 

  //end write js 
});

我刚刚在互联网上搜索信息后完成了工作^^。我编写了这样的代码：

var filterdata = takedata.match(/(\[.*?\])/g);

我的结果是：[|cơ thử|nghiệm|],[|test2|đây là test lần 2|] 这是对的！但我不太明白这一点。你能回答我的原因吗？

原文

I'm using jQuery. I have a string with a block of special characters (begin and end). I want get the text from that special characters block. I used a regular expression object for in-string finding. But how can I tell jQuery to find multiple results when have two special character or more?

My HTML:

<div id="container">
    <div id="textcontainer">
     Cuộc chiến pháp lý giữa [|cơ thử|nghiệm|] thị trường [|test2|đây là test lần 2|] chứng khoán [|Mỹ|day la nuoc my|] và ngân hàng đầu tư quyền lực nhất Phố Wall mới chỉ bắt đầu.
    </div>
</div>

and my JavaScript code:

$(document).ready(function() {
  var takedata = $("#textcontainer").text();
  var test = 'abcd adddb';
  var filterdata = takedata.match(/(\[.+\])/);

  alert(filterdata); 

  //end write js 
});

I've just done my work after searching info on internet ^^. I make code like this:

var filterdata = takedata.match(/(\[.*?\])/g);

my result is : [|cơ thử|nghiệm|],[|test2|đây là test lần 2|]
this is right!. but I don't really understand this. Can you answer my why?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

动次打次papapa 2024-09-08 16:58:23

非贪婪的正则表达式修饰符就像它们的贪婪的对应部分一样，但紧随其后的是 ? ：

*  - zero or more
*? - zero or more (non-greedy)
+  - one or more
+? - one or more (non-greedy)
?  - zero or one
?? - zero or one (non-greedy)

The non-greedy regex modifiers are like their greedy counter-parts but with a ? immediately following them:

*  - zero or more
*? - zero or more (non-greedy)
+  - one or more
+? - one or more (non-greedy)
?  - zero or one
?? - zero or one (non-greedy)

回复收藏 0 原文

坐在坟头思考人生 2024-09-08 16:58:23

你是对的，贪婪是一个问题：

--A--Z--A--Z--
  ^^^^^^^^^^
     A.*Z

如果你想匹配 A--Z，你必须使用 A.*?Z （? 使 * “不情愿”，或懒惰）。

不过，有时有更好的方法可以做到这一点，例如，

A[^Z]*+Z

这使用否定字符类和所有格量词，以减少回溯，并且可能更有效。

在你的情况下，正则表达式将是：

/(\[[^\]]++\])/

不幸的是 Javascript正则表达式不支持所有格量词，所以你只需要这样做：

/(\[[^\]]+\])/

另请参阅

正则表达式.info/Repetition
- 参见：懒惰的替代方案
  - 所有格量词
- 口味比较

快速摘要

*   Zero or more, greedy
*?  Zero or more, reluctant
*+  Zero or more, possessive

+   One or more, greedy
+?  One or more, reluctant
++  One or more, possessive

?   Zero or one, greedy
??  Zero or one, reluctant
?+  Zero or one, possessive

请注意，不情愿和所有格量词也适用于有限重复 {n,m} 结构。

Java 中的示例：

System.out.println("aAoZbAoZc".replaceAll("A.*Z", "!"));  // prints "a!c"
System.out.println("aAoZbAoZc".replaceAll("A.*?Z", "!")); // prints "a!b!c"

System.out.println("xxxxxx".replaceAll("x{3,5}", "Y"));  // prints "Yx"
System.out.println("xxxxxx".replaceAll("x{3,5}?", "Y")); // prints "YY"

You are right that greediness is an issue:

--A--Z--A--Z--
  ^^^^^^^^^^
     A.*Z

If you want to match both A--Z, you'd have to use A.*?Z (the ? makes the * "reluctant", or lazy).

There are sometimes better ways to do this, though, e.g.

A[^Z]*+Z

This uses negated character class and possessive quantifier, to reduce backtracking, and is likely to be more efficient.

In your case, the regex would be:

/(\[[^\]]++\])/

Unfortunately Javascript regex doesn't support possessive quantifier, so you'd just have to do with:

/(\[[^\]]+\])/

Quick summary

*   Zero or more, greedy
*?  Zero or more, reluctant
*+  Zero or more, possessive

+   One or more, greedy
+?  One or more, reluctant
++  One or more, possessive

?   Zero or one, greedy
??  Zero or one, reluctant
?+  Zero or one, possessive

Note that the reluctant and possessive quantifiers are also applicable to the finite repetition {n,m} constructs.

Examples in Java:

System.out.println("aAoZbAoZc".replaceAll("A.*Z", "!"));  // prints "a!c"
System.out.println("aAoZbAoZc".replaceAll("A.*?Z", "!")); // prints "a!b!c"

System.out.println("xxxxxx".replaceAll("x{3,5}", "Y"));  // prints "Yx"
System.out.println("xxxxxx".replaceAll("x{3,5}?", "Y")); // prints "YY"

回复收藏 0 原文