正向前瞻正则表达式令人困惑

发布于 2024-12-07 08:07:43 字数 660 浏览 2 评论 0原文

我正在构建这个正则表达式，并对其进行积极的展望。基本上，它必须选择行中直到“：”之前的最后一个句点的所有文本，并添加“|”到最后来划定它。下面是一些示例文本。我正在 gskinner 和 editpadpro 中进行测试，它们显然具有完整的 grep 正则表达式支持，因此如果我能从中得到答案，我将不胜感激。

下面的正则表达式在一定程度上有效，但我不确定它是否正确。如果文本包含括号，它也会下降。

最后，我想添加另一条忽略规则，例如忽略但包含“Co”的规则。在选择中。第二个忽略规则将忽略但包括前面有一个大写字母的句点。下面还有示例文本。感谢您的所有帮助。

^(?:[^|]+\|){3}(.*?)[^(?:Co)]\.(?=[^:]*?\:)

121| Ryan, T.N. |2001. |I like regex. But does it like me (2) 2: 615-631.
122| O' Toole, H.Y. |2004. |(Note on the regex). Pages 90-91 In: Ryan, A. & Toole, B.L. (Editors) Guide to the regex functionality in php. Timmy, Tommy& Stewie, Quohog. * Produced for Family Guy in Quohog.

原文

I'm building this regex with a positive look ahead in it. Basically it must select all text in the line up to last period that precedes a ":" and add a "|" to the end to delimit it. Some sample text below. I am testing this in gskinner and editpadpro which has full grep regex support apparently so if I could get the answers in that for I'd appreciate it.

The regex below works to a degree but I am unsure if it is correct. Also it falls down if the text contains brackets.

Finally I would like to add another ignore rule like the one that ignores but includes "Co." in the selection. This second ignore rule would ignore but include periods that have a single Capital letter before them. Sample text below too. Thanks for all the help.

^(?:[^|]+\|){3}(.*?)[^(?:Co)]\.(?=[^:]*?\:)

121| Ryan, T.N. |2001. |I like regex. But does it like me (2) 2: 615-631.
122| O' Toole, H.Y. |2004. |(Note on the regex). Pages 90-91 In: Ryan, A. & Toole, B.L. (Editors) Guide to the regex functionality in php. Timmy, Tommy& Stewie, Quohog. * Produced for Family Guy in Quohog.

分享到QQ

分享到微博