如何使用 Perl 删除块注释？

发布于 2024-07-24 09:57:11 字数 266 浏览 12 评论 0原文

我正在开发一个分析 DSL 的预处理器。我的目标是删除评论。块注释工具前后由 %% 划分。根据语言的定义，我不必担心 %% 出现在字符串中。

我正在使用这个 s/// 正则表达式。不幸的是，它似乎匹配了一切并将其抹掉：

#Remove multiline comments.
$text_string =~ s/%%.*%%//msg;

我做错了什么？

原文

I am working on a preprocessor that is analyzing a DSL. My goal is to remove the comments.
The block comment facility is demarcated by %% before and after. I do not have to worry about %% being in strings, by the definition of the language.

I am using this s/// regex. Unfortunately, it seems to match everything and wipe it out:

#Remove multiline comments.
$text_string =~ s/%%.*%%//msg;

What am I doing wrong?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

三寸金莲 2024-07-31 09:57:11

你能做的第一件事就是让它变得非贪婪：

.*?

否则，

%%一些文字%%
真实内容
%%其他文本%%

将全部被擦除。

the first thing you can do is make it non-greedy:

.*?

otherwise,

%% some text %%
real content
%% other text %%

will all be wiped out.

回复收藏 0 原文

撩起发的微风 2024-07-31 09:57:11

来自 perlfaq6：正则表达式贪婪意味着什么？我该如何解决这个问题？

大多数人的意思是贪婪的正则表达式会尽可能匹配。从技术上来说，实际上是量词（?、*、+、{}）是贪婪的，而不是整个模式； Perl 更喜欢局部贪婪和即时满足，而不是整体贪婪。要获取相同量词的非贪婪版本，请使用 (??, *?, +?, {}?)。

一个例子：

$s1 = $s2 = "I am very very cold";
$s1 =~ s/ve.*y //;      # I am cold
$s2 =~ s/ve.*?y //;     # I am very cold

注意第二个替换是如何在遇到“y”时立即停止匹配的。这 *？量词有效地告诉正则表达式引擎尽快找到匹配项，并将控制权传递给下一个队列，就像您在玩烫手山芋一样。

From perlfaq6: What does it mean that regexes are greedy? How can I get around it?

Most people mean that greedy regexes match as much as they can. Technically speaking, it's actually the quantifiers (?, *, +, {}) that are greedy rather than the whole pattern; Perl prefers local greed and immediate gratification to overall greed. To get non-greedy versions of the same quantifiers, use (??, *?, +?, {}?).

An example:

$s1 = $s2 = "I am very very cold";
$s1 =~ s/ve.*y //;      # I am cold
$s2 =~ s/ve.*?y //;     # I am very cold

Notice how the second substitution stopped matching as soon as it encountered "y ". The *? quantifier effectively tells the regular expression engine to find a match as quickly as possible and pass control on to whatever is next in line, like you would if you were playing hot potato.

回复收藏 0 原文