使用正则表达式和标签提取部分文本

发布于 2024-09-18 20:50:42 字数 217 浏览 3 评论 0原文

所以我有一个带有特殊标签的文本文件，例如：

{A1}
Text 1
{A1}

{A2}
Text 2
{A2}

如何使用 reg-ex 从文本中提取 Text 2 或 Text 1 .. 部分？所以我要能够只提取标签 A1 之间的内容或仅提取标签 A2 之间的内容..不是全部...一次！谢谢！

原文

So I have a text file that is having special tags like:

{A1}
Text 1
{A1}

{A2}
Text 2
{A2}

How can I extract from the text using reg-ex the portion Text 2 or Text 1 ..?
So I what to be able to extract only what is between tags A1 or only what is between Tags A2 .. not all of them ... at once!
thanks!

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

甚是思念 2024-09-25 20:50:42

在 C# 中，您可以执行以下操作：


string output = Regex.Replace(YOUR_TEXT, @"\{(?<Tag>\w+).*?\}(?<text>\w+).*?\{\k<Tag>\}", "$2");

不支持嵌套标签。

In C# you can do something like this:


string output = Regex.Replace(YOUR_TEXT, @"\{(?<Tag>\w+).*?\}(?<text>\w+).*?\{\k<Tag>\}", "$2");

Nested tags are not suppoerted.

回复收藏 0 原文

哭了丶谁疼 2024-09-25 20:50:42

如果您假设文档格式良好并且标签未嵌套，则可以使用以下正则表达式来执行此操作：

@"({.*?})(.*)\1"

示例：

Regex regex = new Regex(@"({.*?})(.*?)\1", RegexOptions.Singleline);
foreach (Match match in regex.Matches(s)) {
    Console.WriteLine(match.Groups[2].Value.Trim());
}

输出：

Text 1
Text 2

You can do it with the following regular expression if you assume that the document is well-formed and that your tags are not nested:

@"({.*?})(.*)\1"

Example:

Regex regex = new Regex(@"({.*?})(.*?)\1", RegexOptions.Singleline);
foreach (Match match in regex.Matches(s)) {
    Console.WriteLine(match.Groups[2].Value.Trim());
}

Output: