使用 HtmlAgilityPack 从节点获取文本

发布于 2024-09-30 08:26:36 字数 403 浏览 8 评论 0原文

我有以下 HTML：

<div class="top">
    <p>Blah.</p>
    I want <em>this</em> text.
</div>

提取字符串“I Want this text.”的 XPath 表示法是什么？编辑：我不一定需要单个 XPath 表达式来提取字符串。选择多个节点并迭代它们以生成句子也很棒。

HtmlDocument doc = new HtmlDocument();
doc.LoadHtml(myHtml);
doc.DocumentNode.SelectSingleNode("??????");

原文

I have the following HTML:

<div class="top">
    <p>Blah.</p>
    I want <em>this</em> text.
</div>

What is the XPath notation to extract the string "I want <em>this</em> text."?
EDIT: I don't necessarily want a single XPath expression to extract the string. Selecting multiple nodes, and iterating over them to produce the sentence, would be great as well.

HtmlDocument doc = new HtmlDocument();
doc.LoadHtml(myHtml);
doc.DocumentNode.SelectSingleNode("??????");

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

极度宠爱 2024-10-07 08:26:36

/div[@class='top']/p[.='Blah.']/following-sibling::node()

或者

/div[@class='top']/node()[not(self::p)]

/div[@class='top']/p[.='Blah.']/following-sibling::node()

/div[@class='top']/node()[not(self::p)]

回复收藏 0 原文

吃不饱 2024-10-07 08:26:36

你想提取什么，节点还是字符串？

如果您需要节点，“I Want this text.” 是一个 XML 片段，由 两个文本节点和一个 < 文本节点组成。 em> 元素，它有一个文本节点子节点。由于它在顶层有多个节点，因此您需要使用 SelectNodes("xpath expression a la @Alejandro") 而不是 SelectSingleNode() 来提取它们。

如果你想要一个字符串，你再次需要使用 SelectNodes();然后迭代选定的节点并连接每个节点的outerHTML。请参阅此处了解类似的一个很好的例子。

另外，从您的示例中还不清楚什么 XPath 表达式通常会给您带来您想要的东西。例如，您想要

下的初始