Getting (text) in XPath

Posted 2024-10-27 07:29:13 · 325 characters · 1 view · 0 comments

I have the following DOM structure / HTML, and I want to get (just practicing...) the marked data.
[Image: screenshot of the DOM structure]

I want the text that sits under the h2 element. The div[@class="coordsAgence"] element has more div children below it and more h2 elements, so doing:

div[@class="coordsAgence"]

will get that value, but with additional unneeded text.
UPDATE: The value (from this example) that I basically want is the "GALLIER Dennis" text.

凉城已无爱 2024-11-03 07:29:13

It seems you want the first text node in that div:

div[@class="coordsAgence"]/text()[1]

should do it.

Note that this assumes that there is actually no whitespace between those comments inside <div class="coordsAgence">; otherwise that whitespace will constitute additional text nodes that you'll have to account for.
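The whitespace caveat can be seen in a small sketch. It uses lxml (an assumption, any XPath 1.0 engine should behave the same way) and invented markup, since the real page was only shown as a screenshot. The `text()[normalize-space()][1]` variant is one possible way to skip whitespace-only text nodes; it is not part of the answer above.

```python
from lxml import etree

# Hypothetical markup: the real page was only available as a screenshot.
COMPACT = (
    '<root><div class="coordsAgence"><!-- start --><h2>Agent</h2>'
    '<!-- name -->GALLIER Dennis<div>more</div> trailing</div></root>'
)
# Same content, but with line breaks and indentation around the comments.
SPACED = (
    '<root><div class="coordsAgence">\n  <!-- start -->\n  <h2>Agent</h2>\n'
    '  <!-- name -->GALLIER Dennis<div>more</div> trailing</div></root>'
)

# The expression from the answer, evaluated relative to the div's parent.
FIRST_TEXT = 'div[@class="coordsAgence"]/text()[1]'
# Variant that drops whitespace-only text nodes before taking the first one.
FIRST_NONBLANK = 'div[@class="coordsAgence"]/text()[normalize-space()][1]'


def select(markup, expr):
    """Parse the markup and evaluate expr with the root element as context."""
    return etree.fromstring(markup).xpath(expr)


compact_first = select(COMPACT, FIRST_TEXT)
spaced_first = select(SPACED, FIRST_TEXT)
spaced_nonblank = select(SPACED, FIRST_NONBLANK)

print(repr(compact_first))
print(repr(spaced_first))
print(repr(spaced_nonblank))
```

In the compact document the first text node is the wanted name. In the spaced document the first text node is the whitespace before the first comment, so the plain `text()[1]` no longer returns the name.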

韬韬不绝 2024-11-03 07:29:13

Get the first text node following the first h2 in the div with class "coordsAgence":

div[@class='coordsAgence']/h2[1]/following-sibling::text()[1]

Note that this first expression returns the first text node after the first h2 even when some other node appears between the two. If you want to return the text only when it's the node that immediately follows the first h2, then try something like this:

div[@class='coordsAgence']/h2[1][following-sibling::node()[1][self::text()]]/following-sibling::text()[1]
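The difference between the two expressions is easier to see side by side. The sketch below assumes lxml and uses made-up markup: in one document the text directly follows the first h2, in the other a `<br/>` element sits between them.

```python
from lxml import etree

# Hypothetical markup for illustration only.
ADJACENT = (
    "<root><div class='coordsAgence'><h2>Agent</h2>GALLIER Dennis"
    "<h2>Phone</h2>01 23 45</div></root>"
)
# Here a <br/> element sits between the first h2 and the text.
SEPARATED = (
    "<root><div class='coordsAgence'><h2>Agent</h2><br/>"
    "GALLIER Dennis</div></root>"
)

# First text node after the first h2, whatever lies in between.
LOOSE = "div[@class='coordsAgence']/h2[1]/following-sibling::text()[1]"
# Same, but only if the node right after the first h2 is a text node.
STRICT = (
    "div[@class='coordsAgence']/h2[1]"
    "[following-sibling::node()[1][self::text()]]"
    "/following-sibling::text()[1]"
)


def select(markup, expr):
    """Parse the markup and evaluate expr with the root element as context."""
    return etree.fromstring(markup).xpath(expr)


adjacent_loose = select(ADJACENT, LOOSE)
adjacent_strict = select(ADJACENT, STRICT)
separated_loose = select(SEPARATED, LOOSE)
separated_strict = select(SEPARATED, STRICT)

for name, result in [
    ("adjacent/loose", adjacent_loose),
    ("adjacent/strict", adjacent_strict),
    ("separated/loose", separated_loose),
    ("separated/strict", separated_strict),
]:
    print(name, repr(result))
```

Both expressions agree when the text immediately follows the h2. Once another node intervenes, the first expression still returns the text, while the stricter one returns an empty result.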

少女情怀诗 2024-11-03 07:29:13

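Using Python/Scrapy to get the text of an h1 tag (for example):

# Inside a Scrapy callback, where `response` is the page being parsed.
# 'class_name' is a placeholder for the real class names.
response.xpath(
    "//div[contains(@class, 'class_name')]//h1[contains(@class, 'class_name')]/text()"
).get()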
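One thing worth knowing about `contains(@class, ...)` is that it is a plain substring test, so it can match more elements than intended. The sketch below uses lxml rather than Scrapy so that it runs without a crawl (an assumption, as are the class names `title` and `subtitle-box`), and compares it with the common whole-token idiom built from `concat` and `normalize-space`.

```python
from lxml import etree

# Hypothetical markup; "title" and "subtitle-box" are made-up class names.
DOC = (
    "<root>"
    "<h1 class='title main'>Wanted</h1>"
    "<h1 class='subtitle-box'>Also matched by substring</h1>"
    "</root>"
)
root = etree.fromstring(DOC)

# Substring test: matches any class attribute containing the letters "title".
SUBSTRING = "//h1[contains(@class, 'title')]/text()"
# Token test: pads the attribute with spaces and looks for " title ".
TOKEN = (
    "//h1[contains(concat(' ', normalize-space(@class), ' '), ' title ')]"
    "/text()"
)

substring_hits = root.xpath(SUBSTRING)
token_hits = root.xpath(TOKEN)

print(substring_hits)
print(token_hits)
```

The substring test also picks up the `subtitle-box` heading, while the token test only returns the heading whose class list contains `title` as a whole word.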

About the author

南巷近海
