XPath for selecting multiple HTML `a` elements

Posted 2024-12-19 00:45:01

I'm pretty new to XPath and couldn't figure it out looking at other solutions.

What I'm trying to do is select all the `a` elements inside a given `td` (`td[2]` in the example) and run a for loop to output the text contained in each `a` element.

Source code:

multiple = HTML.ElementFromURL(url).xpath('//table[contains(@class, "mg-b20")]/tr[3]/td[2]/*[self::a]')

for item in multiple:
    Log("text = %s" %item.text)

Any pointers on how I can make this work?

Thanks!

放飞的风筝 2024-12-26 00:45:01

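Your XPath is pretty close. `/*[self::a]` matches only `a` elements that are direct children of the `td`, so if the links are nested deeper inside the cell, use `//a` to match them at any depth: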

//table[contains(@class, "mg-b20")]/tr[3]/td[2]//a
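To see the difference between the child step and the descendant step, here is a minimal sketch. It uses the standard library's `xml.etree.ElementTree` as a stand-in for lxml, which is an assumption made only so the example runs without extra packages. ElementTree supports just a subset of XPath, so the class filter is an exact match instead of `contains()`, and the sample markup is hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup: the second cell of the third row holds one <a> that is
# a direct child of the cell and one <a> nested inside a <p>.
SAMPLE = """
<html><body>
<table class="mg-b20">
  <tr><td>r1c1</td><td>r1c2</td></tr>
  <tr><td>r2c1</td><td>r2c2</td></tr>
  <tr>
    <td>r3c1</td>
    <td><a href="#1">direct</a><p><a href="#2">nested</a></p></td>
  </tr>
</table>
</body></html>
"""

root = ET.fromstring(SAMPLE)
cell_path = ".//table[@class='mg-b20']/tr[3]/td[2]"

# "/a" matches only anchors that are direct children of the cell.
direct_only = [a.text for a in root.findall(cell_path + "/a")]

# "//a" matches anchors at any depth below the cell.
all_anchors = [a.text for a in root.findall(cell_path + "//a")]

print(direct_only)
print(all_anchors)
```

With lxml the same comparison can be made with the full expressions from the question and the answer, since lxml implements XPath 1.0 including `contains()` and `self::a`.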

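I don't know which library you're using, but I suspect it is the Plex Parsekit API. If so, parsekit uses lxml.etree as its underlying library, so you can call the XPath string() function on each matched element to get its full text instead of reading .text:

element = HTML.ElementFromURL(url)
anchors = element.xpath('//table[contains(@class, "mg-b20")]/tr[3]/td[2]//a')

for item in anchors:
    # string(.) returns all text inside this anchor, including text in child elements.
    # Calling string() on the whole node-set would return only the first anchor's text.
    Log("text = %s" % item.xpath('string(.)'))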

This also takes care of corner cases such as mixed content, for example:

<a href="#">I am anchor text <span>But I am too and am not in Element.text</span> and I am in Element.tail</a>
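The mixed-content case can be checked with a short sketch. It parses the anchor above with the standard library's `xml.etree.ElementTree` (an assumption chosen so the example has no dependencies) and compares `.text` with the concatenation of `itertext()`, which plays the same role here as `string(.)` does in lxml.

```python
import xml.etree.ElementTree as ET

# The anchor from the example above: text, then a child <span>, then tail text.
anchor = ET.fromstring(
    '<a href="#">I am anchor text <span>But I am too and am not in '
    'Element.text</span> and I am in Element.tail</a>'
)

# .text holds only the text before the first child element.
text_only = anchor.text

# itertext() yields the text and tail of every descendant in document order,
# so joining it gives the full visible text of the anchor.
full_text = "".join(anchor.itertext())

print(repr(text_only))
print(repr(full_text))
```

Reading only `.text` would silently drop everything from the `span` onward, which is why taking the string value of each anchor is the safer choice when links may contain nested markup.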

About the author: 丶情人眼里出诗心の
