Incredibly basic lxml question: getting the HTML/string content of an lxml.etree._Element?

Posted 2024-10-24 15:37:11

This is such a basic question that I actually can't find it in the docs :-/

In the following:

img = house_tree.xpath('//img[@id="mainphoto"]')[0]

How do I get the HTML of the <img/> tag?

I've tried adding html_content() but get AttributeError: 'lxml.etree._Element' object has no attribute 'html_content'.

Also, if it were a tag with some content inside (e.g. <p>text</p>), how would I get the content (e.g. text)?

Many thanks!

南巷近海 2024-10-31 15:37:11

I suppose it will be as simple as:

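from lxml.etree import tostring
# tostring() serializes the element itself (tag, attributes, children), not just its inner content.
# It returns bytes by default and uses XML syntax unless method="html" is passed.
inner_html = tostring(img)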
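As a rough sketch of the serialization step, the snippet below builds a small stand-in document (the markup and the `house.jpg` file name are invented for illustration; only the `xpath` call comes from the question) and serializes the `<img>` element three ways. It assumes the page was parsed with `lxml.html`, though `etree.tostring` should behave the same on an element from any lxml parser.

```python
from lxml import etree, html

# Hypothetical page standing in for the asker's document.
page = (
    '<html><body><div>'
    '<img id="mainphoto" src="house.jpg"><p>text</p>'
    '</div></body></html>'
)
house_tree = html.fromstring(page)
img = house_tree.xpath('//img[@id="mainphoto"]')[0]

# Default encoding: the result is a bytes object.
# method="html" asks for HTML syntax instead of XML (<img ...> rather than <img .../>).
# with_tail=False leaves out any text that follows the element in its parent.
img_bytes = etree.tostring(img, method="html", with_tail=False)

# encoding="unicode" returns a str instead of bytes.
img_str = etree.tostring(img, method="html", encoding="unicode", with_tail=False)

# lxml.html.tostring uses method="html" by default.
img_html = html.tostring(img, encoding="unicode", with_tail=False)

print(img_str)
```

If the tail text is wanted, dropping `with_tail=False` restores the default behaviour.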

As for getting content from inside <p>, say, some selected element el:

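# text_content() is only available on elements parsed with lxml.html (HtmlElement).
# On a plain lxml.etree._Element, use el.text or ''.join(el.itertext()) instead.
content = el.text_content()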
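Since the error in the question mentions `lxml.etree._Element`, it may help to see how text extraction differs between the two parsers. The following is a minimal sketch using an invented snippet (the `<b>` child is added only to show how nested text is handled): `text_content()` is used on an element parsed with `lxml.html`, and `.text`, `itertext()` and the XPath `string()` function are used on one parsed with `lxml.etree`.

```python
from lxml import etree, html

# Hypothetical markup; the nested <b> shows how child text and tail text are handled.
snippet = '<div><p>text <b>bold</b> tail</p></div>'

# Parsed with lxml.html: elements are HtmlElement instances and offer text_content().
p_html = html.fromstring(snippet).xpath('.//p')[0]
all_text = p_html.text_content()

# Parsed with lxml.etree: plain _Element instances, which have no text_content().
p_xml = etree.fromstring(snippet).xpath('.//p')[0]
first_text = p_xml.text              # only the text before the first child element
joined = ''.join(p_xml.itertext())   # all text inside the element, children included
via_xpath = p_xml.xpath('string()')  # same result through XPath

print(repr(first_text))
print(repr(joined))
```

For the simple `<p>text</p>` case in the question, `el.text` alone would be enough; the other forms matter once the element has children.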
