如何从 python 中的 xmlNode 获取 xpathContext

发布于 2024-07-26 08:23:34 字数 902 浏览 6 评论 0原文

.net 上的 xpath 和 python 中的 sax 的忠实粉丝，但第一次在 python 中使用 xpath。

我有一个小脚本，它使用 xpath 从文档中选择一些节点，迭代它们，然后理想情况下再次使用 xpath 从中获取相关数据。然而，我无法得到最后一点，一旦我有了 xmlNode，我就无法从中获取上下文。

import libxml2
import urllib

doc = libxml2.parseDoc(
        urllib.urlopen('http://somemagicwebservice.com/').read())
ctxt = doc.xpathNewContext()
listitems = ctxt.xpathEval('//List/ListItem')
for item in listitems:
    itemctxt = item.xpathNewContext()
    title = itemctxt.xpathEval('//ItemAttributes/Title')
    asin = itemctxt.xpathEval('//Item/ASIN')
    itemctxc.xpathFreeContext()
ctxt.xpathFreeContext()
doc.freeDoc()

但是，itemctxt = item.xpathNewContext() 位失败，并显示

itemctxt = item.xpathNewContext()
AttributeError: xmlNode instance has no attribute 'xpathNewContext'

Any ideas how to use xpath on a xmlNode? 我在网上找不到任何好的信息。谢谢

原文

big fan of xpath on .net, and sax in python, but first time using xpath in python.

I have a small script, that uses xpath to select some nodes from a doc, iterates through them, and then ideally uses xpath again to get the relevant data from them. However I can't get that last bit, once I have the xmlNode I cannot get a context from it.

import libxml2
import urllib

doc = libxml2.parseDoc(
        urllib.urlopen('http://somemagicwebservice.com/').read())
ctxt = doc.xpathNewContext()
listitems = ctxt.xpathEval('//List/ListItem')
for item in listitems:
    itemctxt = item.xpathNewContext()
    title = itemctxt.xpathEval('//ItemAttributes/Title')
    asin = itemctxt.xpathEval('//Item/ASIN')
    itemctxc.xpathFreeContext()
ctxt.xpathFreeContext()
doc.freeDoc()

However the itemctxt = item.xpathNewContext() bit fails with

itemctxt = item.xpathNewContext()
AttributeError: xmlNode instance has no attribute 'xpathNewContext'

Any ideas how to use xpath on a xmlNode? I can't find any good online info.
Thanks

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

冰魂雪魄 2024-08-02 08:23:34

我认为 XPathContext 对元素没有意义？尝试创建一个新的 XPathContext，并将其节点设置为当前元素。

也就是说，我没有直接使用 libxml2，所以这有点疯狂的猜测。我通常使用 lxml，它公开了 libxml2 和 libxslt 周围的 ElementTree API。它更容易使用，并且确实允许在元素上使用 xpath() 。当然，如果您已经有很多使用 libxml2 的代码，您可能不想切换，但在这种情况下，您可能需要查看 lxmls 源代码以了解它是如何实现的。

http://codespeak.net/svn/lxml/trunk/src/ lxml/xpath.pxi

http://codespeak.net /svn/lxml/trunk/src/lxml/_elementpath.py

似乎是不错的起点。

回复收藏 0 原文

一百个冬季 2024-08-02 08:23:34

https://stackoverflow.com/a/3379708/288875 建议调用 setContextNode(..) 在新创建的上下文中：

itemctxt = doc.xpathNewContext()

for item in listitems:
    itemctxt.setContextNode(item)
    title = itemctxt.xpathEval('.//ItemAttributes/Title')

    ...

itemctxt.xpathFreeContext()

在我当前使用的 python libxml (2.9.1) 版本中，甚至可以调用：

item.xpathEval('.//ItemAttributes/Title')

请注意，您必须在 xpath 的开头添加一个点表达式 .// （而不是 //），否则您将获得相对于文档根目录的搜索结果。

https://stackoverflow.com/a/3379708/288875 suggests to call setContextNode(..) on a newly created context:

itemctxt = doc.xpathNewContext()

for item in listitems:
    itemctxt.setContextNode(item)
    title = itemctxt.xpathEval('.//ItemAttributes/Title')

    ...

itemctxt.xpathFreeContext()

In the version of python libxml (2.9.1) which I'm currently using it turns out that one can even call:

item.xpathEval('.//ItemAttributes/Title')

Note that you'll have to add a dot at the beginning of the xpath expressions .// (instead of //), otherwise you'll get search results relative to the document root.

回复收藏 0 原文

~没有更多了~