需要帮助理解 etree xml for python 使用标签结构

发布于 2024-12-06 06:54:11 字数 2688 浏览 2 评论 0 原文

我在理解 python etree 库读取 xml 文件时遇到一些问题。 我对 python 编程很陌生...所以 xml 对我来说有点古怪...

我在文件中有以下 xml 结构:

    <sss version="1.2">
    <date>2011-09-23</date>
    <time>12:32:29</time>
    <origin>OPST</origin>
    <user></user>
    <survey>
        <name>Test</name>
        <version>2011-09-02 15:50:10</version>
        <record ident="A">
            <variable ident="10" type="quantity">
                <name>v_682</name>
                <label>Another question</label>
                <position start="23" finish="24"/>
                <values>
                    <range from="0" to="32"/>
                </values>
            </variable>
            <variable ident="11" type="quantity">
                <name>v_683</name>
                <label>another totally another Question</label>
                <position start="25" finish="26"/>
                <values>
                    <range from="0" to="33"/>
                </values>
            </variable>
            <variable ident="12" type="quantity">
                <name>v_684</name>
                <label>And once more Question</label>
                <position start="27" finish="29"/>
                <values>
                    <range from="0" to="122"/>
                </values>
            </variable>
            <variable ident="20" type="single">
                <name>v_685</name>
                <label>Question with alternatives</label>
                <position start="73" finish="73"/>
                <values>
                    <range from="1" to="6"/>
                    <value code="1">Alternative 1</value>
                    <value code="2">Alternative 2</value>
                    <value code="3">Alternative 3</value>
                    <value code="6">Alternative 4</value>
                </values>
            </variable>
        </record>
    </survey>
</sss>

为了读取元素,我开发了一个非常糟糕的循环,它与 etree 的功能不匹配我猜库...

from xml.etree.cElementTree import parse
et = parse(open('scheme.xml','rb'))
root = et.getroot()
for i in range(4):
    a= str(root[4][2][i][0].text)
    if a.startswith('v'):
        print root[4][2][i][1].text 

我如何利用标签结构:例如读取“值”标签以将文本附加到列表中?

对我来说,这些 etree 教程很难读...也许有人可以向我展示如何使用基于标签的搜索...?这些循环是如此脆弱...... 非常感谢

i'm having some problems to unterstand the python etree library to read a xml file.
I pretty new with programming in python...so xml is kind of kinky for me...

I have the following xml structure in a file:

    <sss version="1.2">
    <date>2011-09-23</date>
    <time>12:32:29</time>
    <origin>OPST</origin>
    <user></user>
    <survey>
        <name>Test</name>
        <version>2011-09-02 15:50:10</version>
        <record ident="A">
            <variable ident="10" type="quantity">
                <name>v_682</name>
                <label>Another question</label>
                <position start="23" finish="24"/>
                <values>
                    <range from="0" to="32"/>
                </values>
            </variable>
            <variable ident="11" type="quantity">
                <name>v_683</name>
                <label>another totally another Question</label>
                <position start="25" finish="26"/>
                <values>
                    <range from="0" to="33"/>
                </values>
            </variable>
            <variable ident="12" type="quantity">
                <name>v_684</name>
                <label>And once more Question</label>
                <position start="27" finish="29"/>
                <values>
                    <range from="0" to="122"/>
                </values>
            </variable>
            <variable ident="20" type="single">
                <name>v_685</name>
                <label>Question with alternatives</label>
                <position start="73" finish="73"/>
                <values>
                    <range from="1" to="6"/>
                    <value code="1">Alternative 1</value>
                    <value code="2">Alternative 2</value>
                    <value code="3">Alternative 3</value>
                    <value code="6">Alternative 4</value>
                </values>
            </variable>
        </record>
    </survey>
</sss>

to read elements i developed a pretty bad loop, that does not match the capabilities of the etree library i guess...

from xml.etree.cElementTree import parse
et = parse(open('scheme.xml','rb'))
root = et.getroot()
for i in range(4):
    a= str(root[4][2][i][0].text)
    if a.startswith('v'):
        print root[4][2][i][1].text 

How can i make use of the tag structure: for instance to read the "value" tag for appending the text into a list?

For me these etree tutorials are pretty difficult to gasp...maybe someone can show me how to use a tag based search...? These loops are so fragile...
Thanx a lot

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(1

云之铃。 2024-12-13 06:54:11

如果文件很小并且您只需要 元素:

#!/usr/bin/env python
import xml.etree.cElementTree as etree

tree = etree.parse('scheme.xml')
for value in tree.getiterator(tag='value'):
    print value.get('code'), value.text

如果文件很大:

def getelements(filename, tag):
    context = iter(etree.iterparse(filename, events=('start', 'end')))
    _, root = next(context) # get root element
    for event, elem in context:
        if event == 'end' and elem.tag == tag:
            yield elem
            root.clear() # free memory

for elem in getelements('scheme.xml', 'value'):
    print elem.get('code'), elem.text

输出

1 Alternative 1
2 Alternative 2
3 Alternative 3
6 Alternative 4

要了解更多信息,请阅读 搜索子元素ElementTree XML API

If the file is small and you only need <value/> elements:

#!/usr/bin/env python
import xml.etree.cElementTree as etree

tree = etree.parse('scheme.xml')
for value in tree.getiterator(tag='value'):
    print value.get('code'), value.text

If the file is large:

def getelements(filename, tag):
    context = iter(etree.iterparse(filename, events=('start', 'end')))
    _, root = next(context) # get root element
    for event, elem in context:
        if event == 'end' and elem.tag == tag:
            yield elem
            root.clear() # free memory

for elem in getelements('scheme.xml', 'value'):
    print elem.get('code'), elem.text

Output

1 Alternative 1
2 Alternative 2
3 Alternative 3
6 Alternative 4

To find out more, read Searching for Subelements and The ElementTree XML API.

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文