SimpleXML& PHP:提取XML文档的一部分&转换为数组

发布于 2024-11-28 16:20:30 字数 3678 浏览 0 评论 0原文

考虑以下 XML:

<?xml version="1.0" encoding="UTF-8"?>
<OS>
    <data>
        <OSes>
            <centos>
                <v_5>
                    <i386>
                        <id>centos5-32</id>
                        <name>CentOS 5 - 32 bit</name>
                        <version>5</version>
                        <architecture>32</architecture>
                        <os>centos</os>
                    </i386>
                    <x86_64>
                        <id>centos5-64</id>
                        <name>CentOS 5 - 64 bit</name>
                        <version>5</version>
                        <architecture>64</architecture>
                        <os>centos</os>
                    </x86_64>
                </v_5>
                <v_6>
                    <i386>
                        <id>centos6-32</id>
                        <name>CentOS 6 - 32 bit</name>
                        <version>6</version>
                        <architecture>32</architecture>
                        <os>centos</os>
                    </i386>
                    <x86_64>
                        <id>centos6-64</id>
                        <name>CentOS 6 - 64 bit</name>
                        <version>6</version>
                        <architecture>64</architecture>
                        <os>centos</os>
                    </x86_64>
                </v_6>
            </centos>
            <ubuntu>
                <v_10>
                    <i386>
                        <id>ubuntu10-32</id>
                        <name>Ubuntu 10 - 32 bit</name>
                        <version>10</version>
                        <architecture>32</architecture>
                        <os>ubuntu</os>
                    </i386>
                    <amd64>
                        <id>ubuntu10-64</id>
                        <name>Ubuntu 10 - 64 bit</name>
                        <version>10</version>
                        <architecture>64</architecture>
                        <os>ubuntu</os>
                    </amd64>
                </v_10>
            </ubuntu>
        </OSes>
    </data>
</OS>

从上面的 XML 文档中,我想提取以下 5 个元素节点

并将它们作为数组。我尝试执行以下操作:

<?php 
require_once "xml.php";

    try {
        $xml = new SimpleXMLElement($xmlstr);
        foreach($xml->xpath(' //id | //name | //version// | //architecture | //os ') as $record) {
        echo $record;
    }
    } catch(Exception $e){
        echo $e->getMessage();
    }

上面的代码有效,但每个记录都是一个单独的对象。我希望有人将所有 5 个元素节点合并为一个数组元素。像这样的事情:

$osList = Array( [0] => Array(
                               ["id"] => "<id>",
                               ["name"] => "<name>",
                               ["version"] => "<version>",
                               ....
)
 .....
);

语法不正确,但你明白了。知道怎么做吗?

Consider the following XML:

<?xml version="1.0" encoding="UTF-8"?>
<OS>
    <data>
        <OSes>
            <centos>
                <v_5>
                    <i386>
                        <id>centos5-32</id>
                        <name>CentOS 5 - 32 bit</name>
                        <version>5</version>
                        <architecture>32</architecture>
                        <os>centos</os>
                    </i386>
                    <x86_64>
                        <id>centos5-64</id>
                        <name>CentOS 5 - 64 bit</name>
                        <version>5</version>
                        <architecture>64</architecture>
                        <os>centos</os>
                    </x86_64>
                </v_5>
                <v_6>
                    <i386>
                        <id>centos6-32</id>
                        <name>CentOS 6 - 32 bit</name>
                        <version>6</version>
                        <architecture>32</architecture>
                        <os>centos</os>
                    </i386>
                    <x86_64>
                        <id>centos6-64</id>
                        <name>CentOS 6 - 64 bit</name>
                        <version>6</version>
                        <architecture>64</architecture>
                        <os>centos</os>
                    </x86_64>
                </v_6>
            </centos>
            <ubuntu>
                <v_10>
                    <i386>
                        <id>ubuntu10-32</id>
                        <name>Ubuntu 10 - 32 bit</name>
                        <version>10</version>
                        <architecture>32</architecture>
                        <os>ubuntu</os>
                    </i386>
                    <amd64>
                        <id>ubuntu10-64</id>
                        <name>Ubuntu 10 - 64 bit</name>
                        <version>10</version>
                        <architecture>64</architecture>
                        <os>ubuntu</os>
                    </amd64>
                </v_10>
            </ubuntu>
        </OSes>
    </data>
</OS>

From the XML document above, I want to extract following 5 element node

  1. <id>
  2. <name>
  3. <version>
  4. <architecture>
  5. <os>

And have them as a array. I tried doing the following:

<?php 
require_once "xml.php";

    try {
        $xml = new SimpleXMLElement($xmlstr);
        foreach($xml->xpath(' //id | //name | //version// | //architecture | //os ') as $record) {
        echo $record;
    }
    } catch(Exception $e){
        echo $e->getMessage();
    }

the above code works but each record is an separate object. I want someone to consolidate all 5 elements nodes as one array element. something like this:

$osList = Array( [0] => Array(
                               ["id"] => "<id>",
                               ["name"] => "<name>",
                               ["version"] => "<version>",
                               ....
)
 .....
);

syntax isn't correct but you get the idea. any idea how to do this?

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(3

风柔一江水 2024-12-05 16:20:30

这可能有帮助

$obj = new SimpleXMLElement($xml);
$rtn = array();
$cnt = 0;
foreach($obj->xpath('///OSes/*/*') as $rec)
{
  foreach ($rec as $rec_obj)
  {
    if (!isset($rtn[$cnt]))
    {
      $rtn[$cnt] = array();
    }

    foreach ($rec_obj as $name=>$val)
    {
      $rtn[$cnt][(string)$name] = (string)$val;
    }
    ++$cnt;
  }
}

this might help

$obj = new SimpleXMLElement($xml);
$rtn = array();
$cnt = 0;
foreach($obj->xpath('///OSes/*/*') as $rec)
{
  foreach ($rec as $rec_obj)
  {
    if (!isset($rtn[$cnt]))
    {
      $rtn[$cnt] = array();
    }

    foreach ($rec_obj as $name=>$val)
    {
      $rtn[$cnt][(string)$name] = (string)$val;
    }
    ++$cnt;
  }
}
っ左 2024-12-05 16:20:30

通过按照其他人的建议修改 xpath,我得出了这个结论。它与一个辅助函数配合使用来重新格式化每个 xpath 结果节点,并使用 array_reduce 来迭代结果。然后它返回转换后的结果(Demo):

$xml = new SimpleXMLElement($xmlstr);
$elements = array_reduce(
    $xml->xpath('//OSes/*/*'),
    function($v, $w) {
        $w = array_values((array) $w); // convert result to array
        foreach($w as &$d) $d = (array) $d; // convert inner elements to array
        return array_merge($v, $w); // merge with existing
    }, 
    array() // empty elements at start
);

输出:

Array
(
    [0] => Array
        (
            [id] => centos5-32
            [name] => CentOS 5 - 32 bit
            [version] => 5
            [architecture] => 32
            [os] => centos
        )

    [1] => Array
        (
            [id] => centos5-64
            [name] => CentOS 5 - 64 bit
            [version] => 5
            [architecture] => 64
            [os] => centos
        )

    [2] => Array
        (
            [id] => centos6-32
            [name] => CentOS 6 - 32 bit
            [version] => 6
            [architecture] => 32
            [os] => centos
        )

    [3] => Array
        (
            [id] => centos6-64
            [name] => CentOS 6 - 64 bit
            [version] => 6
            [architecture] => 64
            [os] => centos
        )

    [4] => Array
        (
            [id] => ubuntu10-32
            [name] => Ubuntu 10 - 32 bit
            [version] => 10
            [architecture] => 32
            [os] => ubuntu
        )

    [5] => Array
        (
            [id] => ubuntu10-64
            [name] => Ubuntu 10 - 64 bit
            [version] => 10
            [architecture] => 64
            [os] => ubuntu
        )

)

我还选择转换原始 xpath 结果放入两个级别的数组中,每次在当前级别内已经存在一个键时,将当前条目移动到新条目(Demo< /a>):

try
{
    $xml = new SimpleXMLElement($xmlstr);
    $elements = array();
    $curr = NULL;
    foreach($xml->xpath('//id | //name | //version | //architecture | //os') as $record)
    {
        $key = $record->getName();
        $value = (string) $record;
        if (!$curr || array_key_exists($key, $curr)) {
            unset($curr);
            $curr = array();
            $elements[] = &$curr;
        }
        $curr[$key] = $value;
    }
    unset($curr);
}
catch(Exception $e)
{
    echo $e->getMessage();
}

结果那么是这样的:

Array
(
    [0] => Array
        (
            [id] => centos5-32
            [name] => CentOS 5 - 32 bit
            [version] => 5
            [architecture] => 32
            [os] => centos
        )

    [1] => Array
        (
            [id] => centos5-64
            [name] => CentOS 5 - 64 bit
            [version] => 5
            [architecture] => 64
            [os] => centos
        )

    [2] => Array
        (
            [id] => centos6-32
            [name] => CentOS 6 - 32 bit
            [version] => 6
            [architecture] => 32
            [os] => centos
        )

    [3] => Array
        (
            [id] => centos6-64
            [name] => CentOS 6 - 64 bit
            [version] => 6
            [architecture] => 64
            [os] => centos
        )

    [4] => Array
        (
            [id] => ubuntu10-32
            [name] => Ubuntu 10 - 32 bit
            [version] => 10
            [architecture] => 32
            [os] => ubuntu
        )

    [5] => Array
        (
            [id] => ubuntu10-64
            [name] => Ubuntu 10 - 64 bit
            [version] => 10
            [architecture] => 64
            [os] => ubuntu
        )

)

By modifying the xpath as others suggested as well, I came to this conclusion. It works with one helper function to re-format each xpath result node and uses array_reduce to iterate over the result. It then returns the converted result (Demo):

$xml = new SimpleXMLElement($xmlstr);
$elements = array_reduce(
    $xml->xpath('//OSes/*/*'),
    function($v, $w) {
        $w = array_values((array) $w); // convert result to array
        foreach($w as &$d) $d = (array) $d; // convert inner elements to array
        return array_merge($v, $w); // merge with existing
    }, 
    array() // empty elements at start
);

Output:

Array
(
    [0] => Array
        (
            [id] => centos5-32
            [name] => CentOS 5 - 32 bit
            [version] => 5
            [architecture] => 32
            [os] => centos
        )

    [1] => Array
        (
            [id] => centos5-64
            [name] => CentOS 5 - 64 bit
            [version] => 5
            [architecture] => 64
            [os] => centos
        )

    [2] => Array
        (
            [id] => centos6-32
            [name] => CentOS 6 - 32 bit
            [version] => 6
            [architecture] => 32
            [os] => centos
        )

    [3] => Array
        (
            [id] => centos6-64
            [name] => CentOS 6 - 64 bit
            [version] => 6
            [architecture] => 64
            [os] => centos
        )

    [4] => Array
        (
            [id] => ubuntu10-32
            [name] => Ubuntu 10 - 32 bit
            [version] => 10
            [architecture] => 32
            [os] => ubuntu
        )

    [5] => Array
        (
            [id] => ubuntu10-64
            [name] => Ubuntu 10 - 64 bit
            [version] => 10
            [architecture] => 64
            [os] => ubuntu
        )

)

I also opted for converting the original xpath result into an array of two levels, each time within the current level a key already exists, move the current entry to a new entry (Demo):

try
{
    $xml = new SimpleXMLElement($xmlstr);
    $elements = array();
    $curr = NULL;
    foreach($xml->xpath('//id | //name | //version | //architecture | //os') as $record)
    {
        $key = $record->getName();
        $value = (string) $record;
        if (!$curr || array_key_exists($key, $curr)) {
            unset($curr);
            $curr = array();
            $elements[] = &$curr;
        }
        $curr[$key] = $value;
    }
    unset($curr);
}
catch(Exception $e)
{
    echo $e->getMessage();
}

Result is like this then:

Array
(
    [0] => Array
        (
            [id] => centos5-32
            [name] => CentOS 5 - 32 bit
            [version] => 5
            [architecture] => 32
            [os] => centos
        )

    [1] => Array
        (
            [id] => centos5-64
            [name] => CentOS 5 - 64 bit
            [version] => 5
            [architecture] => 64
            [os] => centos
        )

    [2] => Array
        (
            [id] => centos6-32
            [name] => CentOS 6 - 32 bit
            [version] => 6
            [architecture] => 32
            [os] => centos
        )

    [3] => Array
        (
            [id] => centos6-64
            [name] => CentOS 6 - 64 bit
            [version] => 6
            [architecture] => 64
            [os] => centos
        )

    [4] => Array
        (
            [id] => ubuntu10-32
            [name] => Ubuntu 10 - 32 bit
            [version] => 10
            [architecture] => 32
            [os] => ubuntu
        )

    [5] => Array
        (
            [id] => ubuntu10-64
            [name] => Ubuntu 10 - 64 bit
            [version] => 10
            [architecture] => 64
            [os] => ubuntu
        )

)
梦里寻她 2024-12-05 16:20:30

试试这个:

// flatten:
function arrayval1($any) {
  return array_values((array)$any);
}
function arrayval2($any) {
  return (array)$any;
}

// xml objects with xml objects:
$oses = $xml->xpath('//OSes/*/*');
// an array of xml objects:
$oses = array_map('arrayval1', $oses);
// merge to a flat array:
$oses = call_user_func_array('array_merge', $oses);
// xml objects -> arrays
$oses = array_map('arrayval2', $oses);
print_r($oses);

我的结果:

Array
(
    [0] => Array
        (
            [id] => centos5-32
            [name] => CentOS 5 - 32 bit
            [version] => 5
            [architecture] => 32
            [os] => centos
        )

    [1] => Array
        (
            [id] => centos5-64
            [name] => CentOS 5 - 64 bit
            [version] => 5
            [architecture] => 64
            [os] => centos
        )

    [2] => Array
        (
            [id] => centos6-32
            [name] => CentOS 6 - 32 bit
            [version] => 6
            [architecture] => 32
            [os] => centos
        )

    [3] => Array
        (
            [id] => centos6-64
            [name] => CentOS 6 - 64 bit
            [version] => 6
            [architecture] => 64
            [os] => centos
        )

    [4] => Array
        (
            [id] => ubuntu10-32
            [name] => Ubuntu 10 - 32 bit
            [version] => 10
            [architecture] => 32
            [os] => ubuntu
        )

    [5] => Array
        (
            [id] => ubuntu10-64
            [name] => Ubuntu 10 - 64 bit
            [version] => 10
            [architecture] => 64
            [os] => ubuntu
        )

)

如果你使用 PHP >= 5.3 (当然你是,为什么你不呢)你可以省略讨厌的 tmp 函数定义并使用很酷的匿名函数进行映射:

// an array of xml objects:
$oses = array_map(function($os) {
  return array_values((array)$os);
}, $oses);

Try this:

// flatten:
function arrayval1($any) {
  return array_values((array)$any);
}
function arrayval2($any) {
  return (array)$any;
}

// xml objects with xml objects:
$oses = $xml->xpath('//OSes/*/*');
// an array of xml objects:
$oses = array_map('arrayval1', $oses);
// merge to a flat array:
$oses = call_user_func_array('array_merge', $oses);
// xml objects -> arrays
$oses = array_map('arrayval2', $oses);
print_r($oses);

My result:

Array
(
    [0] => Array
        (
            [id] => centos5-32
            [name] => CentOS 5 - 32 bit
            [version] => 5
            [architecture] => 32
            [os] => centos
        )

    [1] => Array
        (
            [id] => centos5-64
            [name] => CentOS 5 - 64 bit
            [version] => 5
            [architecture] => 64
            [os] => centos
        )

    [2] => Array
        (
            [id] => centos6-32
            [name] => CentOS 6 - 32 bit
            [version] => 6
            [architecture] => 32
            [os] => centos
        )

    [3] => Array
        (
            [id] => centos6-64
            [name] => CentOS 6 - 64 bit
            [version] => 6
            [architecture] => 64
            [os] => centos
        )

    [4] => Array
        (
            [id] => ubuntu10-32
            [name] => Ubuntu 10 - 32 bit
            [version] => 10
            [architecture] => 32
            [os] => ubuntu
        )

    [5] => Array
        (
            [id] => ubuntu10-64
            [name] => Ubuntu 10 - 64 bit
            [version] => 10
            [architecture] => 64
            [os] => ubuntu
        )

)

If you're using PHP >= 5.3 (ofcourse you are, why whouldn't you) you can omit the nasty tmp function definitions and use cool anonymous functions for the mapping:

// an array of xml objects:
$oses = array_map(function($os) {
  return array_values((array)$os);
}, $oses);
~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文