在大型 XML 文档中查找特定属性

发布于 2024-10-08 09:18:39 字数 462 浏览 2 评论 0原文

我有一个大约 100mb 的大型 XML 文档。我需要在本文档中查找两个标签的属性。我可以通过使用与以下类似的代码来完成此操作：

XmlDocument xmlDocument = new XmlDocument ( );
xmlDocument.Load ( "C:\\myxml.xml" );

XmlNode node1 = xmlDocument.SelectSingleNode ( "/data/objects[@type='data type 1']" );
if ( null != node1 )
{
   result = node1 [ "Version" ].Value;
}

但是这样做会将整个 XML 加载到内存中，这似乎需要大约 200mb。无论如何，我可以提高效率吗？

编辑：使用 XmlTextReader 有很多很好的答案，我已经编写了代码供现在使用。（这会提高内存效率，但很难看:)。

原文

I have a large XML document that is around 100mb. I need to find attributes for two tags in this document. I can do this by using similar code to the following:

XmlDocument xmlDocument = new XmlDocument ( );
xmlDocument.Load ( "C:\\myxml.xml" );

XmlNode node1 = xmlDocument.SelectSingleNode ( "/data/objects[@type='data type 1']" );
if ( null != node1 )
{
   result = node1 [ "Version" ].Value;
}

But doing so loads the entire XML into memory which seems to take around 200mb. Is there anyway I can make this more efficient?

Edit: Lots of nice answers using the XmlTextReader which I have written my code to use now. (It will be more memory efficient, but ugly :).

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

怪我鬧 2024-10-15 09:18:39

就性能而言，SAX 比 DOM 好得多，因为您实际上只需要一个值。 .NET Framework 中的 SAX 实现是 XmlTextReader。

回复收藏 0 原文

诗酒趁年少 2024-10-15 09:18:39

您应该尝试使用 XmlReader。

来自 MSDN ：

与 SAX 读取器一样，XmlReader 是只进、只读游标。它提供对输入的快速、非缓存流访问。它可以读取流或文档。它允许用户提取数据并跳过应用程序不感兴趣的记录。最大的区别在于，SAX 模型是“推送”模型，解析器将事件推送到应用程序，每次读取新节点时通知应用程序，而使用 XmlReader 的应用程序可以从读取器中提取节点将要。

示例此处。

回复收藏 0 原文

与风相奔跑 2024-10-15 09:18:39

您可以使用 XmlReader 类来执行此操作。一个简单但有效的示例，其功能与上面的代码相同，如下所示：

string result = null;

using (var reader = XmlReader.Create(@"c:\\myxml.xml"))
{
    while (reader.Read())
    {
        if (reader.NodeType == XmlNodeType.Element
            && reader.Depth == 1
            && reader.LocalName == "objects"
            && reader.GetAttribute("type") == "data type 1")
        {
            result = reader.GetAttribute("Version");
            break;
        }
    }
}

You can use the XmlReader class to do this. A simple but working example that does the same as your code above looks like this:

string result = null;

using (var reader = XmlReader.Create(@"c:\\myxml.xml"))
{
    while (reader.Read())
    {
        if (reader.NodeType == XmlNodeType.Element
            && reader.Depth == 1
            && reader.LocalName == "objects"
            && reader.GetAttribute("type") == "data type 1")
        {
            result = reader.GetAttribute("Version");
            break;
        }
    }
}

回复收藏 0 原文

~没有更多了~