如何使用 XmlSerializer 获取 XML 元素的内容？

发布于 2024-07-07 04:51:59 字数 1329 浏览 8 评论 0原文

我有一个关于此 XML 字符串的 XML 阅读器：

<?xml version="1.0" encoding="UTF-8" ?>
<story id="1224488641nL21535800" date="20 Oct 2008" time="07:44">
<title>PRESS DIGEST - PORTUGAL - Oct 20</title>
<text>
<p>    LISBON, Oct 20 (Reuters) - Following are some of the main
 stories in Portuguese newspapers on Monday. Reuters has not
verified these stories and does not vouch for their accuracy. </p>
<p>More HTML stuff here</p>
</text>
</story>

我创建了一个 XSD 和一个用于反序列化的相应类。

[System.Xml.Serialization.XmlRootAttribute(Namespace="", IsNullable=false)]
public class story {
    [System.Xml.Serialization.XmlAttributeAttribute()]
    public string id;
    [System.Xml.Serialization.XmlAttributeAttribute()]
    public string date;
    [System.Xml.Serialization.XmlAttributeAttribute()]
    public string time;
    public string title;
    public string text;
}

然后，我使用 XmlSerializer 的 Deserialize 方法创建该类的实例。

XmlSerializer ser = new XmlSerializer(typeof(story));
return (story)ser.Deserialize(xr);

现在，story 的 text 成员始终为 null。如何更改我的 story 类以便按预期解析 XML？

编辑：

使用 XmlText 不起作用，并且我无法控制正在解析的 XML。

原文

I have an XML reader on this XML string:

<?xml version="1.0" encoding="UTF-8" ?>
<story id="1224488641nL21535800" date="20 Oct 2008" time="07:44">
<title>PRESS DIGEST - PORTUGAL - Oct 20</title>
<text>
<p>    LISBON, Oct 20 (Reuters) - Following are some of the main
 stories in Portuguese newspapers on Monday. Reuters has not
verified these stories and does not vouch for their accuracy. </p>
<p>More HTML stuff here</p>
</text>
</story>

I created an XSD and a corresponding class for deserialization.

[System.Xml.Serialization.XmlRootAttribute(Namespace="", IsNullable=false)]
public class story {
    [System.Xml.Serialization.XmlAttributeAttribute()]
    public string id;
    [System.Xml.Serialization.XmlAttributeAttribute()]
    public string date;
    [System.Xml.Serialization.XmlAttributeAttribute()]
    public string time;
    public string title;
    public string text;
}

I then create an instance of the class using the Deserialize method of XmlSerializer.

XmlSerializer ser = new XmlSerializer(typeof(story));
return (story)ser.Deserialize(xr);

Now, the text member of story is always null. How do I change my story class so that the XML is parsed as expected?

EDIT:

Using an XmlText does not work and I have no control over the XML I'm parsing.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

永言不败 2024-07-14 04:51:59

我发现了一个非常令人不满意的解决方案。

像这样更改类（呃！）

// ...
[XmlElement("HACK - this should never match anything")]
public string text;
// ...

并像这样更改调用代码（恶心！）

XmlSerializer ser = new XmlSerializer(typeof(story));
string text = string.Empty;
ser.UnknownElement += delegate(object sender, XmlElementEventArgs e) {
    if (e.Element.Name != "text")
        throw new XmlException(
              string.Format(CultureInfo.InvariantCulture, 
             "Unknown element '{0}' cannot be deserialized.",
             e.Element.Name));
    text += e.Element.InnerXml;
};

story result = (story)ser.Deserialize(xr);
result.text = text;
return result;

这是一种非常糟糕的方法，因为它破坏了封装。有更好的方法吗？

I found a very unsatisfactory solution.

Change the class like this (ugh!)

// ...
[XmlElement("HACK - this should never match anything")]
public string text;
// ...

And change the calling code like this (yuck!)

XmlSerializer ser = new XmlSerializer(typeof(story));
string text = string.Empty;
ser.UnknownElement += delegate(object sender, XmlElementEventArgs e) {
    if (e.Element.Name != "text")
        throw new XmlException(
              string.Format(CultureInfo.InvariantCulture, 
             "Unknown element '{0}' cannot be deserialized.",
             e.Element.Name));
    text += e.Element.InnerXml;
};

story result = (story)ser.Deserialize(xr);
result.text = text;
return result;

This is a really bad way of doing it because it breaks encapsulation. Is there a better way of doing it?

回复收藏 0 原文

紫瑟鸿黎 2024-07-14 04:51:59

如果文本标签只包含 p 标签，我将提出的建议如下，它在短期内可能有用。

您可以将故事作为字符串数组，而不是将文本字段作为字符串。然后，您可以使用正确的 XmlArray 属性（无法记住确切的名称，例如 XmlArrayItemAttribute）和正确的参数，使其看起来像：

<text>
   <p>blah</p>
   <p>blib</p>
</text>

这更近了一步，但不完全是您所需要的。

另一种选择是创建一个类似的类：

public class Text //Obviously a bad name for a class...
{
   public string[] p;
   public string[] pre;
}

再次使用 XmlArray 属性使其看起来正确，不确定它们是否可配置，因为我之前只将它们用于简单类型。

编辑：

使用：

[System.Xml.Serialization.XmlRootAttribute(Namespace = "", IsNullable = false)]
    public class story
    {
        [System.Xml.Serialization.XmlAttributeAttribute()]
        public string id;
        [System.Xml.Serialization.XmlAttributeAttribute()]
        public string date;
        [System.Xml.Serialization.XmlAttributeAttribute()]
        public string time;
        public string title;

        [XmlArrayItem("p")]
        public string[] text;

    }

与提供的 XML 配合良好，但拥有该类似乎有点复杂。它最终的结果类似于：

    <text>
       <p>
          <p>qwertyuiop</p>
          <p>asdfghjkl</p>
       </p>
       <pre>
          <pre>stuff</pre>
          <pre>nonsense</pre>
       </pre>
   </text>

这显然不是我们想要的。

The suggestion that I was going to make if the text tag only ever contained p tags was the following, it may be useful in the short term.

Instead of story having the text field as a string, you could have it as an array of strings. You could then use the right XmlArray attributes (can't remember the exact names, something like XmlArrayItemAttribute), with the right parameters to make it look like:

<text>
   <p>blah</p>
   <p>blib</p>
</text>

Which is a step closer, but not completely what you need.

Another option is to make a class like:

public class Text //Obviously a bad name for a class...
{
   public string[] p;
   public string[] pre;
}

And again use the XmlArray attributes to get it to look right, not sure if they are as configurable as that because I've only used them for simple types before.

Edit:

Using:

[System.Xml.Serialization.XmlRootAttribute(Namespace = "", IsNullable = false)]
    public class story
    {
        [System.Xml.Serialization.XmlAttributeAttribute()]
        public string id;
        [System.Xml.Serialization.XmlAttributeAttribute()]
        public string date;
        [System.Xml.Serialization.XmlAttributeAttribute()]
        public string time;
        public string title;

        [XmlArrayItem("p")]
        public string[] text;

    }

Works well with the supplied XML, but having the class seems a little more complicated. It ends up as something similar to:

    <text>
       <p>
          <p>qwertyuiop</p>
          <p>asdfghjkl</p>
       </p>
       <pre>
          <pre>stuff</pre>
          <pre>nonsense</pre>
       </pre>
   </text>

which is obviously not what is desired.

回复收藏 0 原文