Android Pull 解析 RSS feed 遇到麻烦

发布于 2024-10-07 09:28:15 字数 3918 浏览 4 评论 0原文

我正在为 Android 开发一个非常简单的 RSS 阅读器，作为一种学习体验。我决定使用 XmlPullParser 来解析提要，因为它非常简单并且具有可接受的效率水平（满足我的需要）。我在尝试解析我的测试提要 (rss.slashdot.org/slashdot/slashdot) 时遇到错误，尽管在网络上搜索答案，但我似乎无法解决该错误。错误（来自 Eclipse）是：

START_TAG <image>@2:1252 in java.io.InputStreamReader@43e7a488
START_TAG (empty) <{http://www.w3.org/2005/Atom}atom10:link rel='self' type='application/rss+xml' href='http://rss.slashdot.org/Slashdot/slashdot'>@2:1517 in java.io.InputStreamReader@43e7a488
DEBUG/JRSS(313): java.net.MalformedURLException: Protocol not found:

有问题的文件是：

<image>
    ...
</image>
<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://rss.slashdot.org/Slashdot/slashdot" />
<feedburner:info uri="slashdot/slashdot" />
<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" />
    ...

因此错误似乎发生在 feedburner 标记处。

最后，我的代码是：

public class XmlHelper
{
    private XmlPullParserFactory factory;
    private XmlPullParser xpp;
    private final int START_TAG = XmlPullParser.START_TAG;

    // Debugging Tag
    private final String TAG = "JRSS";

    // for channels and items
    private final String TITLE = "title";
    private final String LINK = "link";
    private final String DESCRIPTION = "description";
    private final String PUBDATE = "pubDate";

    // element keys for channel
    private final String LANGUAGE = "language";
    private final String IMAGE = "image";
    private final String ITEM = "item";

    // for items
    private final String AUTHOR = "author";

    // for images
    private final String URL = "url";
    private final String WIDTH = "width";
    private final String HEIGHT = "height";

    public XmlHelper(Context context)
    {
        try
    {
        factory = XmlPullParserFactory.newInstance();
    }
    catch (XmlPullParserException e)
    {
        Log.d(TAG, e.toString());
    }
        factory.setNamespaceAware(true);
    }

    public Channel addFeed(URL url) throws XmlPullParserException, IOException
    {       
        Channel c = new Channel();
        c.items = new ArrayList<Item>();

        xpp = factory.newPullParser();
        xpp.setInput(url.openStream(), null);

        // move past rss element
        xpp.nextTag();
        // move past channel element
        xpp.nextTag();

        while(xpp.nextTag() == START_TAG)
        {
            Log.d(TAG, xpp.getPositionDescription());

            if(xpp.getName().equals(TITLE))
                c.title = xpp.nextText();

            else if(xpp.getName().equals(LINK))
                c.url = new URL(xpp.nextText());

            else if(xpp.getName().equals(DESCRIPTION))
                c.description = xpp.nextText();

            else if(xpp.getName().equals(LANGUAGE))
                c.language = xpp.nextText();

            else if(xpp.getName().equals(ITEM))
            {
                Item i = parseItem(xpp);
                c.items.add(i);
            }

            else if(xpp.getName().equals(IMAGE))
            {
                parseImage(xpp);
            }

            else
                xpp.nextText();
        }

        return c;
    }

    public Item parseItem(XmlPullParser xpp) throws MalformedURLException, XmlPullParserException, IOException
    {
    Item i = new Item();

    while(xpp.nextTag() == START_TAG)
    {
            // do nothing for now
        xpp.nextText();
    }

        return i;
    }

    private void parseImage(XmlPullParser xpp) throws XmlPullParserException, IOException
    {
        // do nothing for now
        while(xpp.nextTag() == START_TAG)
        {
            xpp.nextText();
        }
    }

我真的不知道是否有办法忽略它（因为此时我不关心 feedburner 标签）或者是否有解析器的某些功能可以设置为使这项工作成功，或者如果我以错误的方式处理这件事。任何帮助/建议/指导将不胜感激。

原文

I am working on a very simple RSS reader for Android as a learning experience. I decided to use the XmlPullParser for parsing the feeds as it is quite simple and has an acceptable level of efficiency (for my needs). I am getting an error while trying to parse my test feed (rss.slashdot.org/slashdot/slashdot) that I can't seem to resolve despite scouring the web for answers. The error (from eclipse) is:

START_TAG <image>@2:1252 in java.io.InputStreamReader@43e7a488
START_TAG (empty) <{http://www.w3.org/2005/Atom}atom10:link rel='self' type='application/rss+xml' href='http://rss.slashdot.org/Slashdot/slashdot'>@2:1517 in java.io.InputStreamReader@43e7a488
DEBUG/JRSS(313): java.net.MalformedURLException: Protocol not found:

The file in question is:

<image>
    ...
</image>
<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://rss.slashdot.org/Slashdot/slashdot" />
<feedburner:info uri="slashdot/slashdot" />
<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" />
    ...

so the error appears to occur at the feedburner tag.

finally, my code is:

public class XmlHelper
{
    private XmlPullParserFactory factory;
    private XmlPullParser xpp;
    private final int START_TAG = XmlPullParser.START_TAG;

    // Debugging Tag
    private final String TAG = "JRSS";

    // for channels and items
    private final String TITLE = "title";
    private final String LINK = "link";
    private final String DESCRIPTION = "description";
    private final String PUBDATE = "pubDate";

    // element keys for channel
    private final String LANGUAGE = "language";
    private final String IMAGE = "image";
    private final String ITEM = "item";

    // for items
    private final String AUTHOR = "author";

    // for images
    private final String URL = "url";
    private final String WIDTH = "width";
    private final String HEIGHT = "height";

    public XmlHelper(Context context)
    {
        try
    {
        factory = XmlPullParserFactory.newInstance();
    }
    catch (XmlPullParserException e)
    {
        Log.d(TAG, e.toString());
    }
        factory.setNamespaceAware(true);
    }

    public Channel addFeed(URL url) throws XmlPullParserException, IOException
    {       
        Channel c = new Channel();
        c.items = new ArrayList<Item>();

        xpp = factory.newPullParser();
        xpp.setInput(url.openStream(), null);

        // move past rss element
        xpp.nextTag();
        // move past channel element
        xpp.nextTag();

        while(xpp.nextTag() == START_TAG)
        {
            Log.d(TAG, xpp.getPositionDescription());

            if(xpp.getName().equals(TITLE))
                c.title = xpp.nextText();

            else if(xpp.getName().equals(LINK))
                c.url = new URL(xpp.nextText());

            else if(xpp.getName().equals(DESCRIPTION))
                c.description = xpp.nextText();

            else if(xpp.getName().equals(LANGUAGE))
                c.language = xpp.nextText();

            else if(xpp.getName().equals(ITEM))
            {
                Item i = parseItem(xpp);
                c.items.add(i);
            }

            else if(xpp.getName().equals(IMAGE))
            {
                parseImage(xpp);
            }

            else
                xpp.nextText();
        }

        return c;
    }

    public Item parseItem(XmlPullParser xpp) throws MalformedURLException, XmlPullParserException, IOException
    {
    Item i = new Item();

    while(xpp.nextTag() == START_TAG)
    {
            // do nothing for now
        xpp.nextText();
    }

        return i;
    }

    private void parseImage(XmlPullParser xpp) throws XmlPullParserException, IOException
    {
        // do nothing for now
        while(xpp.nextTag() == START_TAG)
        {
            xpp.nextText();
        }
    }

I don't really know if there is a way to just ignore this (because at this point I don't care about the feedburner tag) or if there is some feature of the parser that I can set to make this work, or if I'm going about this the wrong way. Any help / advice / guidance would be appreciated.

分享到QQ

分享到微博