当前位置：文江博客话题详情

通过python加载网站内容

发布于 2024-10-25 22:13:17 字数 62 浏览 5 评论 0原文

如何通过python从网站加载特定内容？例如，我想加载博客的一些帖子并将它们显示到我自己的网站上。我该怎么做？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

反话 2024-11-01 22:13:18

答案：

import urllib2
from BeautifulSoup import BeautifulSoup

def fetchtags(req, name, attrs, num):
        try:
            website = urllib2.urlopen(req)
        except urllib2.HTTPError, e:
            print 'A problem occured. Please try again.'
            return
        soup = BeautifulSoup(website,
                             convertEntities=BeautifulSoup.HTML_ENTITIES)
        tags = soup.findAll(name=name,
                            attrs=attrs,
                            limit=num)
        return tags

然后你可以像这样使用它：

fetchtags('http://www.website.com', 'div', {'class':'c'}, 10)

从指定的url获取c类的10个div...

有关返回对象的更多详细信息，请参阅Beautiful Soup。

An answer:

import urllib2
from BeautifulSoup import BeautifulSoup

def fetchtags(req, name, attrs, num):
        try:
            website = urllib2.urlopen(req)
        except urllib2.HTTPError, e:
            print 'A problem occured. Please try again.'
            return
        soup = BeautifulSoup(website,
                             convertEntities=BeautifulSoup.HTML_ENTITIES)
        tags = soup.findAll(name=name,
                            attrs=attrs,
                            limit=num)
        return tags

Then you can use it like:

fetchtags('http://www.website.com', 'div', {'class':'c'}, 10)

To get 10 divs of class c from the specified url...

See Beautiful Soup for more details on the returned object.

回复收藏 0 原文

草莓味的萝莉 2024-11-01 22:13:18

urllib 和 urllib2 将允许您加载原始 HTML。 HTML 解析器（例如 BeautifulSoup 和 lxml）将允许您解析原始 HTML，以便您可以获取您关心的部分。 Mako、Cheetah 等模板引擎可以让您生成 HTML，以便您可以显示网页。

回复收藏 0 原文

~没有更多了~

关于作者

╰沐子

暂无简介

文章

28 人气

关注发私信

微信用户

文章 0 评论 0

关注

夜夜流光相皎洁

文章 0 评论 0

关注

零度℉

文章 0 评论 0

关注

百度③文鱼

文章 0 评论 0

关注

qq_O3Ao6frw

文章 0 评论 0

关注

Wugswg

文章 0 评论 0

友情链接

文江博客

通过python加载网站内容

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

微信用户

夜夜流光相皎洁

零度℉

百度③文鱼

qq_O3Ao6frw

Wugswg

友情链接

通过python加载网站内容

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

微信用户

夜夜流光相皎洁

零度℉

百度③文鱼

qq_O3Ao6frw

Wugswg

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。