"Out of Memory" error with mechanize
I was trying to scrape some information from a website page by page, basically here's what I did:
import mechanize

MechBrowser = mechanize.Browser()
Counter = 0
while Counter < 5000:
    Response = MechBrowser.open("http://example.com/page" + str(Counter))
    Html = Response.read()
    Response.close()
    OutputFile = open("Output.txt", "a")
    OutputFile.write(Html)
    OutputFile.close()
    Counter = Counter + 1
Well, the above code ended up throwing an "Out of Memory" error, and Task Manager shows the script using almost 1 GB of memory after several hours of running... how come?!
Can anybody tell me what went wrong?
This is not exactly a memory leak, but rather an undocumented feature. Basically, mechanize.Browser() is collectively storing all browser history in memory as it goes. If you add a call to MechBrowser.clear_history() after Response.close(), it should resolve the problem.
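For reference, here is a minimal sketch of the corrected loop. The only change is the clear_history() call described above; the URL pattern, page count, and output file name are just the placeholders from the question:

import mechanize

MechBrowser = mechanize.Browser()
Counter = 0
while Counter < 5000:
    Response = MechBrowser.open("http://example.com/page" + str(Counter))
    Html = Response.read()
    Response.close()
    # Drop the responses mechanize keeps around for back()/reload(),
    # so the history list does not grow by one page per iteration.
    MechBrowser.clear_history()
    OutputFile = open("Output.txt", "a")
    OutputFile.write(Html)
    OutputFile.close()
    Counter = Counter + 1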