Advice wanted: a web spider in Perl
The spider should:
- Log in by POSTing a login form
- Maintain the login state with cookies
- Follow random links on the homepage
The purpose of the spider is to keep a new website busy by simulating many users browsing it.
I had a look at LWP and WWW::Mechanize. I'm not sure which module I should use, or whether there is a better option. A code example would be great.
Thanks.
Comments (1)
Of those two choices, WWW::Mechanize. It uses LWP::UserAgent under the hood. I have no experience with the CPAN module WWW::Spider, and I know it's nowhere near as well known as WWW::Mechanize, but you could take a look at it too and see whether it meets your needs. I'd vote for the well-proven option, though.
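Since a code example was requested, here is a rough sketch of how the three requirements might look with WWW::Mechanize. Everything site-specific is an assumption for illustration: the `login` path, the `username` and `password` field names, the visit count of 10, and the helper names `pick_random` and `browse` are all made up and would need to match the real site. WWW::Mechanize keeps an in-memory cookie jar by default, so the session cookie set at login should be sent with the later requests without extra code.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use WWW::Mechanize;

# Return one element of the list at random, or undef for an empty list.
sub pick_random {
    my @items = @_;
    return @items ? $items[ rand @items ] : undef;
}

# Log in, then repeatedly load the homepage and follow one random same-site link.
sub browse {
    my ( $base_url, $user, $password, $visits ) = @_;

    # autocheck makes failed requests die instead of failing silently.
    my $mech = WWW::Mechanize->new( autocheck => 1 );

    # Assumption: the login form lives at <base>/login and uses these field names.
    $mech->get( $base_url . 'login' );
    $mech->submit_form(
        with_fields => { username => $user, password => $password },
    );

    for ( 1 .. $visits ) {
        $mech->get($base_url);

        # Only consider links whose absolute URL stays on the same site.
        my $link = pick_random(
            $mech->find_all_links( url_abs_regex => qr/^\Q$base_url\E/ ) );
        last unless $link;

        $mech->get( $link->url_abs );
        print "Visited ", $mech->uri, "\n";

        # Pause 1 to 3 seconds so the traffic looks less mechanical.
        sleep 1 + int rand 3;
    }
}

# Usage: perl spider.pl http://www.example.com/ testuser secret
browse( @ARGV[ 0 .. 2 ], 10 ) if @ARGV >= 3;
```

To simulate several users you could start a few copies of the script with different accounts. You would probably also want to skip any logout link when picking, otherwise the spider may end its own session.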