从 Mechanical Turk 下载图像
我在 Mechanical Turk 中有一堆网址,我可以从中下载上传的图像。我编写了一个 Python 脚本来下载所有文件。但是,我必须先登录 Mechanical Turk 才能访问这些文件。是否有从 Mechanical Turk 下载大量文件的 API 或最佳实践?
I have a bunch of urls in Mechanical Turk where I can download the uploaded images from. I wrote a Python script to download all the files. However, I have to be logged in to Mechanical Turk to access the files first. Is there an API or best practice for downloading a large number of files from Mechanical Turk?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
Boto http://boto.cloudhackers.com/ 是 Amazon Web Services 的 Python 包装器,包括 Mechanical Turk;请参阅 Mechanical Turk API 参考 http://boto.cloudhackers.com/ref/mturk.html< /a>
或者,您可以使用 urllib2、mechanize 或 Scrapy 直接获取它们。所有这三个都将支持基本身份验证或基于 cookie 的表单身份验证。
Boto http://boto.cloudhackers.com/ is a Python wrapper for Amazon Web Services, including Mechanical Turk; see the Mechanical Turk API reference at http://boto.cloudhackers.com/ref/mturk.html
Alternatively you could use urllib2, mechanize, or Scrapy to grab them directly. All three will support either basic authentication or cookie-based form authentication.