pyspider调试的时候没有任何问题,点run就报编码问题

发布于 2022-09-05 03:45:57 字数 3266 浏览 22 评论 0

调试的时候没有任何问题,点run就报编码问题。同样2个采集就这一个老报错,另外一个完全没问题

taskid

d7221a2be620c4ef60e874a1d93e79d1

lastcrawltime

1499144004.75187 (20 minutes ago)

updatetime

1499144004.7518892 (20 minutes ago)

exetime

1499144014.7518687 (20 minutes ago)

track.fetch 1.32ms

{
  "content": "",
  "encoding": null,
  "error": "'ascii' codec can't encode character '\\uff09' in position 94: ordinal not in range(128)",
  "headers": {},
  "ok": false,
  "redirect_url": null,
  "status_code": 599,
  "time": 0.0013222694396972656
}

track.process 0.83ms

'ascii' codec can't encode character '\uff09' in position 94: ordinal not in range(128)
 = self.gen.throw(*exc_info)
      File "/root/workspaces/pyspider3/lib/python3.5/site-packages/pyspider/fetcher/tornado_fetcher.py", line 378, in http_fetch
        response = yield gen.maybe_future(self.http_client.fetch(request))
      File "/root/workspaces/pyspider3/lib/python3.5/site-packages/tornado/gen.py", line 1055, in run
        value = future.result()
      File "/root/workspaces/pyspider3/lib/python3.5/site-packages/tornado/concurrent.py", line 238, in result
        raise_exc_info(self._exc_info)
      File "<string>", line 4, in raise_exc_info
      File "/root/workspaces/pyspider3/lib/python3.5/site-packages/tornado/curl_httpclient.py", line 214, in _process_queue
        curl.info["headers"])
      File "/root/workspaces/pyspider3/lib/python3.5/site-packages/tornado/curl_httpclient.py", line 306, in _curl_setup_request
        for k, v in request.headers.get_all()])
    Exception: 'ascii' codec can't encode character '\uff09' in position 94: ordinal not in range(128)

{
  "exception": "'ascii' codec can't encode character '\\uff09' in position 94: ordinal not in range(128)",
  "follows": 0,
  "logs": " = self.gen.throw(*exc_info)\n      File \"/root/workspaces/pyspider3/lib/python3.5/site-packages/pyspider/fetcher/tornado_fetcher.py\", line 378, in http_fetch\n        response = yield gen.maybe_future(self.http_client.fetch(request))\n      File \"/root/workspaces/pyspider3/lib/python3.5/site-packages/tornado/gen.py\", line 1055, in run\n        value = future.result()\n      File \"/root/workspaces/pyspider3/lib/python3.5/site-packages/tornado/concurrent.py\", line 238, in result\n        raise_exc_info(self._exc_info)\n      File \"<string>\", line 4, in raise_exc_info\n      File \"/root/workspaces/pyspider3/lib/python3.5/site-packages/tornado/curl_httpclient.py\", line 214, in _process_queue\n        curl.info[\"headers\"])\n      File \"/root/workspaces/pyspider3/lib/python3.5/site-packages/tornado/curl_httpclient.py\", line 306, in _curl_setup_request\n        for k, v in request.headers.get_all()])\n    Exception: 'ascii' codec can't encode character '\\uff09' in position 94: ordinal not in range(128)\n",
  "ok": false,
  "result": null,
  "time": 0.0008292198181152344
}

schedule

{
  "age": 10,
  "exetime": 1499144014.7518687,
  "retried": 3
}

process

{
  "callback": "index_page"
}

fetch

{}

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(2

冷夜 2022-09-12 03:45:57

headers设置有问题,我删掉直接秒好.

彼岸花似海 2022-09-12 03:45:57
#!/usr/bin/env python
# -*- encoding: utf-8 -*-

看看你的代码前两行是这个吧

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文