
Can I turn off implicit Python unicode conversions to find my mixed-string bugs?

Posted on 2024-09-01 12:17:24

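When profiling our code, I was surprised to find millions of calls to
C:\Python26\lib\encodings\utf_8.py:15(decode)

I started debugging and found that our code base contains many small bugs, usually comparing a string with a unicode, or adding a string and a unicode. Python graciously decodes the string and then performs the operation in unicode.

How kind. But expensive!

I am fluent in unicode, having read Joel Spolsky and Dive Into Python...

I try to keep our code internals in unicode only.

My question: can I turn off this Pythonic nice-guy behavior? At least until I find all these bugs and fix them (usually by adding a u'u')?

Some of them are extremely hard to find (a variable that is only sometimes a string...).

Python 2.6.5 (and I can't switch to 3.x).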


妳是的陽光 2024-09-08 12:17:24

The following should work:

>>> import sys
>>> reload(sys)
<module 'sys' (built-in)>
>>> sys.setdefaultencoding('undefined')
>>> u"abc" + u"xyz"
u'abcxyz'
>>> u"abc" + "xyz"
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/System/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/encodings/undefined.py", line 22, in decode
    raise UnicodeError("undefined encoding")
UnicodeError: undefined encoding

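The reload(sys) in the snippet above is needed only in an interactive session like this one: site.py deletes sys.setdefaultencoding at the end of startup, and reloading sys brings it back. Normally the sys.setdefaultencoding call is supposed to go in a sitecustomize.py file in your Python site-packages directory, where the function is still available (doing it that way is advisable).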
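For reference, a minimal sketch of what such a sitecustomize.py might look like. The function names disable_implicit_conversion and undefined_codec_rejects are hypothetical, made up for this illustration; only sys.setdefaultencoding and the 'undefined' codec come from the answer above. The guard is an assumption on my part so that the file stays harmless on interpreters where sys.setdefaultencoding is not available.

```python
# sitecustomize.py -- sketch only; assumed to live in site-packages as the
# answer suggests. The helper names below are hypothetical.
import sys


def disable_implicit_conversion():
    """Make implicit str <-> unicode conversions raise instead of decoding.

    Returns True if the default encoding was changed, False otherwise.
    """
    # Python 2's site.py removes sys.setdefaultencoding after it has imported
    # sitecustomize, so the function is only reachable this early in startup.
    # Where it is missing (for example on Python 3) this does nothing.
    setter = getattr(sys, "setdefaultencoding", None)
    if setter is None:
        return False
    setter("undefined")
    return True


def undefined_codec_rejects(data):
    """Show what the 'undefined' codec does: any decode attempt fails."""
    try:
        data.decode("undefined")
    except UnicodeError:
        return True
    return False


# Applied once at interpreter startup when this file is sitecustomize.py.
disable_implicit_conversion()
```

Because this changes the default encoding for the whole process, third-party libraries that rely on implicit conversion will start raising UnicodeError as well, so it is probably best kept to a development machine while hunting for the mixed-string call sites.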

