Decoding a string with hex characters in Python 2

Posted 2024-09-06 02:09:06 · 168 characters · 2 views · 0 comments

I have a hex string and I want to convert it to UTF-8 so I can insert it into MySQL. (My database is UTF-8.)

hex_string = 'kitap ara\xfet\xfdrmas\xfd'
...
result = 'kitap araştırması'

How can I do that?

九命猫 2024-09-13 02:09:06

Try (Python 3.x):

import codecs
codecs.decode("707974686f6e2d666f72756d2e696f", "hex").decode('utf-8')

From here.
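As a rough sketch of what this answer covers: it applies when the string contains hex digits (two characters per byte), which is different from the question's string, where `\xfe` and `\xfd` are already raw bytes. The snippet below assumes Python 3 and shows three standard-library calls that should all give the same result; the variable names are only illustrative.

```python
import binascii
import codecs

# A string of hex digits: every two characters encode one byte.
hex_digits = "707974686f6e2d666f72756d2e696f"

# Three standard-library ways to turn hex digits into bytes, then into text.
via_codecs = codecs.decode(hex_digits, "hex").decode("utf-8")
via_fromhex = bytes.fromhex(hex_digits).decode("utf-8")
via_binascii = binascii.unhexlify(hex_digits).decode("utf-8")

print(via_codecs)
```

None of these help with the string in the question, because that string is not made of hex digits; it needs a character-set decode instead.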

还如梦归 2024-09-13 02:09:06

Assuming Python 2.6,

>>> print('kitap ara\xfet\xfdrmas\xfd'.decode('iso-8859-9'))
kitap araştırması
>>> 'kitap ara\xfet\xfdrmas\xfd'.decode('iso-8859-9').encode('utf-8')
'kitap ara\xc5\x9ft\xc4\xb1rmas\xc4\xb1'

电影里的梦 2024-09-13 02:09:06

The "String literals" documentation explains how to use UTF-8 strings in Python source code.

中性美 2024-09-13 02:09:06

Try

hex_string.decode("cp1254").encode("utf-8")

(cp1254 and iso-8859-9 are the Turkish code pages, the former being the usual name on Windows platforms. The two differ only for bytes in the 0x80-0x9F range, so for this string both work equally well in Python.)
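To check how interchangeable the two codecs are, one could compare them byte by byte. This is a sketch assuming Python 3; `decode_byte` is a hypothetical helper, not part of any library.

```python
def decode_byte(value, encoding):
    """Decode one byte value; return None if the codec leaves it undefined."""
    try:
        return bytes([value]).decode(encoding)
    except UnicodeDecodeError:
        return None

# Byte values where the two Turkish codecs disagree.
differing = [b for b in range(256)
             if decode_byte(b, "cp1254") != decode_byte(b, "iso-8859-9")]

# The question's string only uses bytes on which both codecs agree.
sample = b"kitap ara\xfet\xfdrmas\xfd"
same_for_sample = sample.decode("cp1254") == sample.decode("iso-8859-9")

# One byte where they differ: 0x80 is the euro sign in cp1254.
euro = b"\x80".decode("cp1254")

print([hex(b) for b in differing])
print(same_for_sample)
```

If the data came from a Windows application and may contain characters such as the euro sign or curly quotes, cp1254 is probably the safer guess.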

愁杀 2024-09-13 02:09:06

First you need to decode it from the encoded bytes you have. That appears to be ISO-8859-9 (latin-5), or, if you are using Windows, probably code page 1254, which is based on latin-5.

>>> 'kitap ara\xfet\xfdrmas\xfd'.decode('cp1254')
u'kitap ara\u015ft\u0131rmas\u0131' # u'kitap araştırması'

If you are using Windows, then depending on where you are getting those bytes, it might be more appropriate to decode them as mbcs, which translates to ‘whichever code page the local system is using’. If the string is just sitting in a .py file, you would be better off just writing u'kitap araştırması' in the source and setting a -*- coding declaration to direct Python to decode it. See PEP 263.
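A minimal sketch of the source-file approach, assuming the file is saved as UTF-8: the coding declaration must be on the first or second line of the file, and the `u` prefix marks the literal as Unicode text (required in Python 2, accepted as a no-op in Python 3.3+). The variable name is hypothetical.

```python
# -*- coding: utf-8 -*-
# The declaration above tells Python 2 how to decode non-ASCII literals
# in this file (PEP 263); Python 3 already assumes UTF-8 by default.

title = u'kitap araştırması'

# The literal is stored as Unicode code points, not as encoded bytes.
escaped = u'kitap ara\u015ft\u0131rmas\u0131'
print(title == escaped)
```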

As to how to encode unicode strings to UTF-8 for the database, well, if you want to you can do it manually:

>>> u'kitap ara\u015ft\u0131rmas\u0131'.encode('utf-8')
'kitap ara\xc5\x9ft\xc4\xb1rmas\xc4\xb1'

but a good data access layer is likely to do that automatically for you, if you've got the COLLATION of the tables the data is going into right.
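To illustrate the idea of letting the data access layer handle the encoding, here is a sketch that uses the standard-library `sqlite3` module as a stand-in for MySQL. That substitution is an assumption made so the example is self-contained; with a MySQL driver the placeholder syntax and connection options will differ. The table and column names are hypothetical, and the code is written for Python 3.

```python
import sqlite3

# Decode the legacy bytes to text once, at the boundary of the program.
text = b"kitap ara\xfet\xfdrmas\xfd".decode("cp1254")

# In-memory SQLite database standing in for the real MySQL server.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE searches (query TEXT)")

# Pass the text as a parameter; the driver takes care of encoding it.
conn.execute("INSERT INTO searches (query) VALUES (?)", (text,))

stored = conn.execute("SELECT query FROM searches").fetchone()[0]
conn.close()

print(stored == text)
```

The point of the design is that application code deals only with text, and no manual `.encode('utf-8')` call appears around the query.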

About the author

花期渐远
