Unicode error from a web service response when using Python suds

Posted on 2024-11-11 18:33:35

I've seen the other threads about this issue, but I haven't seen an answer that helps me.

My issue is very similar to that of the person using "CJ's horrible web services" in a previous post.

I'm using Python 2.5 and the suds library (version 0.4.1). I request some records from a database through a web service. I then try to print some of the fields of the returned records. Some of the titles of those records contain characters that cause an exception. The exception I get is:

UnicodeEncodeError: 'ascii' codec can't encode character u'\u201d' in position 39: ordinal not in range(128)

My code looks like this: (sr is a Service Request, the type of record I'm retrieving from the DB)

response = client.service.QuerySRByExample(input_data)
for sr in response:
    print sr.SRNumber, sr.Title

If I loop through the offending title and call ord() on each character, I can see some curly double-quote characters with code points 8220 (U+201C) and 8221 (U+201D). These are what cause the error (the first of them is at position 39 of the title string, as per the error message).

... 114 111 108 108 101 114 32 65 8221 32 43 32 8220 68 67 78 ...
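
A minimal sketch of that inspection step, for anyone who wants to reproduce it. The `fragment` string is rebuilt from the code points listed above (it is not the full title), and `non_ascii_positions` is a hypothetical helper name, not part of suds.

```python
# Fragment rebuilt from the code points shown in the post; the full title is not given.
fragment = u'roller A\u201d + \u201cDCN'

def non_ascii_positions(text):
    """Return (index, code point) pairs for every character outside ASCII."""
    return [(i, ord(ch)) for i, ch in enumerate(text) if ord(ch) > 127]

# Each pair gives the offset within the fragment and the decimal code point.
print(non_ascii_positions(fragment))
```

The offsets are relative to the fragment, so they will not match the position 39 reported for the full title.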

If I instead use

    print sr.SRNumber, sr.Title.encode('ascii', 'ignore')

I don't get the error. It just drops the offending characters (anything with code point > 127).
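
For comparison, 'ignore' is only one of the standard error handlers accepted by `encode`. The sketch below runs the same fragment (rebuilt from the code points in the post, not the full title) through four of them so the trade-offs are visible; `HANDLERS` and `results` are names made up for this example.

```python
# Fragment rebuilt from the code points shown in the post; the full title is not given.
fragment = u'roller A\u201d + \u201cDCN'

# Standard codec error handlers that avoid raising UnicodeEncodeError.
HANDLERS = ('ignore', 'replace', 'xmlcharrefreplace', 'backslashreplace')

# Encode the same text once per handler.
results = dict((name, fragment.encode('ascii', name)) for name in HANDLERS)

for name in HANDLERS:
    print('%-18s %r' % (name, results[name]))
```

'ignore' drops the quotes, 'replace' substitutes a question mark, and the last two keep the information in an escaped form (`&#8221;` and `\u201d` respectively).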

Is there a better way to handle this? It seems like I should be able to convert the curly double-quotes into ASCII double-quotes somehow.
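
One possible approach, sketched under assumptions: replace known typographic characters with ASCII look-alikes before encoding. The mapping `ASCII_LOOKALIKES` and the helper `to_ascii` are hypothetical names for this example, the choice of replacements is a judgment call, and the sketch assumes the title behaves like a unicode string.

```python
# Assumed mapping from code point to an ASCII look-alike; extend as needed.
ASCII_LOOKALIKES = {
    0x2018: u"'",   # left single quotation mark
    0x2019: u"'",   # right single quotation mark
    0x201C: u'"',   # left double quotation mark (code point 8220)
    0x201D: u'"',   # right double quotation mark (code point 8221)
    0x2013: u'-',   # en dash
    0x2014: u'-',   # em dash
}

def to_ascii(text):
    """Swap known typographic characters for ASCII ones, then encode.

    Anything still outside ASCII after the swap becomes '?' instead of
    raising UnicodeEncodeError or vanishing silently.
    """
    return text.translate(ASCII_LOOKALIKES).encode('ascii', 'replace')

# Fragment rebuilt from the code points shown in the post.
fragment = u'roller A\u201d + \u201cDCN'
print(to_ascii(fragment))
```

In the loop from the post this would look like `print sr.SRNumber, to_ascii(sr.Title)`. If the output device can display UTF-8, encoding with `sr.Title.encode('utf-8')` instead would keep the original quotes.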

The web service says it is using UTF-8 encoding. The first part of the response back from the web service is:

 <?xml version="1.0" encoding="UTF-8" ?> 
 <SOAP-ENV:Envelope xmlns:SOAP-ENV="http://schemas.xmlsoap.org/soap/envelope/">
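
A note on what that declaration covers, with a small sketch. The UTF-8 header describes the bytes on the wire. The `u'\u201d'` in the error message suggests the title is already a unicode string by the time it is printed, so the 'ascii' codec in the traceback most likely belongs to the output step, not to the response. The variable names below are made up for this example.

```python
right_quote = u'\u201d'   # the character named in the error message

# On the wire, UTF-8 represents this character as three bytes (E2 80 9D).
wire_bytes = right_quote.encode('utf-8')
round_trip = wire_bytes.decode('utf-8')

# Encoding the same character as ASCII is what fails.
try:
    right_quote.encode('ascii')
except UnicodeEncodeError:
    ascii_failed = True
else:
    ascii_failed = False

print(len(wire_bytes), round_trip == right_quote, ascii_failed)
```

The round trip through UTF-8 is lossless, while the ASCII step raises the same exception type as in the post.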

In the other thread, one user said he found something in the suds code and was able to fix it. I don't know if that was incorporated into the suds library.

Any help would be greatly appreciated.
