当前位置：文江博客话题详情

解析电子邮件标题文本抄送字段的方法？

发布于 2024-10-25 23:29:32 字数 583 浏览 10 评论 0原文

我有一个抄送标头字段的纯文本，如下所示：

[电子邮件受保护]，John Smith <[电子邮件受保护]>,"史密斯，简" [电子邮件受保护]> ？

是否有经过实战测试的模块可以正确解析此内容

（如果是在Python中，那就更好了！电子邮件模块只返回原始文本，没有任何分割它的方法，据我所知）（如果它将姓名和地址拆分为字段，也会有好处）

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

方觉久 2024-11-01 23:29:32

有很多函数可用作标准 python 模块，但我认为您正在寻找
email.utils.parseaddr() 或 email.utils.getaddresses()

>>> addresses = '[email protected], John Smith <[email protected]>,"Smith, Jane" <[email protected]>'
>>> email.utils.getaddresses([addresses])
[('', '[email protected]'), ('John Smith', '[email protected]'), ('Smith, Jane', '[email protected]')]

There are a bunch of function available as a standard python module, but I think you're looking for
email.utils.parseaddr() or email.utils.getaddresses()

>>> addresses = '[email protected], John Smith <[email protected]>,"Smith, Jane" <[email protected]>'
>>> email.utils.getaddresses([addresses])
[('', '[email protected]'), ('John Smith', '[email protected]'), ('Smith, Jane', '[email protected]')]

回复收藏 0 原文

守不住的情 2024-11-01 23:29:32

我自己没有使用过它，但在我看来你可以使用 csv打包很容易解析数据。

回复收藏 0 原文

╭ゆ眷念 2024-11-01 23:29:32

下面的内容完全没有必要。我在意识到您可以传递 getaddresses() 一个包含单个字符串（包含多个地址）的列表之前就写了它。

我还没有机会查看电子邮件标头中地址的规范，但根据您提供的字符串，此代码应该将其拆分为一个列表，并确保忽略引号内的逗号（因此是名称的一部分）。

from email.utils import getaddresses

addrstring = ',[email protected], John Smith <[email protected]>,"Smith, Jane" <[email protected]>,'

def addrparser(addrstring):
    addrlist = ['']
    quoted = False

    # ignore comma at beginning or end
    addrstring = addrstring.strip(',')

    for char in addrstring:
        if char == '"':
            # toggle quoted mode
            quoted = not quoted
            addrlist[-1] += char
        # a comma outside of quotes means a new address
        elif char == ',' and not quoted:
            addrlist.append('')
        # anything else is the next letter of the current address
        else:
            addrlist[-1] += char

    return getaddresses(addrlist)

print addrparser(addrstring)

给出：

[('', '[email protected]'), ('John Smith', '[email protected]'),
 ('Smith, Jane', '[email protected]')]

我很想知道其他人会如何解决这个问题！

The bellow is completely unnecessary. I wrote it before realising that you could pass getaddresses() a list containing a single string containing multiple addresses.

I haven't had a chance to look at the specifications for addresses in email headers, but based on the string you provided, this code should do the job splitting it into a list, making sure to ignore commas if they are within quotes (and therefore part of a name).

from email.utils import getaddresses

addrstring = ',[email protected], John Smith <[email protected]>,"Smith, Jane" <[email protected]>,'

def addrparser(addrstring):
    addrlist = ['']
    quoted = False

    # ignore comma at beginning or end
    addrstring = addrstring.strip(',')

    for char in addrstring:
        if char == '"':
            # toggle quoted mode
            quoted = not quoted
            addrlist[-1] += char
        # a comma outside of quotes means a new address
        elif char == ',' and not quoted:
            addrlist.append('')
        # anything else is the next letter of the current address
        else:
            addrlist[-1] += char

    return getaddresses(addrlist)

print addrparser(addrstring)

Gives:

[('', '[email protected]'), ('John Smith', '[email protected]'),
 ('Smith, Jane', '[email protected]')]

I'd be interested to see how other people would go about this problem!

回复收藏 0 原文

晚风撩人 2024-11-01 23:29:32

将多个电子邮件字符串转换为字典（将多个带有名称的电子邮件转换为一个字符串）。

emailstring = 'Friends <[email protected]>, John Smith <[email protected]>,"Smith" <[email protected]>'

用逗号分割字符串

email_list = emailstring.split(',')

名称是键，电子邮件是值并制作字典。

email_dict = dict(map(lambda x: email.utils.parseaddr(x), email_list))

结果如下：

{'John Smith': '[email protected]', 'Friends': '[email protected]', 'Smith': '[email protected]'}

注意：

如果有相同的姓名和不同的电子邮件 ID，则跳过一条记录。

'Friends <[email protected]>, John Smith <[email protected]>,"Smith" <[email protected]>, Friends <[email protected]>'

《老友记》重复了两次。

Convert multiple E-mail string in to dictionary (Multiple E-Mail with name in to one string).

emailstring = 'Friends <[email protected]>, John Smith <[email protected]>,"Smith" <[email protected]>'

Split string by Comma

email_list = emailstring.split(',')

name is key and email is value and make dictionary.

email_dict = dict(map(lambda x: email.utils.parseaddr(x), email_list))

Result like this:

{'John Smith': '[email protected]', 'Friends': '[email protected]', 'Smith': '[email protected]'}

Note:

If there is same name with different email id then one record is skip.

'Friends <[email protected]>, John Smith <[email protected]>,"Smith" <[email protected]>, Friends <[email protected]>'

"Friends" is duplicate 2 time.

回复收藏 0 原文

~没有更多了~

关于作者

醉生梦死

暂无简介

文章

391 人气

关注发私信

櫻之舞

文章 0 评论 0

关注

弥枳

文章 0 评论 0

关注

m2429

文章 0 评论 0

关注

寻找一个思念的角度

文章 0 评论 0

关注

野却迷人

文章 0 评论 0

关注

我怀念的。

文章 0 评论 0

友情链接

文江博客

解析电子邮件标题文本抄送字段的方法？

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（4）

关于作者

相关话题

热门标签

推荐作者

櫻之舞

弥枳

m2429

寻找一个思念的角度

野却迷人

我怀念的。

友情链接

解析电子邮件标题文本抄送字段的方法？

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（4）

关于作者

相关话题

热门标签

推荐作者

櫻之舞

弥枳

m2429

寻找一个思念的角度

野却迷人

我怀念的。

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。