如何解析超过30k的电子邮件bodys
我正在使用MS Excel Enterprise,并通过Exchange Server连接到Outlook,并通过电源查询及其各自的主体(Body.Textbody)获取所有电子邮件。现在,我有一个超过15K行的Excel文件,并且电子邮件未以任何特定形式发送。我需要处理所有这些。
第一步是清洁每个身体,以便只剩下最后一个答复。之后,如果列表在列表中,我想删除任何员工的名称(我有一个带有所有名称的列表)。
因此,我将Excel文件转换为CSV文件,然后通过Pandas读取它。现在,我试图使用不同的功能,但它不起作用。例如,我很像使用library email_reply_parser中的函数。如果我尝试在列上使用Apply()使用任何功能,则无效。我觉得我误解了完全错误的问题,我以错误的方式解决了问题?
我有不同的错误,例如
AttributeError: 'float' object has no attribute 'lower'
TypeError: expected string or bytes-like object
AttributeError: 'float' object has no attribute 'replace'
我在此任务的正确路径上吗?
I am using MS Excel Enterprise and connected via Exchange Server to Outlook and got all the E-Mails via Power Query and their respective Body (Body.TextBody). Now I have an excel file with over 15k rows and the E-Mails are not sent in any particular form. I need to process all of them.
The first step is to clean each of the Body so that only the last reply is left. Afterwards I'd want to remove any names of employees (I have an list with all names) if it is in the list.
So i converted the excel file to an csv file and then read it via pandas. Now I'm trying to use different kind of functions but it doesn't work. As an example I'dlike to use a function from the library email_reply_parser. If I try to use any functions with apply() on my column it doesn't work. I feel like I misunderstood something completely wrong about it and I tackled the problem in the wrong way?
I had different errors like
AttributeError: 'float' object has no attribute 'lower'
TypeError: expected string or bytes-like object
AttributeError: 'float' object has no attribute 'replace'
Am I even on the correct path for this task`?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论