I have tried numerous ways but have failed to get the right data to the new columns.
The Description column contains a lot of information that I want to extract into separate columns. Example:
Aged Care HCP - 29/6/2021 - Social Support Weekday 7am – 7pm 9:00AM 12:00PM
I want to split the above into the following columns:
Sector | Date | Support Schedule | Day | Day time | Shift Time
Aged Care HCP | 29/06/2021 | Social Support | Weekday | 7am - 7pm | 09:00am - 12:00pm
Can anyone please guide me, or suggest a way to clean the data into the columns above? I have attached the data for reference.
I have tried using Excel Power Query.
Below is the link to the document:
https://www.dropbox.com/scl/fi/4o09adq936cc7cls28sfz/Worksheet.xlsx?dl=0&rlkey=zalomc9pagexd435z014q6qjc
Comments (2)
It's difficult with unstructured data, but here's an example in Power Query of how you can parse some of it out. Depending on the actual data and variety, you will have to tweak it.
Use pandas: https://pypi.org/project/pandas/
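
To make the pandas suggestion concrete, here is a minimal sketch that splits the one example string from the question with a regular expression. Several things are assumptions rather than facts from the workbook: the column is taken to be named `Description`, every row is taken to follow the same layout as the example, and the `Day` values other than `Weekday` are guesses. `PATTERN`, `tidy_time` and `split_description` are hypothetical names made up for this illustration.

```python
import re

import pandas as pd

# Assumed layout, based only on the single example in the question:
# "<Sector> - <d/m/yyyy> - <Support Schedule> <Day> <from> – <to> <shift start> <shift end>"
PATTERN = (
    r"^(?P<Sector>.+?)\s*[-–]\s*"
    r"(?P<Date>\d{1,2}/\d{1,2}/\d{4})\s*[-–]\s*"
    r"(?P<SupportSchedule>.+?)\s+"
    # Only "Weekday" appears in the question; the other day labels are guesses.
    r"(?P<Day>Weekday|Weekend|Saturday|Sunday|Public Holiday)\s+"
    r"(?P<DayStart>\d{1,2}(?::\d{2})?\s*[ap]m)\s*[-–]\s*"
    r"(?P<DayEnd>\d{1,2}(?::\d{2})?\s*[ap]m)\s+"
    r"(?P<ShiftStart>\d{1,2}:\d{2}\s*[ap]m)\s*[-–]?\s*"
    r"(?P<ShiftEnd>\d{1,2}:\d{2}\s*[ap]m)\s*$"
)


def tidy_time(times: pd.Series, width: int = 0) -> pd.Series:
    """Remove inner spaces, lower-case, and left-pad with zeros to `width`."""
    return times.str.replace(r"\s+", "", regex=True).str.lower().str.zfill(width)


def split_description(df: pd.DataFrame, column: str = "Description") -> pd.DataFrame:
    """Return one row per input row; rows that do not match the pattern are all NaN."""
    parts = df[column].str.extract(PATTERN, flags=re.IGNORECASE)
    out = pd.DataFrame(index=df.index)
    out["Sector"] = parts["Sector"].str.strip()
    # Parse as day/month/year, then re-format with zero padding (29/6/2021 -> 29/06/2021).
    out["Date"] = pd.to_datetime(
        parts["Date"], format="%d/%m/%Y", errors="coerce"
    ).dt.strftime("%d/%m/%Y")
    out["Support Schedule"] = parts["SupportSchedule"].str.strip()
    out["Day"] = parts["Day"]
    out["Day time"] = tidy_time(parts["DayStart"]) + " - " + tidy_time(parts["DayEnd"])
    # Width 7 pads "9:00am" to "09:00am" and leaves "12:00pm" unchanged.
    out["Shift Time"] = (
        tidy_time(parts["ShiftStart"], 7) + " - " + tidy_time(parts["ShiftEnd"], 7)
    )
    return out


sample = pd.DataFrame(
    {
        "Description": [
            "Aged Care HCP - 29/6/2021 - Social Support Weekday 7am – 7pm 9:00AM 12:00PM",
            "a row that does not follow the layout",
        ]
    }
)
result = split_description(sample)
print(result.to_string(index=False))
```

For the real file you would probably load the sheet with `pd.read_excel` (which needs an Excel engine such as openpyxl installed) and pass the resulting frame to `split_description`. Rows that come back entirely empty are the ones whose layout differs from the example, and the pattern will need adjusting for them.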