How can I merge multiple data frames from csv files when the ID column is implied?
I'd like to merge a bunch of data frames together (because it seems many operations are easier if you're only dealing with one, but correct me if I'm wrong).
Currently I have one data frame like this:
ID, var1, var2
A, 2, 2
B, 4, 5
.
.
Z, 3, 2
Each ID is on a single row with several single measurements.
I also have one csv file per ID with repeated measurements, like:
filename = ID_B.csv
time, var4, var5
0, 1, 2
1, 4, 5
2, 1, 6
...
What I'd like is:
ID, time, var1, var2, var4, var5
...
B, 0, 4, 5, 1, 2
B, 1, 4, 5, 4, 5
B, 2, 4, 5, 1, 6
...
I don't really care about the column order. The only solution I can think of is to add the ID column to each csv file, then loop through them calling merge() several times. Is there a more elegant approach?
Comments (1)
My understanding is that you need to extract the ID from the filename, and then merge the imported csv with the existing dataframe.
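A minimal sketch of that idea in base R, under some assumptions that are not in the question: the files follow the pattern ID_<id>.csv, they all share the same columns, and they sit in one directory. The names main, csv_dir, read_with_id, repeated and combined are hypothetical, and the sample data is written to a temporary folder only so the example runs on its own. Rather than calling merge() once per file, it stacks the files first and merges a single time:

```r
# Hypothetical one-row-per-ID data frame, mirroring the question.
main <- data.frame(ID = c("A", "B", "Z"),
                   var1 = c(2, 4, 3),
                   var2 = c(2, 5, 2),
                   stringsAsFactors = FALSE)

# Write one sample per-ID file into a scratch directory so the sketch is
# self-contained; in practice the csv files would already exist on disk.
csv_dir <- file.path(tempdir(), "repeated_measures")
dir.create(csv_dir, showWarnings = FALSE)
write.csv(data.frame(time = 0:2, var4 = c(1, 4, 1), var5 = c(2, 5, 6)),
          file.path(csv_dir, "ID_B.csv"), row.names = FALSE)

# Find every file named ID_<something>.csv in that directory.
files <- list.files(csv_dir, pattern = "^ID_.*\\.csv$", full.names = TRUE)

# Read one file and attach the ID recovered from its file name.
read_with_id <- function(path) {
  d <- read.csv(path, stringsAsFactors = FALSE)
  d$ID <- sub("^ID_(.*)\\.csv$", "\\1", basename(path))
  d
}

# Stack all repeated-measure files, then merge once on ID.
repeated <- do.call(rbind, lapply(files, read_with_id))
combined <- merge(main, repeated, by = "ID")
print(combined)
```

By default merge() keeps only IDs present in both inputs, so IDs without a csv file drop out; passing all.x = TRUE would keep them with NA in the repeated-measure columns.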
Result: