I have two CSV files for the same person: one with their HRV (CSV no. 1) and one with their emotions (CSV no. 2). The first dataset records the HRV values with UNIX timestamps, and the second records the person's emotions every 5 seconds while they were watching themselves.
Since the emotions are captured every five seconds and the HRV values every second, I want to iterate through the rows of the HRV dataset and create a new dataset (or just a new column, whatever works) that contains the mean of each set of 5 rows.
For example, the mean of the first 5 rows corresponds to the first emotion, the mean of the next 5 rows corresponds to the second emotion, and so on.
I want to do that so I can eventually link the two datasets with each other.
Any ideas on how to do that?
Unfortunately, I cannot provide an easily reproducible code snippet, since the dataset is not mine to share. However, I can show with a few screenshots what my datasets look like:
This is the dataset with the HRV values:
[screenshot of the HRV dataset]
And this is the dataset with the emotion values:
[screenshot of the emotion dataset]
It would be good if you could provide data to test with, even if it is not real.
I created some test data with the following code:
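A minimal sketch of what such test data might look like, assuming one HRV reading per second and one emotion label every 5 seconds. The column names (hrv, emotion), the values, the labels, and the start time are all made up for illustration:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)

# Hypothetical UNIX timestamps (seconds), one per HRV reading.
unix_seconds = np.arange(1_640_995_200, 1_640_995_200 + 20)

# HRV values, one per second, with the timestamp as the index.
hrv = pd.DataFrame(
    {"hrv": rng.normal(loc=60, scale=5, size=len(unix_seconds))},
    index=pd.to_datetime(unix_seconds, unit="s"),
)

# Emotion labels, one per 5-second window, also indexed by timestamp.
emotions = pd.DataFrame(
    {"emotion": ["happy", "neutral", "sad", "neutral"]},
    index=pd.to_datetime(unix_seconds[::5], unit="s"),
)

print(hrv.head())
print(emotions)
```

Converting the UNIX timestamps with pd.to_datetime(..., unit="s") gives a DatetimeIndex, which is what time-based resampling needs.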
I think that resample from pandas could be useful here. See the offset aliases in the pandas documentation for the frequency strings it accepts.
Note that in my example the timestamp is the index. I also use the mean as the aggregated value, but I don't know which statistic you would actually like to use. After this, you can simply merge the resampled data with the emotions dataset.
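The resample-then-merge idea could be sketched roughly as follows. This assumes both frames have a DatetimeIndex and that the emotion timestamps fall exactly on the start of each 5-second window; the data and column names are hypothetical:

```python
import numpy as np
import pandas as pd

# Hypothetical data: 20 seconds of HRV readings and 4 emotion labels.
unix_seconds = np.arange(1_640_995_200, 1_640_995_200 + 20)
hrv = pd.DataFrame(
    {"hrv": np.arange(20, dtype=float)},
    index=pd.to_datetime(unix_seconds, unit="s"),
)
emotions = pd.DataFrame(
    {"emotion": ["happy", "neutral", "sad", "neutral"]},
    index=pd.to_datetime(unix_seconds[::5], unit="s"),
)

# Mean HRV over each 5-second window; each bin is labelled by its start time.
hrv_5s = hrv.resample("5s").mean()

# Both frames now share a 5-second timestamp index, so join on the index.
linked = hrv_5s.join(emotions, how="inner")
print(linked)
```

If the real emotion timestamps do not line up exactly with the window starts, an exact join will drop rows, and a nearest-timestamp match such as pd.merge_asof may be a better fit. Swapping .mean() for another aggregation (for example .median()) only changes that one line.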