连接不共享时间戳值的数据集

发布于 2025-01-19 22:46:37 字数 720 浏览 1 评论 0 原文

我有两个不同的 csv 文件，分别对应于一个人的 HRV (csv no1) 和他们的情绪 (csv no2)。第一个数据集使用 UNIX 时间戳来捕获 HRV 值，另一个数据集记录人们每 5 秒观察自己时的情绪。

由于情绪每五秒捕获一次，HRV 值每秒捕获一次，我想迭代 HRV 值数据集的行并创建一个新的数据集（或者只是一个新列，无论有效），其中包含每组 5 行的平均总和。 例如，前 5 行的平均值对应于该情绪，接下来的 5 行对应于其他情绪等。

我想这样做，以便最终能够将它们相互链接。

关于如何做到这一点有什么想法吗？

不幸的是，我无法提供易于复制的代码片段，因为该数据集不是我共享的，但是，我可以通过一些屏幕截图指出我的数据集的外观：

这是具有 HRV 值的数据集：

这是带有情感值的数据集：

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

谈下烟灰 2025-01-26 22:46:37

如果您可以提供数据进行测试，那就很好。
I create data with the next code:

dates = pd.date_range('10-01-2016', periods=50, freq='S')
df = pd.DataFrame({'value': 100 + np.random.randint(-5, 10, 50).cumsum()},index=dates)
df.head()

I think that 重新样本 pandas可能很有用。查看“ nofollow noreferrer”> offset别名。

df.resample('5S').mean().head()

Note that in my example the timestamp is the index, also, I use the mean as the value to pass, but I don't really know what you would like to use.之后，您可以合并数据。

It would be good if you could provide data to test even if it is not real.
I create data with the next code:

dates = pd.date_range('10-01-2016', periods=50, freq='S')
df = pd.DataFrame({'value': 100 + np.random.randint(-5, 10, 50).cumsum()},index=dates)
df.head()

I think that resample from pandas could be useful. Review the Offset aliases in the documentation.

df.resample('5S').mean().head()

Note that in my example the timestamp is the index, also, I use the mean as the value to pass, but I don't really know what you would like to use. After this, you could just merge the data.

回复收藏 0 原文

~没有更多了~