groupby 显示每人每天的时间 pandas

发布于 2025-01-11 21:21:30 字数 903 浏览 0 评论 0原文

我试图按 id、时间戳过滤此数据帧，第三列是条目之间的时间差异。我可以让它显示每个 id 所有日期的总和，但无法让它显示每个 id 每天的总和。

import datetime
import pandas as pd
timestamps = [
    datetime.datetime(2018, 1, 1, 10, 0, 0, 0), # person 1
    datetime.datetime(2018, 1, 1, 10, 0, 0, 0), # person 2
    datetime.datetime(2018, 1, 1, 11, 0, 0, 0), # person 2
    datetime.datetime(2018, 1, 2, 11, 0, 0, 0), # person 2
    datetime.datetime(2018, 1, 1, 10, 0, 0, 0), # person 3
    datetime.datetime(2018, 1, 2, 11, 0, 0, 0), # person 3
    datetime.datetime(2018, 1, 4, 10, 0, 0, 0), # person 3
    datetime.datetime(2018, 1, 5, 12, 0, 0, 0)  # person 3
]
df1 = pd.DataFrame({'person': [1, 2, 1, 3, 2, 1, 3, 2], 'timestamp': timestamps}) 
df1['new'] = df1.groupby('person').timestamp.transform(pd.Series.diff).dropna()
                               
df1.groupby('person')['timestamp','new'].sum()

这只是给我总数，而不是每天。我每天如何组合它们？

原文

I'm trying to filter this dataframe by id, timestamp and my third column is the time diff between entries. I can get it to display the total sum per id for all days but can't make it work to display sum per day per id.

import datetime
import pandas as pd
timestamps = [
    datetime.datetime(2018, 1, 1, 10, 0, 0, 0), # person 1
    datetime.datetime(2018, 1, 1, 10, 0, 0, 0), # person 2
    datetime.datetime(2018, 1, 1, 11, 0, 0, 0), # person 2
    datetime.datetime(2018, 1, 2, 11, 0, 0, 0), # person 2
    datetime.datetime(2018, 1, 1, 10, 0, 0, 0), # person 3
    datetime.datetime(2018, 1, 2, 11, 0, 0, 0), # person 3
    datetime.datetime(2018, 1, 4, 10, 0, 0, 0), # person 3
    datetime.datetime(2018, 1, 5, 12, 0, 0, 0)  # person 3
]
df1 = pd.DataFrame({'person': [1, 2, 1, 3, 2, 1, 3, 2], 'timestamp': timestamps}) 
df1['new'] = df1.groupby('person').timestamp.transform(pd.Series.diff).dropna()
                               
df1.groupby('person')['timestamp','new'].sum()

This just gives me the total, not per day. How do I combine them per day?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

小霸王臭丫头 2025-01-18 21:21:30

您可以在分组条件中包含“时间戳”列的日期部分，如下所示：

>>> df1.groupby(["person", df1.timestamp.dt.date])["new"].sum()

此外，如果您愿意，您可以使用时间戳中的日期创建一个新列，然后按该列进行分组：

>>> df1["date"] = df1["timestamp"].dt.date
>>> df1.groupby(["person", "date"])["new"].sum()

或者，您可以< code>.reset_index() 最后将您的组值包含在新列中。

You can just include the date part of the "timestamp" column in your groupby condition like this:

>>> df1.groupby(["person", df1.timestamp.dt.date])["new"].sum()

Also, if you prefer, you could create a new column with the date from the timestamp and then group by that column:

>>> df1["date"] = df1["timestamp"].dt.date
>>> df1.groupby(["person", "date"])["new"].sum()

Optionally, you can .reset_index() at the end to contain your group values in new columns.

回复收藏 0 原文

~没有更多了~

关于作者

高速公鹿

暂无简介

文章

27 人气

关注发私信

紫罗兰の梦幻

文章 0 评论 0

关注

-2134

文章 0 评论 0

关注

liuxuanli

文章 0 评论 0

关注

意中人

文章 0 评论 0

关注

○愚か者の日

文章 0 评论 0

关注

xxhui

文章 0 评论 0

友情链接

文江博客

groupby 显示每人每天的时间 pandas

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

紫罗兰の梦幻

-2134

liuxuanli

意中人

○愚か者の日

xxhui

友情链接

groupby 显示每人每天的时间 pandas

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

紫罗兰の梦幻

-2134

liuxuanli

意中人

○愚か者の日

xxhui

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。