当前位置：文江博客话题详情

pandas fillna

熊猫：填充列中具有同一组的值

发布于 2025-02-10 06:12:48 字数 357 浏览 3 评论 0 原文

我需要填充列中的无空值，而不是同一组的零值。

我尝试使用变换与模式使用，但它没有完成工作。

test['col2']=test['col2'].transform(lambda x:x.fillna(x.mode())

原文

I need to fill null values in the column with not null value of the same group.

Example

Desired Outcome

I tried using transform with mode, but it didn't do the job.

test['col2']=test['col2'].transform(lambda x:x.fillna(x.mode())

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

无语# 2025-02-17 06:12:48

使用使用模式，如果存在，则选择第一个值，else none ，最后传递给

s = df.groupby('col1')['col2'].transform(lambda x: next(iter(x.mode()), None))
df['col2'] = df['col2'].fillna(s)
print (df)
  col1   col2
0  gr1  test1
1  gr2  test2
2  gr1  test1
3  gr1  test1
4  gr2  test2
5  gr3  test3
6  gr2  test2

Use GroupBy.transform with mode and select first value if exist, else None, last pass to Series.fillna:

s = df.groupby('col1')['col2'].transform(lambda x: next(iter(x.mode()), None))
df['col2'] = df['col2'].fillna(s)
print (df)
  col1   col2
0  gr1  test1
1  gr2  test2
2  gr1  test1
3  gr1  test1
4  gr2  test2
5  gr3  test3
6  gr2  test2

回复收藏 0 原文

痴骨ら 2025-02-17 06:12:48

我将使用 .assign 和 .apply 遍历每一行，然后找到模式：

import pandas
import numpy

df = pandas.DataFrame({
    'col1':['gr1', 'gr2', 'gr1', 'gr1', 'gr2', 'gr3', 'gr2', numpy.nan], 
    'col2':['test1', 'test2', 'test', numpy.nan, numpy.nan, 'test3', numpy.nan, numpy.nan],
})

def fill_value(x):
    if x['col2'] is numpy.nan:
        mode = df.loc[df['col1'] == x['col1'], 'col2'].mode()
        default = numpy.nan
        return mode.iloc[0] if not mode.empty else default
    else:
        return x['col2']
    
df = df.assign(col2=df.apply(fill_value, axis=1))

输出：

  col1   col2
0  gr1  test1
1  gr2  test2
2  gr1   test
3  gr1   test
4  gr2  test2
5  gr3  test3
6  gr2  test2
7  NaN    NaN

I would use .assign and .apply to go through each row and then find the mode:

import pandas
import numpy

df = pandas.DataFrame({
    'col1':['gr1', 'gr2', 'gr1', 'gr1', 'gr2', 'gr3', 'gr2', numpy.nan], 
    'col2':['test1', 'test2', 'test', numpy.nan, numpy.nan, 'test3', numpy.nan, numpy.nan],
})

def fill_value(x):
    if x['col2'] is numpy.nan:
        mode = df.loc[df['col1'] == x['col1'], 'col2'].mode()
        default = numpy.nan
        return mode.iloc[0] if not mode.empty else default
    else:
        return x['col2']
    
df = df.assign(col2=df.apply(fill_value, axis=1))

output:

  col1   col2
0  gr1  test1
1  gr2  test2
2  gr1   test
3  gr1   test
4  gr2  test2
5  gr3  test3
6  gr2  test2
7  NaN    NaN

回复收藏 0 原文

~没有更多了~

关于作者

故事未完

暂无简介

文章

27 人气

关注发私信

陪我终i

文章 0 评论 0

关注

别忘他

文章 0 评论 0

关注

野心澎湃

文章 0 评论 0

关注

蒲公英的约定

文章 0 评论 0

关注

。

文章 0 评论 0

关注

旧时模样

文章 0 评论 0

友情链接

文江博客

熊猫：填充列中具有同一组的值

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

陪我终i

别忘他

野心澎湃

蒲公英的约定

。

旧时模样

友情链接

熊猫：填充列中具有同一组的值

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

陪我终i

别忘他

野心澎湃

蒲公英的约定

。

旧时模样

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。