Great Expectations: listing all unexpected values
I ran the Great Expectations check expect_column_values_to_be_unique on one of my columns, and it produced the result below. There are 62 duplicates in total, but the output list returns only 20 elements. How can I retrieve all duplicate records in that column?

df.expect_column_values_to_be_unique('A')

{
  "exception_info": null,
  "expectation_config": {
    "expectation_type": "expect_column_values_to_be_unique",
    "kwargs": {
      "column": "A",
      "result_format": "BASIC"
    },
    "meta": {}
  },
  "meta": {},
  "success": false,
  "result": {
    "element_count": 100,
    "missing_count": 0,
    "missing_percent": 0.0,
    "unexpected_count": 62,
    "unexpected_percent": 62.0,
    "unexpected_percent_nonmissing": 62.0,
    "partial_unexpected_list": [
      37,
      62,
      72,
      53,
      22,
      61,
      95,
      21,
      64,
      59,
      77,
      53,
      0,
      22,
      24,
      46,
      0,
      16,
      78,
      60
    ]
  }
}
Comments (2)
You're currently passing result_format as BASIC. To get the level of detail you're looking for, pass result_format for this Expectation as COMPLETE instead; that returns the full list of unexpected values. See the Great Expectations documentation on result_format for more.
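
To illustrate the difference between BASIC and COMPLETE: a BASIC result carries only partial_unexpected_list, a sample capped at 20 values by default, which is why 62 unexpected values show up as 20 entries. A COMPLETE result additionally carries unexpected_list with every unexpected value. The sketch below is a hypothetical helper, not part of Great Expectations. It assumes the result is available in plain-dict form, as printed in the question, and the two sample dicts are made up and shortened for illustration. The call shown in the comment assumes the same dataset-style API used in the question.

```python
# Hypothetical helper, not part of Great Expectations.
# Assumed call that yields a COMPLETE result (same API style as the question):
#     result = df.expect_column_values_to_be_unique("A", result_format="COMPLETE")


def unexpected_values(result):
    """Return (values, is_complete) from a validation result in dict form."""
    body = result["result"]
    if "unexpected_list" in body:
        # COMPLETE format: every unexpected value is present.
        return body["unexpected_list"], True
    # BASIC / SUMMARY format: only a capped sample is available.
    return body.get("partial_unexpected_list", []), False


# Made-up miniature results shaped like the output in the question
# (the sample is cut to 2 entries here purely for brevity).
basic_result = {
    "success": False,
    "result": {"unexpected_count": 4, "partial_unexpected_list": [7, 7]},
}
complete_result = {
    "success": False,
    "result": {
        "unexpected_count": 4,
        "partial_unexpected_list": [7, 7],
        "unexpected_list": [7, 7, 9, 9],
    },
}

for name, res in (("BASIC", basic_result), ("COMPLETE", complete_result)):
    values, is_complete = unexpected_values(res)
    total = res["result"]["unexpected_count"]
    print(f"{name}: {len(values)} of {total} values, complete={is_complete}")
```

Comparing the length of the returned list with unexpected_count is a quick way to tell whether you are looking at the full list or only the sample.
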
I think you are using "show" without parameters. By default this only shows the first 20 rows. If you wish to see more, you need to pass in how many rows you want to see, for example 200 rows with truncation of the column contents turned off.