This question does not appear to be about a specific programming problem, a software algorithm, or software tools primarily used by programmers. If you believe the question would be on-topic on another Stack Exchange site, you can leave a comment to explain where the question may be able to be answered.
Closed 1 year ago.
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
接受
或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
发布评论
评论(1)
您不应因失衡而影响火车测试分裂。火车测试拆分必须对应于实际测试分布。如果您的问题不平衡 - 您的测试套装也应该是!
您可以更改的是是您使用和/或培训制度的度量,例如:
这两种技术在技术上都将以同样重要的方式对待课程的效果相同,但是您不必以不同的方式“拆分”。
而且,如果您真的坚持以一种奇怪的方式拆分数据,那就可以手工
做
You shouldn't affect train-test split because of imbalance. Train-test split has to correspond to actual testing distribution. If your problem is imbalanced - so should your test set be!
What you can change though is a metric you use and/or training regime, e.g.:
Both these will technically same the same effect of treating classes in an equally important way, but you do not have to "split things" differently.
And if you really insist on splitting data in such an odd way just do it by hand
Which gives