SSAS 中的集成分类器(随机森林、Bagging、Boosting 等)
我正在使用 SSAS (SQL Server 2008 R2) 为缺失 80% 值的数据集开发分类模型。基于树的集成分类器被认为是最好的解决方案(例如随机森林)。
有没有什么好的方法可以将集成分类器添加到 SSAS 中?例如 AdaBoost 或任何其他 Bagging 或 Boosting 分类器?
我知道 SSAS 提供了插件功能,但我还没有遇到过任何人在做任何集成解决方案......更不用说任何您可以直接下载并开始使用的东西了。
如果没有,是否有任何有效的方法来连接SSAS中的各种分类器?我希望我遗漏了一些明显的东西。
I am using SSAS (SQL Server 2008 R2) to develop a classification model for a data set where 80% of values are missing. Ensemble classifiers based on trees are supposedly the best solution (Random Forest for example).
Is there any nice way of adding an ensemble classifier into SSAS? For example an AdaBoost or any other Bagging or Boosting classifier?
I know SSAS provides plug-in functionality, but I have not come across anyone doing any ensemble solutions... Not to mention anything that you can just download and start using.
If not, is there any efficient method to connect various classifiers in SSAS? I hope I am missing something obvious that is there.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
我不太熟悉您所询问的主题,但从技术上讲,在 SSAS 中您可以注册程序集并在 MDX 中使用它们。因此,您可以在 .NET 中对其进行编码并在 SSAS 中使用逻辑。请查看以下 MSDN 页面以获取更多信息,以防这听起来值得探索。
http://technet.microsoft.com/en-us/library/ms175398.aspx
此外,请查看 SSAS 提供的开箱即用的数据挖掘,因为通过使用包含的算法可以实现一些分类目标:
http://technet.microsoft.com/en-us/library/ms175595.aspx
I am not too familiar with the topic you are asking about, but technologically in SSAS you can register assemblies and use them in MDX. Therefore, it could be possible for you to code this in .NET and use the logic in SSAS. Please have a look at the following MSDN page for more information in case this sounds like something worth exploring.
http://technet.microsoft.com/en-us/library/ms175398.aspx
Additionally, have a look at the Data Mining SSAS provides out-of-the box as some classification objectives could be achievable by using the included algorithms:
http://technet.microsoft.com/en-us/library/ms175595.aspx