Is there a data mining library that integrates easily with Free Pascal?
I have been experimenting with different approaches to pattern matching, analysis, and prediction of time-series data.
Since I have no professional experience in data mining or related fields, I came up with my own methods from scratch.
After a few months of sporadically skimming data mining articles (the few that I understood), I realized that my methods are very basic.
For example, I implemented pattern matching with a multidimensional k-Nearest Neighbor approach.
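To make the idea concrete, here is a minimal sketch of k-nearest-neighbor pattern matching on a time series. It is written in Python for brevity rather than Free Pascal, and it is not the code described in this post: the function name `knn_forecast`, the sliding-window embedding, the Euclidean distance, and the averaging of successor values are all illustrative assumptions.

```python
import math


def knn_forecast(series, window=3, k=2):
    """Predict the next value from the k past windows closest to the latest one.

    Hypothetical helper for illustration only; not taken from the original post.
    """
    if len(series) < window + k:
        raise ValueError("series too short for the given window and k")

    # The most recent `window` values form the query pattern.
    query = series[-window:]

    candidates = []
    # Each historical window is treated as a point in `window`-dimensional
    # space. The range stops early so every window has a known successor
    # and the query window itself is never compared with itself.
    for start in range(len(series) - window):
        past = series[start:start + window]
        distance = math.dist(query, past)  # Euclidean distance (Python 3.8+)
        candidates.append((distance, series[start + window]))

    # Keep the k closest patterns and average the values that followed them.
    candidates.sort(key=lambda pair: pair[0])
    nearest = candidates[:k]
    return sum(successor for _, successor in nearest) / len(nearest)
```

This brute-force scan compares the query against every historical window, so each query costs time proportional to the number of rows times the window length. At 500,000+ rows with real-time constraints, an index structure or a dimensionality-reduction step would likely be needed, which is part of the motivation for looking at an established library.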
I am only now starting to understand that there are common techniques I should be using, such as decision trees, principal component analysis, and covariance matrices.
I have been experimenting with RapidMiner, which is a GUI data mining tool. RapidMiner lets you drag-and-drop various implementations of sophisticated techniques and connect them together without writing a line of code. It has been an awesome learning tool for me.
[Screenshot: RapidMiner GUI]
RapidMiner has an API but unfortunately it is in Java, and all of my current code is in Free Pascal and MySQL.
I am looking for some way to integrate RapidMiner into my application, so I could experiment with different data processing methods through the RapidMiner GUI and use them seamlessly with my current code.
If that is not feasible, I would settle for another data mining package, as long as it is easy to integrate into my current Free Pascal code.
I looked into R, which seems about right, but it does not seem easy to integrate with Free Pascal either.
For clarity, my data sets are fairly large (500,000+ rows) and computation must occur in real time. The software runs on Windows 7.
I don't know the package, but a few general options: