聚类遗传编程/算法
最近我一直在尝试了解hadoop集群是否可以用于遗传算法/编程工作。我一直在阅读有关 hadoop 的内容,并且了解它可以并行化大型数据集的处理。就我而言,我不会拥有大型数据集..但我发现真正有用的是hadoop 的并行化功能。所以,我的问题是像 hadoop 这样的程序是否可以用于评估或处理遗传算法/编程,我认为它会更面向处理而不是面向 I/O ?
Recently I have been trying to understand if hadoop clusters can be used for genetic algorithms/programming jobs. I've been reading about hadoop and I understand that it can parallize processing of large datasets. in my case, I wouldn't have large data sets.. but what i would find really useful are the parallelizing capabilities of hadoop. So, my question is whether a program like hadoop can be used for evaluating or processing genetic algorithms/programming which I think will be more processing oriented as opposed I/O oriented?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
如您所知,最大的生物信息学算法基于组合、切割、拼接、编辑距离、神经网络等,以及像 dfs(用于部分摘要)那样的回溯。如果您让它们像特定的映射减少作业一样分布实例或长度 ex :
或者如果您想将 Bio-info 算法与 hadoop 的架构进行比较,您可以在此链接中找到简单的算法>>> http://matrixsust.blogspot.com/2011/01/introduction-to -bioinformatics.html
希望有帮助。
As you know maximum Bio-informatics algo is based on Combination,cutting,splicing,edit distance,Neural network,..etc and also backtracking like dfs (for partial digest).If you make them distribute like a map-reduce job for particular instance or length ex :
Or if you want to compare Bio-info algo with the architecture of hadoop you can find simple algo in this link >> http://matrixsust.blogspot.com/2011/01/introduction-to-bioinformatics.html
Hope it helps.