“集群”的作用是什么?与 MPI 相关的软件?
我对集群实现(“Beowulf 集群”)与 MPI 等通信协议的关系有点困惑。使用 OpenMPI 等工具设置“集群”需要哪些软件组件?
I'm a little confused regarding how a cluster implementation ("Beowulf cluster") relates to a communication protocol such as MPI. What software components are needed to set up a "cluster" using something like OpenMPI?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
如您所知,集群是一组联网的计算机。当你有这样的配置时,通常会安装和使用以下内容:
等等。所有这些东西完全是 MPI 的附加内容。 MPI只是进程之间的通信通道。 MPI 本身并不能“创建集群”。
A cluster, as you know, is a bunch of computers networked together. When you have such configuration, you normally install and use the following:
And much more. All this stuff is completely additional to MPI. MPI is just a communication channel between processes. MPI alone does not "make the cluster".
正如您所指出的,MPI 仅提供进程之间的通信。如果没有几个人使用集群,那么您实际上不需要更多(除了一些在所有节点上启动程序的脚本)。
但遗憾的是,实际上我们很少拥有自己的个人集群。这时候你就需要一个调度程序了。调度程序通常处理作业提交和资源分配,还可能处理优先级、用户管理和其他事情,以使您的生活更轻松。
MPI, as you noted, will only provide communication between processes. If there will not be several people using the cluster, you really need nothing more (apart from some script to launch your program on all the nodes).
But, in reality we sadly seldom have our personal cluster. That's when you need a scheduler. The scheduler typically handles job submissions and resource allocation, possibly also taking care of prioritization, user management and other things to make your life easier.
查看 Oracle Grid Engine(又称为 Sun Grid Engine 或 CODINE)。
Take a look at Oracle Grid Engine (nee Sun Grid Engine or CODINE).