What are the differences between openMosix and PBS?

Posted 2022-07-20 11:09:37 · 174 characters · 17 views · 2 comments

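I spent some time on the openMosix home page, mainly reading the introduction and the FAQ, and one question came up: what is the difference between openMosix and a PBS system? openMosix cannot parallelize a program. It only combines a group of computers into one logical SMP machine so that multiple tasks can run transparently on the cluster, while openMosix itself handles resource scheduling and task management. Can't PBS do all of that just as well?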


Comments (2)

冧九 2022-07-24 15:26:36

Thanks, NNTP. Here are some opinions other people gave me, posted for everyone's reference. I learned a lot from your suggestions.

Not sure exactly what PBS is, but from what you describe it would
run on top of an openMosix cluster. openMosix creates what is called a
Single System Image. This way your jobs do not have to know anything
about a cluster, and they do not have to be submitted to a scheduler; you
just run and forget. The cluster will automatically shift the load
around to get the best CPU usage per job. A scheduler, on the other hand,
requires either the jobs to understand how to do parallel work or
the submitter to pre-split the job to make use of the cluster. That is
a somewhat simplified explanation, but I think it should give you a good
idea of the difference.
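
To make the "pre-split" idea concrete, here is a minimal sketch in Python of how a submitter might cut one large job into independent chunks, each of which could then be handed to a scheduler as its own job. The helper names, the script name and the `--start`/`--stop` flags are hypothetical and only serve as illustration; nothing here is part of PBS or openMosix.

```python
def split_range(total_items, parts):
    """Split the index range [0, total_items) into `parts` contiguous chunks.

    Returns a list of (start, stop) pairs whose sizes differ by at most one.
    """
    if parts <= 0:
        raise ValueError("parts must be positive")
    base, extra = divmod(total_items, parts)
    chunks = []
    start = 0
    for i in range(parts):
        # The first `extra` chunks take one additional item each.
        size = base + (1 if i < extra else 0)
        chunks.append((start, start + size))
        start += size
    return chunks


def job_commands(script, total_items, parts):
    """Build one command line per chunk (hypothetical script and flags)."""
    return [
        f"{script} --start {a} --stop {b}"
        for a, b in split_range(total_items, parts)
    ]
```

Each returned command line is an independent unit of work, which is the property a batch scheduler needs in order to place the pieces on different nodes.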

================================================================

While having similar aims, the way batch systems like the grid engine
achieve them is quite different from the openMosix approach. Roughly
speaking, batch systems start jobs on free nodes by "ssh-ing" to the
given node (or do something a bit more clever but still somehow
equivalent). This is also the reason why they have to bring their own job
management tools (e.g. qstat, qsub etc.) -- jobs are not necessarily
processes on your local node. This also limits the kind of jobs
you can use with batch systems, as they're usually unable to execute
interactive or X applications.

In contrast, openMosix operates at a much lower level. It is a
kernel patch that allows processes running locally on a node to be
migrated to another node transparently at runtime. The last
part is quite important, as it has some interesting consequences:

* If there is a load imbalance among the cluster nodes, it can be
   evened out much more smoothly by simply migrating some jobs to the idle
   machines within seconds. Batch systems can only balance load by
   starting new jobs, which isn't as elegant and, more importantly, will
   fail if the queue is empty.

* You don't have to use special job management tools, as openMosix
   can migrate nearly every process on your node (ok, there are some
   limitations: no multithreading, no shared memory, but for oM's use
   case this is a minor limitation).

* oM does also work with interactive and X applications. For instance
   if you have a graphical fractal generator which is creating high
   load on your login machine, oM could easily migrate it to an idling
   machine without you noticing it.
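
The load-balancing point in the list above can be illustrated with a toy simulation in Python. This is only a sketch under simplifying assumptions: each node runs one job per tick, migration is free and instantaneous, and the function names are made up for this example. It does not model how openMosix or any batch system actually decides where to run things.

```python
def rebalance(nodes):
    """Move one job at a time from the most crowded node to an idle node."""
    while True:
        idle = [jobs for jobs in nodes if not jobs]
        busiest = max(nodes, key=len)
        if not idle or len(busiest) < 2:
            return
        idle[0].append(busiest.pop())


def simulate(placement, migrate):
    """Return the number of ticks until every job has finished.

    placement: list of lists; placement[n] holds the remaining work (in ticks)
               of each job that starts on node n.
    migrate:   if True, rebalance before every tick (migration-style);
               if False, jobs stay where they were started (batch-style).
    """
    nodes = [list(jobs) for jobs in placement]  # copy, leave the input untouched
    ticks = 0
    while any(nodes):
        if migrate:
            rebalance(nodes)
        for jobs in nodes:
            if jobs:
                jobs[0] -= 1      # each node works on one job per tick
                if jobs[0] <= 0:
                    jobs.pop(0)   # job finished
        ticks += 1
    return ticks
```

With four 4-tick jobs all started on the first of four nodes, `simulate([[4, 4, 4, 4], [], [], []], migrate=False)` needs 16 ticks, while the same call with `migrate=True` needs 4, because the idle nodes pick up work that is already running instead of waiting for new jobs to be queued.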

So, oM is, despite some limitations, way more elegant. Give it a try.

================================================================

By using the openMosix kernel alongside other clustering apps, a more
generalised Beowulf-style cluster can be built to cater for all types of
use.

I have used PBS and found it tricky to set up jobs to run quickly across
nodes, but that does NOT mean that you cannot use PBS alongside openMosix.

If a job you schedule for a particular node is openMosix-friendly, then
openMosix could cause that particular job to migrate to a faster free
node, and if your particular job spawns subprocesses that are openMosix-
friendly, then each one of those processes could in fact migrate in order to
get 100% CPU usage from all the openMosix nodes in your network.

i.e.
PBS spawns 10 openMosix friendly processes for 1 node on the network,
openMosix would migrate each of those processes to a different node.
If one node is then used for something else, then openMosix could migrate
the process again to find the maximum CPU use for that process.
Without openMosix, PBS would only allow you to set the same 10 processes to
run across 10 nodes and stay where they run.
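
As a rough illustration of what a job that "spawns openMosix-friendly processes" might look like, the Python sketch below starts several fully independent worker processes, each doing its own CPU-bound work without threads or shared memory in the worker code. The worker body and the function name are invented for this example. On an ordinary kernel all workers simply stay on the local machine; whether and where a migrating kernel would move them is outside the scope of the sketch.

```python
import subprocess
import sys

# Worker source: a small CPU-bound loop that prints its result (illustrative only).
WORKER = "import sys; n = int(sys.argv[1]); print(sum(i * i for i in range(n)))"


def run_workers(count, n):
    """Start `count` independent worker processes and collect their results."""
    procs = [
        subprocess.Popen(
            [sys.executable, "-c", WORKER, str(n)],
            stdout=subprocess.PIPE,
            text=True,
        )
        for _ in range(count)
    ]
    # All workers are already running; now wait for each and parse its output.
    return [int(p.communicate()[0]) for p in procs]
```

Calling `run_workers(10, 1_000_000)` mirrors the "10 processes started on 1 node" case from the example above: the launcher only creates the processes, and any placement decision is left to whatever runs underneath.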

Quote from Andreas Schäfer:
So, oM is, despite some limitations, way more elegant. Give it a try.

*Yes, and use it alongside PBS and other clustering apps.*

I also use DSH from within cron...
My cron scheduler runs on a designated master node; jobs are set using
crontab -e from any node, and DSH is used to run them.

e.g.:

0 * * * * dsh -c -m 192.168.1.20 -m 192.168.1.21 /home/mydir/myscript.sh
This would cause the script 'myscript.sh' to run hourly on nodes 192.168.1.20 and
192.168.1.21 concurrently.
(Please note that /home and /var/spool/cron/ are available over NFS.)
If myscript.sh contains oM friendly processes, then those too will migrate
across the network to other nodes.

迷雾森÷林ヴ 2022-07-22 21:36:42

PBS is only a scheduler. It does nothing except arrange the jobs in the queue according to a user-defined policy. The compute nodes in the HPC cluster "execute" the jobs, processing the parallel code themselves and exchanging computation messages over PVM/MPI.

openMosix is a sophisticated cluster system that "executes" the jobs itself, sharing the computing load between different nodes and different processors (or cores). The openMosix kernel is responsible not only for "scheduling" but also for "executing", and its "scheduling" is very different from PBS's.

I suggest you install both of them, even in a virtual environment; you'll get the "exact" answer by yourself.

[ Last edited by nntp on 2006-7-18 23:39 ]
