如何限制同时运行的 PBS 作业的数量
我有一个 64 节点集群,运行 PBS Pro。如果我提交数百个作业,我可以同时运行 64 个作业。这很棒,除非所有 64 个作业恰好都接近 I/O 限制,并且正在读/写同一磁盘。在这种情况下,我希望仍然能够提交所有作业,但在给定时间最多运行(例如)10 个作业。 qsub 是否有一个咒语允许我这样做,而无需对集群的 PBS 服务器进行管理访问?
I have a 64-node cluster, running PBS Pro. If I submit many hundreds of jobs, I can get 64 running at once. This is great, except when all 64 jobs happen to be nearly I/O bound, and are reading/writing to the same disk. In such cases, I'd like to be able to still submit all the jobs, but have a max of (say) 10 jobs running at a given time. Is there an incantation to qsub that will allow me to do such, without having administrative access to the cluster's PBS server?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
在 TORQUE 中,您可以通过在作业数组上设置插槽限制来实现此目的,只要您可以将作业排列为数组即可:
将限制其中 10 个作业同时运行。如果 PBSPro 有与此等效的工具,那么您可以使用它。
In TORQUE you can do this by setting a slot limit on a job array, as long as you can arrange the jobs as an array:
would limit 10 of them to running at once. If PBSPro has an equivalent to this then you can use that.
你可以让他们相互依赖。
或安排它们在不同的时间点开始。
否则,您的管理员可以减少您可以同时运行的同时作业的数量。
you could make them dependent on each other.
or schedule them to start at different timepoints.
otherwise, your admin can reduce the number of simultaneous jobs you can run at the same time.