How to efficiently run lots of subprocesses in Python?
Basic setup:
I am using a Python script for automatic testing of a programming project that I am working on. In the test, I run my executable with lots of different options and compare the results with previous runs. The testing takes quite a lot of time since I have roughly 600k different tests to run.
At the moment, I have split my script into two parts: a test module that grabs tests from a job queue and places results in a result queue, and a main module that creates the job queue and then checks the results. This lets me experiment with several test processes/threads, which so far has not improved testing speed at all (I am running this on a dual-core computer; I would expect more test processes to work better on a quad-core).
In the test module, I create a command string that I then execute using
subprocess.Popen(cmd, shell=True, stdout=subprocess.PIPE)
I then read the results from the pipe and place them in the result queue.
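In outline, that worker loop looks something like the sketch below (a hypothetical reconstruction of the setup described above, not the actual script; it also shows a common alternative to shell=True, passing an argument list via shlex.split, which saves the extra /bin/sh process that shell=True forks for every test):

    import shlex
    import subprocess
    from multiprocessing import Process, Queue

    def worker(job_queue, result_queue):
        """Pull command strings off the job queue, run them, queue the output."""
        for cmd in iter(job_queue.get, None):  # a None in the queue means "stop"
            # An argument list with shell=False avoids the extra /bin/sh
            # process that shell=True would spawn for every single test.
            proc = subprocess.Popen(shlex.split(cmd), stdout=subprocess.PIPE)
            output, _ = proc.communicate()
            result_queue.put((cmd, proc.returncode, output))

    if __name__ == '__main__':
        jobs, results = Queue(), Queue()
        workers = [Process(target=worker, args=(jobs, results)) for _ in range(4)]
        for w in workers:
            w.start()
        jobs.put('/bin/echo hello')   # placeholder for the real test commands
        print(results.get())          # (cmd, returncode, output)
        for w in workers:
            jobs.put(None)            # one sentinel per worker
        for w in workers:
            w.join()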
Question:
Is this the most efficient way of running lots and lots of command strings on a multi-core system? Every Popen I do creates a new process, which seems like it might create quite a bit of overhead, but I can't really think of a better way to do it.
(I am currently using Python 2.7, in case this matters.)
EDIT:
OS is Linux
The subprocesses that I spawn are commandline C-executables with arguments.
Answers (3):
You could have a look at the multiprocessing module, especially the Pool part.
It will let you launch as many processes as you want (by default, as many as there are CPU cores).
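A minimal sketch of what that could look like for this question (the ./mytest command strings are placeholders, not from the question):

    import shlex
    import subprocess
    from multiprocessing import Pool

    def run_one(cmd):
        """Run a single test command and return its output and exit code."""
        proc = subprocess.Popen(shlex.split(cmd), stdout=subprocess.PIPE)
        output, _ = proc.communicate()
        return cmd, proc.returncode, output

    if __name__ == '__main__':
        cmds = ['./mytest --option %d' % i for i in range(1000)]  # placeholders
        pool = Pool()  # defaults to one worker per CPU core
        for cmd, rc, out in pool.imap_unordered(run_one, cmds):
            pass  # compare `out` against the stored result for `cmd` here
        pool.close()
        pool.join()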
First, try measuring the testing script/scheme with a null executable. That way you can see how much overhead the process spawning has relative to the actual testing time. Then we have some real data to act on.
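For example, something like this gives a ballpark figure (a sketch; /bin/true stands in for the null executable, which exists on Linux per the question's edit):

    import subprocess
    import time

    N = 1000
    start = time.time()
    for _ in range(N):
        # /bin/true does nothing and exits immediately, so this loop
        # measures pure process-spawning overhead.
        proc = subprocess.Popen(['/bin/true'], stdout=subprocess.PIPE)
        proc.communicate()
    elapsed = time.time() - start
    print('spawn overhead: %.2f ms per process' % (1000.0 * elapsed / N))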
Adding a batch mode to your exe (that reads command lines off a file and does that work) is probably a good idea if the amount of work is small compared to the time it takes to load and shut down your process. Plus, it will help you find memory leaks. :)
By moving stuff out of main(), this isn't so hard to do.
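The Python side of such a batch mode could then shrink to one spawn per batch, roughly like this (a sketch; the --batch flag and the ./mytest name are hypothetical and would have to be added to the C program):

    import os
    import subprocess
    import tempfile

    def run_batch(cmd_lines):
        """Write all argument lines to a file and run the exe once over them."""
        with tempfile.NamedTemporaryFile(mode='w', delete=False) as f:
            f.write('\n'.join(cmd_lines))
            batch_file = f.name
        try:
            # '--batch' is a hypothetical flag: the exe would read one
            # argument line per row and run each test in-process.
            proc = subprocess.Popen(['./mytest', '--batch', batch_file],
                                    stdout=subprocess.PIPE)
            output, _ = proc.communicate()
            return output
        finally:
            os.unlink(batch_file)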
In the end I created python C-bindings (with SWIG) directly to the code that I wanted to test. It turned out to be several hundred times faster than starting subprocesses.
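With bindings in place, the test loop becomes plain function calls instead of process spawns, along these lines (a sketch; mytest and run_test are hypothetical names for the SWIG-generated module and the wrapped C entry point):

    import mytest  # hypothetical name for the SWIG-generated wrapper module

    def compare_with_previous(options, result):
        """Placeholder: diff `result` against the stored result for `options`."""
        pass

    all_test_options = [('--size', str(n)) for n in range(600000)]  # illustrative

    for options in all_test_options:
        # A direct in-process C call: no fork/exec, no shell, no pipe.
        result = mytest.run_test(*options)  # run_test is a hypothetical wrapper
        compare_with_previous(options, result)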