Multiprocessing Pool hangs and I can't break out of the application
I'm sure this is a rookie mistake, but I can't figure out what I'm doing wrong with multiprocessing. I have this code (which just sits there and does nothing):
from multiprocessing import Pool

if __name__ == '__main__':
    pool = Pool(processes=4)
    for i, x in enumerate(data):
        pool.apply_async(new_awesome_function, (i, x))
    pool.close()
    pool.join()
data is a list ([1, 2, 3, 4, 5]), and I'm trying to send each item in the list off to be processed on multiple CPUs. But when I wrap my working command in a function and run it through the code above, nothing happens (when I call the function by itself, without the code above, it works fine). So I think I'm using multiprocessing wrong (although I took the examples from sites). Any suggestions?
Update: I noticed that when it freezes I can't even break out of it with Ctrl-C, which always works to get out of my buggy programs. I looked at python2.5 multiprocessing Pool, tried to follow the advice there, and added the import inside my if statement, but no luck.
Update 2: I'm sorry; I just realized, thanks to the answer below, that the command works, but it doesn't seem to terminate the program or let me force quit.
Multiprocessing isn't threading.
You're probably doing something sort of like this:
After you run the script, data has not changed. This is because multiprocessing uses copies of your program. Your functions are being run, but they are run in copies of your program and thus have no effect on your original program.
In order to make use of multiprocessing you need to explicitly communicate from one process to another. With threading everything is shared, but with multiprocessing nothing is shared unless you explicitly share it.
The simplest way is to use return values:
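The example that followed here is also missing; a sketch of the return-value approach, assuming a made-up worker function, could be:

```python
from multiprocessing import Pool

def add_one(x):
    return x + 1  # the return value travels back to the parent process

if __name__ == '__main__':
    pool = Pool(processes=4)
    # apply_async hands back AsyncResult objects; .get() retrieves each value
    async_results = [pool.apply_async(add_one, (i,)) for i in [1, 2, 3, 4, 5]]
    pool.close()
    pool.join()
    print([r.get() for r in async_results])  # [2, 3, 4, 5, 6]
```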
See the Python documentation (http://docs.python.org/library/multiprocessing.html) for other methods such as Queues, Pipes, and Managers. What you can't do is change your program's state and expect that to work.
I don't know what database you are using, but chances are you can't share database connections between your processes like that.
On Linux, fork() is used, which makes a copy of everything in memory when you start the subprocess. However, things like sockets, open files, and database connections won't work properly unless specifically designed to do so. On Windows, fork() is unavailable, so the subprocess re-runs your script. In your case that would be really bad, because it would drop everything again; you prevent that by putting that code inside the if __name__ == '__main__': bit. You should be able to reopen the database connections inside my_awesome_function and thus interact with the database successfully.
Truth be told, you aren't going to gain any speed doing this. In fact, I expect it to be slower. Databases are really, really slow, so your process is going to spend most of its time waiting on the database. Now you just have multiple processes waiting on the database, and that really will not improve the situation.
But databases are for storing things. As long as you are doing processing, you should really do it in your code before hitting the database. You are basically using the database as a set, and your code would be much nicer using a Python set. If you really need to put that stuff in a database, do it at the end of your program.
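The question never names its database or shows my_awesome_function, so as a sketch of "reopen the connection inside the worker," here is a hypothetical version using sqlite3 as a stand-in:

```python
import sqlite3
from multiprocessing import Pool

DB_PATH = 'example.db'  # hypothetical path; the question never names its database

def new_awesome_function(i, x):
    # Open a fresh connection inside the worker process: connections
    # inherited across fork() (or re-created on Windows) must not be shared.
    conn = sqlite3.connect(DB_PATH)
    with conn:  # commits on success
        conn.execute('INSERT INTO results (idx, value) VALUES (?, ?)', (i, x))
    conn.close()

if __name__ == '__main__':
    # One-time setup in the parent, before any workers start
    conn = sqlite3.connect(DB_PATH)
    conn.execute('CREATE TABLE IF NOT EXISTS results (idx INTEGER, value INTEGER)')
    conn.close()

    data = [1, 2, 3, 4, 5]
    pool = Pool(processes=4)
    for i, x in enumerate(data):
        pool.apply_async(new_awesome_function, (i, x))
    pool.close()
    pool.join()
```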
Your code seems to work for me:
gave me:
What makes you think it doesn't work?
Edit: Try to run this and look at the output:
Mine is:
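This answer's code and output blocks did not survive extraction. A minimal sketch of an equivalent demonstration (new_awesome_function is a stand-in for the asker's unshown function) would be:

```python
import os
from multiprocessing import Pool

def new_awesome_function(i, x):
    # Print from the worker so each task's process is visible in the output
    print('task %d ran in process %d with %r' % (i, os.getpid(), x))
    return x * 2

if __name__ == '__main__':
    data = [1, 2, 3, 4, 5]
    pool = Pool(processes=4)
    results = [pool.apply_async(new_awesome_function, (i, x))
               for i, x in enumerate(data)]
    pool.close()
    pool.join()
    print([r.get() for r in results])  # [2, 4, 6, 8, 10]
```

The per-task print lines show the work being spread across worker processes, and the final list shows the results collected back in the parent.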