我正在使用多处理池来updte矩阵值,但是值不更改
我有一个简单的函数,该功能采用矩阵“ H”和更多参数,并在矩阵的单列顶部添加了一些计算的向量。然后,我将该函数应用于矩阵的每一列 - 我有一个代码,该代码依次执行此功能,那里的一切都很好。但是,由于列的操作是独立的,所以我想并行进行。但是,当我将相同的函数应用于“ yuptrocessing.pool()”时,矩阵的值不会从初始值中变化。
Bellow使用顺序和并行实现的脚本。最后,矩阵“ H1”和“ H2”的值应该是相同的,但不是,实际上'H2'的值与开始时具有相同的值(也就是说,作为矩阵“ deltas” )。
我不是程序员,也没有多处处理库的经验,所以也许我在这里做一些愚蠢的事情...
from multiprocessing import Pool
from multiprocessing import set_start_method
import time
import numpy as np
from functools import partial
def h_single_ctr(ctr,C,keys1,bs1,h):
indices1 = np.where(keys1[:,1]==ctr)[0]
indices2 = np.where(keys1[:,0]==ctr)[0]
h[:,ctr] += (C[keys1[:,0][indices1]]).dot(bs1[indices1])
h[:,ctr] += (C[keys1[:,1][indices2]]).dot(bs1[indices2])
if __name__ == '__main__':
m,n = 100,15000
deltas = np.random.rand(n,m)
C = np.random.rand(m)
mbs = 150
bs1 = np.random.rand(mbs,n)
keys1 = np.random.randint(m,size=(mbs,2))
# Sequential
tic = time.time()
h1 = 0. + deltas
for ctr in range(m):
# Update each column of a matrix h1, using function h_single_ctr
h_single_ctr(ctr,C,keys1,bs1,h1)
toc = time.time()
print('Done in {:.4f} seconds'.format(toc-tic))
# Multiprocessing / Pool
tic = time.time()
h2 = 0. + deltas
p = Pool(5)
# Update each column of a matrix h2, using function h_single_ctr, in parallel
p.map(partial(h_single_ctr,C=C,keys1=keys1,bs1=bs1,h=h2), range(m))
p.close()
p.join()
toc = time.time()
print('Done in {:.4f} seconds'.format(toc-tic))
print(np.linalg.norm(h1-h2))
I have a simple function that takes a matrix 'h' and some more arguments, and adds some computed vector on top of a single column of the matrix. Then I apply that function for each column of a matrix - I have a code that does this sequentially and everything is fine there; but, since the column-wise operations are independent I want to do it in parallel. However, when I apply the same function with 'multiprocessing.Pool()', the values of the matrix don't change from the initial value.
Bellow goes a script with both sequential and parallel implementation. In the end, the values of matrices 'h1' and 'h2' should be the same, but they are not, and actually 'h2' has the same value that it had in the beginning (that is, as a matrix 'deltas').
I am not a programmer, and don't have much experience with multiprocessing library, so maybe I am doing something stupid here...
from multiprocessing import Pool
from multiprocessing import set_start_method
import time
import numpy as np
from functools import partial
def h_single_ctr(ctr,C,keys1,bs1,h):
indices1 = np.where(keys1[:,1]==ctr)[0]
indices2 = np.where(keys1[:,0]==ctr)[0]
h[:,ctr] += (C[keys1[:,0][indices1]]).dot(bs1[indices1])
h[:,ctr] += (C[keys1[:,1][indices2]]).dot(bs1[indices2])
if __name__ == '__main__':
m,n = 100,15000
deltas = np.random.rand(n,m)
C = np.random.rand(m)
mbs = 150
bs1 = np.random.rand(mbs,n)
keys1 = np.random.randint(m,size=(mbs,2))
# Sequential
tic = time.time()
h1 = 0. + deltas
for ctr in range(m):
# Update each column of a matrix h1, using function h_single_ctr
h_single_ctr(ctr,C,keys1,bs1,h1)
toc = time.time()
print('Done in {:.4f} seconds'.format(toc-tic))
# Multiprocessing / Pool
tic = time.time()
h2 = 0. + deltas
p = Pool(5)
# Update each column of a matrix h2, using function h_single_ctr, in parallel
p.map(partial(h_single_ctr,C=C,keys1=keys1,bs1=bs1,h=h2), range(m))
p.close()
p.join()
toc = time.time()
print('Done in {:.4f} seconds'.format(toc-tic))
print(np.linalg.norm(h1-h2))
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
您可以为每个过程创建一个新变量,然后将它们全部添加到全局h之上。请注意,您需要具有一维数组,而不是该过程中的矩阵。
You can create a new variable for each process and then add them all on top of your global h. Notice that you need to have a one-dimensional array, and not a matrix within the process.