并行 Cholesky 分解用于训练机器学习算法

发布于 2024-09-07 12:51:42 字数 987 浏览 13 评论 0原文

我正在尝试弄清楚是否可以并行机器学习算法的训练方面。训练中计算成本较高的部分涉及 Cholesky 分解正定矩阵（协方差矩阵）。我将尝试纯粹用矩阵代数来构建这个问题。如果您需要更多信息，请告诉我。

假设我们有一个块矩阵（协方差矩阵，但这与问题无关），

 
M = A  B  
    B* C

其中 A 和 C 与来自两个不同集合的训练数据相关。 A 和 B 都是正定的。为了简单起见，我们还假设 A 和 C 的大小为 nxn。

有一个进行分块 Cholesky 分解的公式。请参阅http://en.wikipedia.org/wiki/Block_LU_decomposition。总结一下我们有以下结果。

M = LU

where (* 表示转置)

L = A^{1/2}      0 
    B*A^{-*/2}  Q^{1/2}

where

Q = C - B*A^{-1}B

现在假设已经进行了与矩阵 A 和 C 相关的训练，因此我们对 A 和 C 进行了乔列斯基分解，得到 A^{1/2} 和 C^{1 /2} （因此，使用前向替换可以直接计算逆 A^{-1/2} 和 C^{-1/2}）。

用我们现在拥有的这些数量重写 Q。

Q = Q^{1/2} Q^{*/2} = C^{1/2} C^{*/2} - B* A^{-*/2}A^{-1/2} B

我的问题是：给定这个设置，是否可以代数计算 Q^{1/2}，而不必对 Q 应用乔列斯基分解。或者换句话说，我可以使用 C^{1/2} 来帮助我Q^{1/2} 的计算。如果这是可能的，那么就可以轻松地并行训练。

预先感谢您的回复。抱歉矩阵排版。有没有什么合理的方法来排版数学或矩阵？

马特。

原文

I am trying to work out if I can parallelise the training aspect of a machine learning algorithm. The computationally expensive part of the training involves Cholesky decomposing a positive-definite matrix (covariance matrix). I'll try and frame the question purely in terms of the matrix algebra. Let me know if you need any more info.

Lets say we have a block matrix (covariance matrix, but that's not relevant to the problem)

 
M = A  B  
    B* C

where A and C relate to training data from two different sets. Both A , and B are positive definite. Lets also assume for simplicity that A and C have size nxn.

There is a formula for carrying out block Cholesky decomposition. See http://en.wikipedia.org/wiki/Block_LU_decomposition. Summarising we have the following result.

M = LU

where (* indicates transpose)

L = A^{1/2}      0 
    B*A^{-*/2}  Q^{1/2}

where

Q = C - B*A^{-1}B

Now lets say training related to matrices A and C has already been carried out, so we have carried out the cholesky decomposition for A, and C giving A^{1/2}, and C^{1/2} (It is therefore straightforward to calculate the inverses A^{-1/2}, and C^{-1/2} using forward substitution).

Rewriting the Q in terms of these quantities we now have.

Q = Q^{1/2} Q^{*/2} = C^{1/2} C^{*/2} - B* A^{-*/2}A^{-1/2} B

My question is this: Given this set up is it possible to algebraicly calculate Q^{1/2} without having to apply cholesky decomposition to Q. Or in other words can I use C^{1/2} to help me in the calculation of Q^{1/2}. If this were possible it would then be possible to easily parallelise the training.

Thanks in advance for any replies. Sorry about the matrix typesetting. Is there any way sensible way to typeset maths or matrices in particular?

Matt.

分享到QQ

分享到微博