BLAS 向量乘法catlas_saxpby 无法正常工作

发布于 2024-11-18 08:45:44 字数 1112 浏览 1 评论 0原文

我试图有两个任意长度的向量（典型长度为 2048）并逐个元素相乘。因此对于所有 n，Z[n] = X[n] * Y[n]。

我设置来测试的代码相当基本：

float inputX[4] = { 2, 4, 8, 16 };
float inputY[4] = { 2, 4, 8, 16 };

catlas_saxpby(4, 1, inputX, 1, 1, inputY, 1);

结果进入 inputY，结果是

4.000000, 8.000000, 16.000000, 32.000000

Which，如果它们相乘，它应该是 4, 16, 64, 256。但它看起来像是相加。

所以这没有达到我的预期，并且文档没有给我足够的信息来弄清楚它在做什么。

有什么想法吗？

Apple's documentation for BLAS says this: 

Computes the product of two vectors, scaling each one separately (single-precision).

void catlas_saxpby (
   const int N,
   const float alpha,
   const float *X,
   const int incX,
   const float beta,
   float *Y,
   const int incY
);
Parameters
N
Number of elements in the vector.
alpha
Scaling factor for X.
X
Input vector X.
incX
Stride within X. For example, if incX is 7, every 7th element is used.
beta
Scaling factor for Y.
Y
Input vector Y.
incY
Stride within Y. For example, if incY is 7, every 7th element is used.
Discussion
On return, the contents of vector Y are replaced with the result.

原文

I am trying to have two arbitrary length vectors (typical length will be 2048) and multiply element by element. So Z[n] = X[n] * Y[n] for all n.

The code I have setup to test is fairly basic:

float inputX[4] = { 2, 4, 8, 16 };
float inputY[4] = { 2, 4, 8, 16 };

catlas_saxpby(4, 1, inputX, 1, 1, inputY, 1);

The result goes into inputY, and the result is

4.000000, 8.000000, 16.000000, 32.000000

Which if they were multiplying it should be 4, 16, 64, 256. But it looks like it is adding.

So this is not doing what I expect, and the documentation doesn't give me enough information to figure what it is doing.

Any ideas?

Apple's documentation for BLAS says this: 

Computes the product of two vectors, scaling each one separately (single-precision).

void catlas_saxpby (
   const int N,
   const float alpha,
   const float *X,
   const int incX,
   const float beta,
   float *Y,
   const int incY
);
Parameters
N
Number of elements in the vector.
alpha
Scaling factor for X.
X
Input vector X.
incX
Stride within X. For example, if incX is 7, every 7th element is used.
beta
Scaling factor for Y.
Y
Input vector Y.
incY
Stride within Y. For example, if incY is 7, every 7th element is used.
Discussion
On return, the contents of vector Y are replaced with the result.

分享到QQ

分享到微博