CUDA: changing a single value in an array

Posted on 2024-12-05 08:40:21

I have a vector called d_index that was computed in CUDA device memory, and I want to change just one value, like this:

d_index[columnsA-rowsA]=columnsA;

How can I do this without having to copy it to the system memory and then back to the device memory?

清眉祭 2024-12-12 08:40:21

You could either launch a kernel on a <<<1,1>>> grid that changes only the desired element:

__global__ void change_elem(int *arr, int idx, int val) {
    arr[idx] = val;
}
// ....
// Somewhere in CPU code
change_elem<<<1,1>>>(d_index, columnsA-rowsA, columnsA);

or use cudaMemcpy to copy just that one element from the host:

int tmp = columnsA;
cudaMemcpy(&d_index[columnsA-rowsA], &tmp, sizeof(int), cudaMemcpyHostToDevice);
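For anyone who wants to try both variants side by side, here is a minimal sketch that wraps each one in a small helper and reads the element back to check it. The helper names (set_elem_kernel, set_elem_memcpy, get_elem, demo_single_update) and the values of rowsA and columnsA are made up for illustration; only change_elem and the cudaMemcpy call come from the snippets above.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Kernel from the answer: a single thread writes a single element.
__global__ void change_elem(int *arr, int idx, int val) {
    arr[idx] = val;
}

// Hypothetical helper: update one element with a <<<1,1>>> launch.
cudaError_t set_elem_kernel(int *d_arr, int idx, int val) {
    change_elem<<<1, 1>>>(d_arr, idx, val);
    cudaError_t err = cudaGetLastError();   // catches launch errors
    if (err != cudaSuccess) return err;
    return cudaDeviceSynchronize();         // catches errors raised while running
}

// Hypothetical helper: update one element with a one-int host-to-device copy.
cudaError_t set_elem_memcpy(int *d_arr, int idx, int val) {
    return cudaMemcpy(d_arr + idx, &val, sizeof(int), cudaMemcpyHostToDevice);
}

// Hypothetical helper: read one element back to the host for checking.
int get_elem(const int *d_arr, int idx) {
    int out = -1;
    cudaMemcpy(&out, d_arr + idx, sizeof(int), cudaMemcpyDeviceToHost);
    return out;
}

// Returns 0 if both update paths produced the expected value, nonzero otherwise.
int demo_single_update() {
    const int rowsA = 3, columnsA = 8;      // illustrative sizes, not from the question
    int *d_index = nullptr;
    if (cudaMalloc(&d_index, columnsA * sizeof(int)) != cudaSuccess) return 1;
    cudaMemset(d_index, 0, columnsA * sizeof(int));

    int status = 0;
    // Variant 1: single-thread kernel.
    if (set_elem_kernel(d_index, columnsA - rowsA, columnsA) != cudaSuccess) status = 2;
    else if (get_elem(d_index, columnsA - rowsA) != columnsA) status = 3;

    // Variant 2: one-element cudaMemcpy into a different slot.
    if (status == 0) {
        if (set_elem_memcpy(d_index, 0, 42) != cudaSuccess) status = 4;
        else if (get_elem(d_index, 0) != 42) status = 5;
    }

    cudaFree(d_index);
    return status;
}
```

Calling demo_single_update() from main and printing the result is enough to confirm that neither variant needs a round trip of the whole array through host memory.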

If you only do this once, I don't think it matters much which version you use. If you run this code often, you should consider folding the array modification into another kernel to avoid the extra invocation overhead.
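As a rough sketch of that last suggestion: the question does not show the kernel that computes d_index, so the per-element computation below (arr[i] = i) is only a stand-in, and fill_and_patch, fill_and_patch_host, patch_idx, and patch_val are hypothetical names. The idea is that the thread already responsible for the target index writes the override value, so no second launch or copy is needed.

```cuda
#include <vector>
#include <cuda_runtime.h>

// Stand-in for the kernel that computes the array, with the single-element
// override folded in. Each thread writes exactly one element, so the patch
// cannot race with the regular computation.
__global__ void fill_and_patch(int *arr, int n, int patch_idx, int patch_val) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    int v = i;                          // placeholder for the real computation
    if (i == patch_idx) v = patch_val;  // the one element that gets overridden
    arr[i] = v;
}

// Hypothetical host wrapper: runs the fused kernel and returns the result,
// or an empty vector if any CUDA call fails.
std::vector<int> fill_and_patch_host(int n, int patch_idx, int patch_val) {
    std::vector<int> out;
    int *d_arr = nullptr;
    if (cudaMalloc(&d_arr, n * sizeof(int)) != cudaSuccess) return out;

    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;
    fill_and_patch<<<blocks, threads>>>(d_arr, n, patch_idx, patch_val);

    if (cudaGetLastError() == cudaSuccess && cudaDeviceSynchronize() == cudaSuccess) {
        out.resize(n);
        if (cudaMemcpy(out.data(), d_arr, n * sizeof(int),
                       cudaMemcpyDeviceToHost) != cudaSuccess) {
            out.clear();
        }
    }
    cudaFree(d_arr);
    return out;
}
```

With the question's indices this would be called as fill_and_patch_host(columnsA, columnsA - rowsA, columnsA). Whether fusing is practical depends on how the real kernel is structured, for example whether the target index is known before that kernel launches.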
