使用并行扫描构建堆
我在 NVIDIA 的网站上读到,可以使用 CUDA 和扫描操作并行构建一棵树。我想知道是否有任何文档解释如何使用扫描操作并行构建堆。任何包含此代码的文档也都很棒。我的 GPU 内存中有一些值,我需要从中返回前 k 个值。我在想并行构建一个堆并返回前 k 个结果将是一个好主意。
或者,如果没有这样的文档,我想知道是否有任何文档解释如何使用 scan 并行构建一棵树。
I read on NVIDIA's website that one can build a tree in parallel using CUDA and scan operation. I was wondering if there are any documents that explain how one can build a heap in parallel using the scan operation. Any documents that contain the code for this will also be great. I have some values on the GPU memory out of which I need to return the top k values. I was thinking building a heap in Parallel and returning top k results would be a good idea.
Alternatively, if there are no such documents, I was wondering if there are any documents that explain how one can build a tree in parallel using scan.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
Sean Baxter 在此处讨论订单统计:
http://www.moderngpu.com/select/mgpuselect.html< /a>
虽然他的函数实现了第 k 个元素的选择,而不是前 k 个元素的计算。
他的网站还广泛报道了 Scan。
Sean Baxter discusses order statistics here:
http://www.moderngpu.com/select/mgpuselect.html
although his function implements selection of the k'th element, not computation of the top k elements.
His Web site also features extensive coverage of Scan.