Yielding to other threads/tasks in OpenMP

Posted 2024-12-07 22:41:24

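I want to use OpenMP with CUDA to achieve overlapping kernel executions. The kernel calls are all asynchronous, but I have very little code between launches, so the individual OpenMP threads tend to block when they try to launch another kernel or do a memory copy. (A memory copy does not always follow the kernel call immediately, so asynchronous memory copies aren't necessarily the solution.) I would like a way to signal the OpenMP scheduler to switch to another OpenMP thread. Is this possible in OpenMP?

Example: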

int main() {
    #pragma omp parallel for
    for (int i = 0; i < 10; i++) {
        for (int j = 0; j < 10; j++) {
            // call kernel here

            // ---->   Would like to signal to continue with other
            //           threads, as the next call will block

            // copy data from kernel
        }
    }
}

