nvblas通过cblas
我有一个C ++代码,并且使用CBLA来计算dgemm
和dtrsm
。我有兴趣使用GPU进行性能。 通过我完成的测试,我可以使用nvblas.h
使用NVBLA,但是它不接近CBLA。我想我可以将代码更改为dgemm
就像nvblas.h
一样。是否有一种更简单的方法可以使用NVBLA链接CBLA?
I have a C++ code and I used CBLAS to compute dgemm
and dtrsm
. I am interested to use GPUs for performance.
With the tests I have done I could use NVBLAS using nvblas.h
, however it is not close to CBLAS. I think I can change my code to call dgemm
just like nvblas.h
. Is there an easier way to link CBLAS using NVBLAS?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
https://docs.nvidia.com/cuda/cuda/cuda/nvblas/nvblas/index.htmll#配置
由于NVBLA是Blas的倒数替换,因此必须通过ASCII文本文件进行配置,该文件描述了多少gpus可以参与截取的BLAS调用。配置文件在库加载时被解析。配置文件的格式基于关键字,然后是一个或多个用户定义的参数。最多允许每行关键字。以字符#开头的空白行或行被忽略。
https://docs.nvidia.com/cuda/nvblas/index.html#configuration
Because NVBLAS is a drop-in replacement of BLAS, it must be configured through an ASCII text file that describes how many and which GPUs can participate in the intercepted BLAS calls. The configuration file is parsed at the time of the loading of the library. The format of the configuration file is based on keywords optionally followed by one or more user-defined parameters. At most one keyword per line is allowed. Blank lines or lines beginning with the character # are ignored.