How do I write a CUDA makefile that runs the code on the CPU, to measure CPU FLOPS?

Posted 2024-09-28 19:11:50


I'm trying to measure GPU and CPU FLOPS, and I got the source from here.

I renamed it to cudaflops.cu and compiled it with this makefile:

################################################################################
#
# Build script for project
#
################################################################################

# Add source files here 
EXECUTABLE  := benchmark
# Cuda source files (compiled with cudacc) 
CUFILES     := cudaflops.cu
# C/C++ source files (compiled with gcc / c++) 
CCFILES     := 


################################################################################
# Rules and targets

include ../../common/common.mk

#########################################

It works fine and reports 367 GFLOPS.

But now I don't know how to test this source on the CPU. I read this, which says that the source could run on the CPU.

So how should I modify the makefile to do that?


只怪假的太真实 2024-10-05 19:11:50

Hey, the problem is that you need the Portland Group compiler to run your code on x86:
hxxp://www.prnewswire.com/news-releases/pgi-to-develop-compiler-based-on-nvidia-cuda-c-architecture-for-x86-platforms-103457159.html

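Also, that article says the compiler will be demonstrated on November 13-15, 2010, so I'm not sure when it will be publicly released (possibly as a beta). In other words: no, you can't run CUDA natively on x86 yet.

The easiest thing to do right now is to write a C/C++ function that does exactly what that benchmark does (it should be very easy to port). Some of the CUDA samples in their SDK compare CPU and GPU (matrix multiplication, I think), so try one of those first if you just want a GPU vs. CPU performance comparison. It should basically do the same thing as the benchmark code, except that it is a "real-world" case.

Even simpler: ask on the NVIDIA forums about your graphics card. They love telling everyone about their GPU vs. CPU performance (just say "I have GPU x and I get y GFLOPS; what GPU vs. CPU results do other people get?").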
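As a rough illustration of the "write a C/C++ function" route, here is a minimal sketch of a CPU-side FLOP counter. It is not a port of the benchmark from the question (that source is not shown here); the name `measure_cpu_gflops`, the `FlopResult` struct, the choice of 8 accumulators, and the constants are all assumptions made for this example. It is single-threaded and relies on whatever vectorization the compiler happens to do, so it will likely report far less than the CPU's theoretical peak.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>

// Hypothetical result type for this sketch (not part of any CUDA SDK).
struct FlopResult {
    double gflops;    // measured throughput in GFLOPS
    double checksum;  // sum of the accumulators, returned so the work is observable
};

// Hypothetical helper: times `iters` rounds of multiply-adds on 8
// independent float accumulators and converts the count to GFLOPS.
FlopResult measure_cpu_gflops(std::size_t iters) {
    assert(iters > 0);

    constexpr int kLanes = 8;  // independent chains, to give the CPU some parallelism
    float acc[kLanes];
    for (int i = 0; i < kLanes; ++i) {
        acc[i] = 1.0f + 0.1f * static_cast<float>(i);
    }
    const float a = 0.999f;
    const float b = 0.001f;

    const auto start = std::chrono::steady_clock::now();
    for (std::size_t n = 0; n < iters; ++n) {
        for (int i = 0; i < kLanes; ++i) {
            acc[i] = acc[i] * a + b;  // 1 multiply + 1 add = 2 FLOPs
        }
    }
    const auto stop = std::chrono::steady_clock::now();

    const double seconds = std::chrono::duration<double>(stop - start).count();
    const double flops = 2.0 * kLanes * static_cast<double>(iters);

    double checksum = 0.0;
    for (int i = 0; i < kLanes; ++i) {
        checksum += acc[i];
    }

    FlopResult result;
    // Guard against a zero-length interval on very short runs.
    result.gflops = (seconds > 0.0) ? flops / seconds / 1e9 : 0.0;
    result.checksum = checksum;
    return result;
}
```

To try it, call `measure_cpu_gflops` from a `main` with a large iteration count (tens of millions or more) and print both fields; printing `checksum` keeps the compiler from discarding the loop. Because this is plain C++, it can be built with gcc or g++ directly and does not need nvcc.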
