错误cl_invalid_value在简单的C+&#x2B上OPENCL图像操纵程序
我在C ++中编写一个简单的OpenCL程序,其中我需要颠倒地翻转输入图像,我使用CIMG读取和编写图像文件。 问题在于,即使程序编译并运行而没有任何错误,输出文件也为空白。
这是CL内核代码:
const sampler_t sampler = CLK_ADDRESS_CLAMP_TO_EDGE | CLK_FILTER_NEAREST;
__kernel void img_turn(
read_only image2d_t I,
write_only image2d_t O
)
{
int gid_x = get_global_id(0);
int gid_y = get_global_id(1);
int w = get_image_width(I);
int h = get_image_height(I);
if (gid_x >= w || gid_y >= h)
return;
uint4 p = read_imageui(I, sampler, (int2)(gid_x, gid_y));
write_imageui(O, (int2)(gid_x, h - gid_y), p);
}
这是主机代码的位,首先是输入映像(编辑):
CImg<unsigned char> img_in(img_file_name);
cl_image_format format = {
CL_RGBA,
CL_UNSIGNED_INT8,
};
cl_image_desc desc = {
.image_type = CL_MEM_OBJECT_IMAGE2D,
.image_width = (size_t) img_in.width(),
.image_height = (size_t) img_in.height(),
.image_row_pitch = 0,
.image_slice_pitch = 0,
.num_mip_levels = 0,
.num_samples = 0,
.buffer = NULL,
};
cl_mem input_img = clCreateImage(
context,
CL_MEM_READ_ONLY | CL_MEM_USE_HOST_PTR,
(const cl_image_format *) &format,
(const cl_image_desc *) &desc,
img_in.data(),
&errNum
);
输出图像的定义(编辑):
CImg<unsigned char> img_out(img_in.width(), img_in.height(), 1, 4);
format = {
CL_RGBA,
CL_UNSIGNED_INT8,
};
desc = {
.image_type = CL_MEM_OBJECT_IMAGE2D,
.image_width = (size_t) img_out.width(),
.image_height = (size_t) img_out.height(),
.image_row_pitch = 0,
.image_slice_pitch = 0,
.num_mip_levels = 0,
.num_samples = 0,
.buffer = NULL,
};
cl_mem output_img = clCreateImage(
context,
CL_MEM_WRITE_ONLY | CL_MEM_USE_HOST_PTR,
(const cl_image_format *) &format,
(const cl_image_desc *) &desc,
img_out.data(),
NULL
);
代码的最后一部分,我在其中加入图像并运行程序(编辑):
size_t origins[3] = {0, 0, 0};
size_t region_in[3] = {(size_t) img_in.width(), (size_t) img_in.height(), (size_t) 1};
errNum = clSetKernelArg(kernel, 0, sizeof(cl_mem), input_img);
errNum |= clSetKernelArg(kernel, 1, sizeof(cl_mem), output_img);
size_t global[2] = {(size_t) img_in.width(), (size_t) img_in.height()};
clEnqueueNDRangeKernel(command_queue, kernel, 2, NULL, global, NULL, 0, NULL, &kernel_event);
errNum = clEnqueueWriteImage(command_queue, input_img, CL_TRUE, origins, region_in, 0, 0, img_in.data(), 0, NULL, NULL);
size_t region_out[3] = {(size_t) img_out.width(), (size_t) img_out.height(), (size_t) 1};
errNum = clEnqueueReadImage(command_queue, output_img, CL_TRUE, origins, region_out, 0, 0, img_out.data(), 0, NULL, NULL);
clWaitForEvents(1, &kernel_event);
img_out.save("./output_img.png");
编译和运行程序后,创建了“ output_img.png”图像文件,但使用文本编辑器打开时,它是空白的:0bytes,没有任何数据。
编辑: 因此,在Petert的建议(以及对我犯了一些愚蠢错误的修正)之后,该程序现在似乎正在做某事(它执行了3秒钟),但仍然什么也没有产生。
编辑2: 调试后,我指出了问题:clenqueuereadimage
返回错误cl_invalid_value
和如果由原点指定的区域读取的区域不超出范围... 但是我不知道为什么。它的大小与输入映像的大小相同,但是clenquewriteImage
即使使用相同的参数调用,也不会返回任何错误。
I'm writing a simple OpenCL program in C++ where i need to flip an input image upside-down, i'm using CImg to read and write image files.
the problem is that even though the program compiles and run without any error, the output file is blank.
Here's the cl kernel code:
const sampler_t sampler = CLK_ADDRESS_CLAMP_TO_EDGE | CLK_FILTER_NEAREST;
__kernel void img_turn(
read_only image2d_t I,
write_only image2d_t O
)
{
int gid_x = get_global_id(0);
int gid_y = get_global_id(1);
int w = get_image_width(I);
int h = get_image_height(I);
if (gid_x >= w || gid_y >= h)
return;
uint4 p = read_imageui(I, sampler, (int2)(gid_x, gid_y));
write_imageui(O, (int2)(gid_x, h - gid_y), p);
}
and here's bits of the host code, first the input image (Edited):
CImg<unsigned char> img_in(img_file_name);
cl_image_format format = {
CL_RGBA,
CL_UNSIGNED_INT8,
};
cl_image_desc desc = {
.image_type = CL_MEM_OBJECT_IMAGE2D,
.image_width = (size_t) img_in.width(),
.image_height = (size_t) img_in.height(),
.image_row_pitch = 0,
.image_slice_pitch = 0,
.num_mip_levels = 0,
.num_samples = 0,
.buffer = NULL,
};
cl_mem input_img = clCreateImage(
context,
CL_MEM_READ_ONLY | CL_MEM_USE_HOST_PTR,
(const cl_image_format *) &format,
(const cl_image_desc *) &desc,
img_in.data(),
&errNum
);
the definition of the output image (Edited):
CImg<unsigned char> img_out(img_in.width(), img_in.height(), 1, 4);
format = {
CL_RGBA,
CL_UNSIGNED_INT8,
};
desc = {
.image_type = CL_MEM_OBJECT_IMAGE2D,
.image_width = (size_t) img_out.width(),
.image_height = (size_t) img_out.height(),
.image_row_pitch = 0,
.image_slice_pitch = 0,
.num_mip_levels = 0,
.num_samples = 0,
.buffer = NULL,
};
cl_mem output_img = clCreateImage(
context,
CL_MEM_WRITE_ONLY | CL_MEM_USE_HOST_PTR,
(const cl_image_format *) &format,
(const cl_image_desc *) &desc,
img_out.data(),
NULL
);
and the last part of the code, where i enqueue the images and run the program (Edited):
size_t origins[3] = {0, 0, 0};
size_t region_in[3] = {(size_t) img_in.width(), (size_t) img_in.height(), (size_t) 1};
errNum = clSetKernelArg(kernel, 0, sizeof(cl_mem), input_img);
errNum |= clSetKernelArg(kernel, 1, sizeof(cl_mem), output_img);
size_t global[2] = {(size_t) img_in.width(), (size_t) img_in.height()};
clEnqueueNDRangeKernel(command_queue, kernel, 2, NULL, global, NULL, 0, NULL, &kernel_event);
errNum = clEnqueueWriteImage(command_queue, input_img, CL_TRUE, origins, region_in, 0, 0, img_in.data(), 0, NULL, NULL);
size_t region_out[3] = {(size_t) img_out.width(), (size_t) img_out.height(), (size_t) 1};
errNum = clEnqueueReadImage(command_queue, output_img, CL_TRUE, origins, region_out, 0, 0, img_out.data(), 0, NULL, NULL);
clWaitForEvents(1, &kernel_event);
img_out.save("./output_img.png");
after compiling and running the program the 'output_img.png' image file is created but it's blank: 0Bytes and no data whatsoever when opened with a text editor.
Edit:
So after PeterT's suggestion (and after some corrections of some dumb mistakes i made), the program now seems to be doing something (it executes for 3 seconds), but still produces nothing.
Edit 2:
After a bit of debugging, i pinpointed the problem: clEnqueueReadImage
returns the error CL_INVALID_VALUE
, and the documentation specifies that it returns that error if the region being read specified by origin and region is out of bounds ...
But i don't know why. It's the same size of the input image, but clEnqueueWriteImage
doesn't return any error, even if called with the same parameters.
Edit 3:
The problem has been fixed by Egor's response. But now it doesn't output the wanted result:
Input image:
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
首先,您使用
cl_rgba
格式创建OpenCl Image对象,然后将指针传递给CIMG
Pixel数据。但是cimg
使用“平面”结构来保留数据,并且颜色通道的值未交错(有关更多信息,请参见如何使用CIMG存储像素数据?)。例如,带有alpha通道的彩色图像将存储在内存中:r1r2r3 ... b1b2b3 ... g1g2g3 ... a1a2a3 ...
,但
cl_rgba
格式意味着图像的交错通道:R1G1B1A1R2G2B2B2A2R3G3B3A3 ...
。因此,在将图像复制到设备内存之前,必须将图像转换为cl_rgba
格式。例如,使用以下函数:因此,将图像复制到设备的代码看起来像:
另外,在保存之前必须转换输出图像。例如,使用以下函数:
和从设备复制图像的代码看起来像:
接下来,您可以使用默认属性创建命令标题。这意味着将按顺序执行命令标题的命令。另外,您可以使用阻止读写(
blocking_read
和blocking_write
flags设置为cl_true
forclenqueEeReadImage
and code> andclenquewriteImage
函数调用)。在这种情况下,代码可以无需使用OpenCL事件即可同步命令的执行。只需要以正确的顺序加入命令并使用阻止读取命令以获取结果:最后,应该将新的
y
位置计算为get_image_height() - (gid_y) + 1)
是因为gid_y
是间隔[0,get_image_height())
。因此,内核代码应该看起来像:次要注意,如果您使用
ClenqueWriteImage
直接将图像复制到设备,则可以省略cl_mem_use_host_ptr_ptr
forclcreateeimage
flag flag称呼。First, you create OpenCL image object using
CL_RGBA
format and pass the pointer toCImg
pixel data. ButCImg
uses "planar" structure to keep the data and the values for color channels are not interleaved (for more information please see How pixel data are stored with CImg?). For example, colored image with alpha channel will be stored in memory as:R1R2R3...B1B2B3...G1G2G3...A1A2A3...
But
CL_RGBA
format implies the interleaved channels for the image:R1G1B1A1R2G2B2A2R3G3B3A3...
. Therefore, it is necessary to convert the image toCL_RGBA
format before copying it to the device memory. For example, using following function:So the code to copy the image to the device will look like:
Also, it will be necessary to convert the output image before saving it. For example using following function:
And the code to copy the image from the device will look like:
Next, you create the command-queue with default properties. It means that the commands enqueued to the command-queue will be executed in order. Also, you use blocking read and write (
blocking_read
andblocking_write
flags are set toCL_TRUE
forclEnqueueReadImage
andclEnqueueWriteImage
function calls). In this case the code can work without using OpenCL events to synchronize the execution of the commands. It is just necessary to enqueue the commands in the correct order and use blocking read command to get the result:Finally, new
y
position for the pixel should be calculated asget_image_height() - (gid_y + 1)
becausegid_y
is in interval[0, get_image_height())
. So the kernel code should look like:Minor note, if you directly copy the image to the device using
clEnqueueWriteImage
you can omitCL_MEM_USE_HOST_PTR
flag forclCreateImage
call.