在 OpenCV 中使用 imread() 加载图像时如何捕获损坏的 JPEG?

发布于 2025-01-03 00:27:10 字数 344 浏览 2 评论 0原文

OpenCV 会显示类似

Corrupt JPEG data: premature end of data segment

Corrupt JPEG data: bad Huffman code

Corrupt JPEG data: 22 extraneous bytes before marker 0xd9

当使用 imread() 加载损坏的 jpeg 图像时, 内容。 我能以某种方式抓住它吗?否则我为什么会得到这些信息? 我必须自己检查二进制文件吗?

OpenCV says something like

Corrupt JPEG data: premature end of data segment

or

Corrupt JPEG data: bad Huffman code

or

Corrupt JPEG data: 22 extraneous bytes before marker 0xd9

when loading a corrupt jpeg image with imread().
Can I somehow catch that? Why would I get this information otherwise?
Do I have to check the binary file on my own?

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(11

逆光飞翔i 2025-01-10 00:27:10

OpenCV(版本 2.4)不会覆盖 libjpeg 的基本错误处理,使它们“无法捕获”。将以下方法添加到 modules/highgui/src/grfmt_jpeg.cpp 中,位于 error_exit() 定义的正下方:

METHODDEF(void)
output_message( j_common_ptr cinfo )
{
    char buffer[JMSG_LENGTH_MAX];

    /* Create the message */
    (*cinfo->err->format_message) (cinfo, buffer);

    /* Default OpenCV error handling instead of print */
    CV_Error(CV_StsError, buffer);
}

现在将该方法应用于解码器错误处理程序

state->cinfo.err = jpeg_std_error(&state->jerr.pub);
state->jerr.pub.error_exit = error_exit;
state->jerr.pub.output_message = output_message; /* Add this line */

:编码器错误处理程序的方法:

cinfo.err = jpeg_std_error(&jerr.pub);
jerr.pub.error_exit = error_exit;
jerr.pub.output_message = output_message; /* Add this line */

像往常一样重新编译并安装 OpenCV。从现在开始,您应该能够像捕获任何其他 OpenCV 错误一样捕获 libjpeg 错误。示例:(

>>> cv2.imread("/var/opencv/bad_image.jpg")
OpenCV Error: Unspecified error (Corrupt JPEG data: 1137 extraneous bytes before marker 0xc4) in output_message, file /var/opencv/opencv-2.4.9/modules/highgui/src/grfmt_jpeg.cpp, line 180
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
cv2.error: /var/opencv/opencv-2.4.9/modules/highgui/src/grfmt_jpeg.cpp:180: error: (-2) Corrupt JPEG data: 1137 extraneous bytes before marker 0xc4 in function output_message

我已经针对上述内容提交了拉取请求,但它被拒绝了,因为它会导致人们在没有异常捕获的情况下阅读图像时出现问题。)

希望这可以帮助任何仍在解决此问题的人。祝你好运。

OpenCV (version 2.4) does not overwrite the basic error handling for libjpeg, making them 'uncatchable'. Add the following method to modules/highgui/src/grfmt_jpeg.cpp, right below the definition of error_exit():

METHODDEF(void)
output_message( j_common_ptr cinfo )
{
    char buffer[JMSG_LENGTH_MAX];

    /* Create the message */
    (*cinfo->err->format_message) (cinfo, buffer);

    /* Default OpenCV error handling instead of print */
    CV_Error(CV_StsError, buffer);
}

Now apply the method to the decoder error handler:

state->cinfo.err = jpeg_std_error(&state->jerr.pub);
state->jerr.pub.error_exit = error_exit;
state->jerr.pub.output_message = output_message; /* Add this line */

Apply the method to the encoder error handler as well:

cinfo.err = jpeg_std_error(&jerr.pub);
jerr.pub.error_exit = error_exit;
jerr.pub.output_message = output_message; /* Add this line */

Recompile and install OpenCV as usual. From now on you should be able to catch libjpeg errors like any other OpenCV error. Example:

>>> cv2.imread("/var/opencv/bad_image.jpg")
OpenCV Error: Unspecified error (Corrupt JPEG data: 1137 extraneous bytes before marker 0xc4) in output_message, file /var/opencv/opencv-2.4.9/modules/highgui/src/grfmt_jpeg.cpp, line 180
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
cv2.error: /var/opencv/opencv-2.4.9/modules/highgui/src/grfmt_jpeg.cpp:180: error: (-2) Corrupt JPEG data: 1137 extraneous bytes before marker 0xc4 in function output_message

(I've submitted a pull request for the above but it got rejected because it would cause issues with people reading images without exception catching.)

Hope this helps anyone still struggling with this issue. Good luck.

寻梦旅人 2025-01-10 00:27:10

修复文件中的错误可能比尝试修复 OpenCV 的加载功能更容易。如果您使用的是 Linux,则可以使用 ImageMagick 来修复一组图像(通常默认安装它):

$ mogrify -set comment 'Image rewriting with ImageMagick' *.jpg

这命令更改文件的属性,保持图像数据不变。但是,图像会被加载并重新保存,从而消除导致损坏错误的额外信息。

如果您需要有关 ImageMagick 的更多信息,可以访问他们的网站: http://www.imagemagick.org/脚本/index.php

It could be easier to fix the error in the file instead of trying to repair the loading function of OpenCV. If you are using Linux you can use ImageMagick to make reparation to a set of images (is usual to have it installed by default):

$ mogrify -set comment 'Image rewritten with ImageMagick' *.jpg

This command changes a property of the file leaving the image data untouched. However, the image is loaded and resaved, eliminating the extra information that causes the corruption error.

If you need more information about ImageMagick you can visit their website: http://www.imagemagick.org/script/index.php

佼人 2025-01-10 00:27:10

如果使用imread(),则无法捕获它。但是,有一个由 imread() 调用的 imdecode() 函数。 也许它会给你更多反馈。为此,您必须自行将图像加载到内存中,然后调用解码器。

归结为:你必须挖掘 OpenCV 源代码来解决你的问题。

You cannot catch it if you use imread(). However there is imdecode() function that is called by imread(). Maybe it gives you more feedback. For this you would have to load the image into memory on your own and then call the decoder.

It boils down to: You have to dig through the OpenCV sources to solve your problem.

思念绕指尖 2025-01-10 00:27:10

我最近不得不处理这个问题,并在这里找到了解决方案

http:// artax.karlin.mff.cuni.cz/~isa_j1am/other/opencv/

我只需要进行 2 次编辑@ $cv\modules\highgui\src\grfmt_jpeg.cpp。

--- opencv-1.0.0.orig/otherlibs/highgui/grfmt_jpeg.cpp  2006-10-16 13:02:49.000000000 +0200
+++ opencv-1.0.0/otherlibs/highgui/grfmt_jpeg.cpp   2007-08-11 09:10:28.000000000 +0200
@@ -181,7 +181,7 @@
             m_height = cinfo->image_height;
             m_iscolor = cinfo->num_components > 1;

-            result = true;
+            result = (cinfo->err->num_warnings == 0);
         }
     }

@@ -405,8 +405,9 @@
                         icvCvt_CMYK2Gray_8u_C4C1R( buffer[0], 0, data, 0, cvSize(m_width,1) );
                 }
             }
-            result = true;
+
             jpeg_finish_decompress( cinfo );
+            result = (cinfo->err->num_warnings == 0);
         }
     }

i had to deal with this recently and found a solution over here

http://artax.karlin.mff.cuni.cz/~isa_j1am/other/opencv/

i just need to make 2 edits @ $cv\modules\highgui\src\grfmt_jpeg.cpp.

--- opencv-1.0.0.orig/otherlibs/highgui/grfmt_jpeg.cpp  2006-10-16 13:02:49.000000000 +0200
+++ opencv-1.0.0/otherlibs/highgui/grfmt_jpeg.cpp   2007-08-11 09:10:28.000000000 +0200
@@ -181,7 +181,7 @@
             m_height = cinfo->image_height;
             m_iscolor = cinfo->num_components > 1;

-            result = true;
+            result = (cinfo->err->num_warnings == 0);
         }
     }

@@ -405,8 +405,9 @@
                         icvCvt_CMYK2Gray_8u_C4C1R( buffer[0], 0, data, 0, cvSize(m_width,1) );
                 }
             }
-            result = true;
+
             jpeg_finish_decompress( cinfo );
+            result = (cinfo->err->num_warnings == 0);
         }
     }
栀梦 2025-01-10 00:27:10

我正在使用 opencv python 包读取一些图像,也遇到了这个错误消息。 Python 无法捕获此错误。但是如果你想找到哪个图像被损坏而不按照@Robbert的建议重新编译opencv,你可以尝试以下方法。

首先,您可以查明损坏图像所在的目录,这相当容易。然后进入该目录,使用 提供的 mogrify 命令行工具ImageMagick 更改图像元信息,如 @goe 的建议。

mogrify -set comment "errors fixed in meta info" -format png *.jpg

上面的命令会将原始 jpg 图像转换为 png 格式,并清理原始图像以删除元信息中的错误。当您运行mogrify命令时,它还会输出一些有关目录中哪个图像损坏的消息,以便您可以准确地找到损坏的图像。

之后,您可以对原始损坏的 jpg 图像执行任何操作。

I am using opencv python package to read some image and also met this error message. This error can not be catch by Python. But if you want to find which image is corrupted without recompiling opencv as @Robbert suggested, you can try the following method.

First, you can pinpoint the directory where the corrupt images reside, which is fairly easy. Then you go to the directory, and use mogrify command line tool provided by ImageMagick to change the image meta info, as suggest by @goe.

mogrify -set comment "errors fixed in meta info" -format png *.jpg

The above command will convert the original jpg image to png format and also clean the original image to remove errors in meta info. When you run mogrify command, it will also output some message about which image is corrupted in the directory so that you can accurately find the corrupted image.

After that, you can do whatever you want with the original corrupted jpg image.

停顿的约定 2025-01-10 00:27:10

任何人偶然发现这篇文章并阅读这个答案。

我必须找到一个损坏的图像文件。

这些网站可以帮助您损坏文件

第一个和第三个网站没有多大用处。

第二个网站很有趣,因为我可以设置需要损坏的文件量。

我在这里使用的 OpenCV 版本是 3.4.0

我使用正常的 cv2.imread(fileLocation)

fileLocation 损坏图像文件的位置

OpenCV 没有显示此处使用的任何损坏文件的任何错误消息

第一和第三网站只提供了一个文件,并且当我尝试打印它们时,两个网站都没有存储它们

第二网站确实让我决定文件的数量需要被破坏

Corruption% 打印图像时的 Opencv 消息

4% None

10% None

25% None

50%损坏的 JPEG 数据:标记 0x4f 之前有 3 个无关字节

75%损坏的 JPEG 数据:标记之前有 153 个无关字节0xb2

100% 损坏的 JPEG 数据:标记 0xc6 之前有 330 个无关字节

我想我们在这里要做的唯一检查是

如果图像是不是无:
执行您的代码,否则会弹出错误

Any one stumbles upon this post and reads this answer.

I had to get hold of a corrupted image file.

These websites can help you corrupt your file

First and the third website was not that much useful.

Second website is interesting as I could set the amount of file that I need to corrupt.

OpenCV version I used here is 3.4.0

I used normal cv2.imread(fileLocation)

fileLocation Location of corrupted image file

OpenCV didn't show any error message for any of the corrupted files used here

First and Third website only gave one file and both had None stored in them, when I tried to print them

Second website did let me decide the amount of file that was needed to be corrupted

Corruption% Opencv message on printing the image

4% None

10% None

25% None

50% None Corrupt JPEG data: 3 extraneous bytes before marker 0x4f

75% None Corrupt JPEG data: 153 extraneous bytes before marker 0xb2

100% Corrupt JPEG data: 330 extraneous bytes before marker 0xc6 None

I guess the only check we have to make here would be

if image is not None:
Do your code or else pop an error

谁的新欢旧爱 2025-01-10 00:27:10

您可以将 stderr 重定向到一个文件,然后在 imread 之后,在该文件中搜索字符串“Huffman”。搜索文件后,将其清空。它对我有用,现在我可以丢弃损坏的图像并只处理好的图像。

You can redirect stderr to a file, then after imread, search for the string "Huffman" inside that file. After searching the file, empty it. It works for me and now I am able to discard corrupted images and just process good ones.

┼── 2025-01-10 00:27:10

如果您使用 imdecode 加载图像,您可以检查 errno :

  std::vector<char> datas();
  //Load yout image in datas here
  errno = 0;
  cv::Mat mat = cv::imdecode(datas, -1);
  if (errno != 0)
  {
    //Error
  }

(在 OpenCV 3.4.1 上测试)

If you load your image with imdecode, you can check errno :

  std::vector<char> datas();
  //Load yout image in datas here
  errno = 0;
  cv::Mat mat = cv::imdecode(datas, -1);
  if (errno != 0)
  {
    //Error
  }

(tested on OpenCV 3.4.1)

誰ツ都不明白 2025-01-10 00:27:10

我发现问题出在libjpeg上。如果OpenCV使用它,就会出错

损坏的 JPEG 数据:标记 0xd9 之前有 22 个无关字节

您可以尝试 我的解决方案来解决它。它在编译期间禁用 JPEG。之后 OpenCV 无法读/写,但可以工作。

cmake -D CMAKE_BUILD_TYPE=Release -D CMAKE_INSTALL_PREFIX=/usr/local -D BUILD_SHARED_LIBS=OFF -D BUILD_EXAMPLES=OFF -D BUILD_TESTS=OFF -D BUILD_PERF_TESTS=OFF -D WITH_JPEG=OFF -D WITH_IPP=OFF  ..

I found that the issue is in libjpeg. If OpenCV uses it, it gets error

Corrupt JPEG data: 22 extraneous bytes before marker 0xd9

You can try my solution to solve it. It disables JPEG during compilation. After that OpenCV cannot read/write, but it works.

cmake -D CMAKE_BUILD_TYPE=Release -D CMAKE_INSTALL_PREFIX=/usr/local -D BUILD_SHARED_LIBS=OFF -D BUILD_EXAMPLES=OFF -D BUILD_TESTS=OFF -D BUILD_PERF_TESTS=OFF -D WITH_JPEG=OFF -D WITH_IPP=OFF  ..
时光病人 2025-01-10 00:27:10

我找到了一个简单的解决方案,无需重新编译 openCV。
您可以使用 imagemagick 检测相同的错误,但它会按预期返回错误。请参阅此处的说明:https://stackoverflow.com/a/66283167/2887398

I found an easy solution without the need to recompile openCV.
You can use imagemagick to detect the same errors, however it returns an error as expected. See the description here: https://stackoverflow.com/a/66283167/2887398

橪书 2025-01-10 00:27:10

我们使用 OpenCV 读取指定文件夹中的每个 JPEG 图像,并通过以质量设置 95 重新保存来覆盖它。此重新编码过程通过标准化图像数据格式来修复任何损坏问题。

for filename in tqdm(image_files, desc="Re-encoding images"):

    image_path = os.path.join(folder_path, filename)
    img = cv2.imread(image_path)

    if img is not None:
       # Re-encode and overwrite the image
       cv2.imwrite(image_path, img, [int(cv2.IMWRITE_JPEG_QUALITY), 95])
    else:
        print(f"Failed to read image data: {image_path}")

经过长时间的调试,这个解决方案解决了我的问题。

We read each JPEG image in the specified folder using OpenCV and overwrite it by re-saving with a quality setting of 95. This re-encoding process fixes any corruption issues by standardizing the image data format.

for filename in tqdm(image_files, desc="Re-encoding images"):

    image_path = os.path.join(folder_path, filename)
    img = cv2.imread(image_path)

    if img is not None:
       # Re-encode and overwrite the image
       cv2.imwrite(image_path, img, [int(cv2.IMWRITE_JPEG_QUALITY), 95])
    else:
        print(f"Failed to read image data: {image_path}")

This solution fixed my problem after a long time of debugging.

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文