当前位置：文江博客话题详情

OpenCV中的高斯滤波器算法是如何工作的

发布于 2024-07-21 15:05:55 字数 199 浏览 11 评论 0原文

我写了自己的高斯滤波器，但它真的很慢。

OpenCV的高斯算法快得多，比我的高斯滤波器快20倍。我想在我的项目中重写OpenCV的高斯算法，并且我不想在我的项目中包含opencv。

然而，

谁能给我算法描述，opencv的源代码似乎很难理解？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

扛起拖把扫天下 2024-07-28 15:05:55

高斯滤波器有一个非常容易加速的特性：滤波器可以独立地应用于两个维度。您定义一个垂直操作的一维过滤器和另一个水平操作的一维过滤器，然后应用它们；这产生与在二维中应用单个滤波器相同的效果。

除此之外，您可能需要查看 SIMD 说明例如 SSE3 适用于您的处理器。

回复收藏 0 原文

琴流音 2024-07-28 15:05:55

为了回答问题的第二部分，高斯模糊只是将 3 维高斯表面用作图像上的卷积核。维基百科对算法本身有很好的参考，但基本上，你采用高斯的值曲线并将其转换为方阵，然后将其乘以图像中的每个像素，例如：（

Kernel:               
[0 1 2 0 0
1 4 6 4 1      X   Iterate over every single pixel in the image
2 6 10 6 2
1 4 6 4 1
0 1 2 1 0]

请注意，这只是一个示例内核，有非常具体的方程式，根据您的高斯变量，您会得到不同的结果结果）

为了回答问题的性能部分，假设图像大小恒定，该算法的整体速度将取决于一些因素。假设图像是 NxM 像素，卷积核是 PxP 像素。您将必须执行 PPN*M 次操作。 P 越大，您需要对给定图像执行的操作就越多。您可以巧妙地使用此处使用的算法，进行非常具体的基于行或列的数学运算。

实施也非常重要。如果您想要极其高效，您可能需要使用您的架构提供的最先进的指令。如果您使用的是 Intel x86 芯片，您可能需要考虑获取 Intel 性能原语 (IPP) 的许可证并直接调用这些指令。 IIRC，OpenCV 确实会在 IPP 可用时使用它......

如果给定架构上的浮点性能很差，您也可以做一些非常聪明的事情并使用所有缩放的整数。这可能会加快速度，但在走这条路之前我会先考虑其他选择。

To answer the second part of your question, a Gaussian blur is simply the a 3-d gaussian surface applied as a convolution kernel over the image. Wikipedia has a great reference on the algorithm itself, but basically, you take the values of a Gaussian curve and convert that into a square matrix, and multiply it by every single pixel in your image, e.g.:

Kernel:               
[0 1 2 0 0
1 4 6 4 1      X   Iterate over every single pixel in the image
2 6 10 6 2
1 4 6 4 1
0 1 2 1 0]

(Note that this is just a sample kernel, there are very specific eqns which, depending on your Gaussian variables, you'll get different results)

To answer the performance part of your question, the overall speed of this algorithm would depend on a few things, assuming a constant sized image. Lets say the image is NxM pixels, and the convolution kernel is PxP pixels. You're going to have to do PPN*M operations. The greater P, the more operations you're going to have to do for a given image. You can get crafty with the algorithm you use here, doing very specific row or columnar based math.

Implementation is also very important. If you want to be extremely efficient, you'll probably want to use the most advanced instructions that your architecture offers. If you're using an Intel x86 chip, you'll probably want to look at getting a license for Intel performance primitives (IPP) and calling those instructions directly. IIRC, OpenCV does make use of IPP when its available...

You could also do something very smart and work with all scaled integers if the floating point performance on your given architecture is poor. This would probably speed things up a bit, but I would look at other options first before going down this road.

回复收藏 0 原文