如何量化两幅图像之间的差异?

发布于 2024-07-07 05:33:40 字数 160 浏览 20 评论 0 原文

这就是我想做的:

我定期用网络摄像头拍照。 有点像延时摄影。 但是,如果没有任何变化,即图片看起来几乎一样,我不想存储最新的快照。

我想有某种方法可以量化差异,并且我必须凭经验确定阈值。

我追求简单而不是完美。 我正在使用Python。

Here's what I would like to do:

I'm taking pictures with a webcam at regular intervals. Sort of like a time lapse thing. However, if nothing has really changed, that is, the picture pretty much looks the same, I don't want to store the latest snapshot.

I imagine there's some way of quantifying the difference, and I would have to empirically determine a threshold.

I'm looking for simplicity rather than perfection.
I'm using python.

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(25

山人契 2024-07-14 05:33:40

一般想法

选项 1:将两个图像加载为数组 (scipy.misc.imread) 并计算逐个元素(逐像素)差异。 计算差异的范数。

选项 2:加载两个图像。 计算每个特征向量(如直方图)。 计算特征向量之间的距离而不是图像之间的距离。

然而,首先需要做出一些决定。

问题

您应该首先回答以下问题:

  • 图像的形状和尺寸相同吗?

    如果没有,您可能需要调整它们的大小或裁剪它们。 PIL 库将有助于在 Python 中完成此操作。

    如果它们是使用相同的设置和相同的设备拍摄的,则它们可能是相同的。

  • 图像是否对齐良好?

    如果没有,您可能需要先运行互相关,以首先找到最佳对齐方式。 SciPy 有函数可以做到这一点。

    如果相机和场景静止,则图像可能对齐良好。

  • 图像的曝光总是相同吗? (亮度/对比度相同吗?)

    如果没有,您可能需要标准化图像。

    但要小心,在某些情况下,这可能弊大于利。 例如,深色背景上的单个明亮像素将使标准化图像非常不同。

  • 颜色信息重要吗?

    如果您想注意到颜色变化,您将拥有每个点的颜色值向量,而不是像灰度图像中的标量值。 编写此类代码时需要多加注意。

  • 图像中是否有明显的边缘? 他们可能会搬家吗?

    如果是,您可以首先应用边缘检测算法(例如,使用 Sobel 或 Prewitt 变换计算梯度,应用一些阈值),然后将第一个图像上的边缘与第二个图像上的边缘进行比较。

  • 图像中有噪点吗?

    所有传感器都会产生一定量的噪声污染图像。 低成本传感器的噪声更大。 您可能希望在比较图像之前应用一些降噪措施。 模糊是这里最简单(但不是最好)的方法。

  • 您想注意到什么样的变化?

    这可能会影响图像之间差异所使用的标准的选择。

    考虑使用曼哈顿范数(绝对值之和)或零范数(不等于零的元素数量)来衡量图像变化的程度。 前者会告诉您图像有多少偏差,后者只会告诉您有多少像素不同。

示例

我假设您的图像对齐良好,大小和形状相同,可能具有不同的曝光度。 为简单起见,即使它们是彩色 (RGB) 图像,我也会将它们转换为灰度。

您将需要这些导入:

import sys

from scipy.misc import imread
from scipy.linalg import norm
from scipy import sum, average

主函数,读取两个图像,转换为灰度,比较并打印结果:

def main():
    file1, file2 = sys.argv[1:1+2]
    # read images as 2D arrays (convert to grayscale for simplicity)
    img1 = to_grayscale(imread(file1).astype(float))
    img2 = to_grayscale(imread(file2).astype(float))
    # compare
    n_m, n_0 = compare_images(img1, img2)
    print "Manhattan norm:", n_m, "/ per pixel:", n_m/img1.size
    print "Zero norm:", n_0, "/ per pixel:", n_0*1.0/img1.size

如何比较。 img1img2 在这里是 2D SciPy 数组:

def compare_images(img1, img2):
    # normalize to compensate for exposure difference, this may be unnecessary
    # consider disabling it
    img1 = normalize(img1)
    img2 = normalize(img2)
    # calculate the difference and its norms
    diff = img1 - img2  # elementwise for scipy arrays
    m_norm = sum(abs(diff))  # Manhattan norm
    z_norm = norm(diff.ravel(), 0)  # Zero norm
    return (m_norm, z_norm)

如果文件是彩色图像,imread 返回一个 3D 数组,平均 RGB 通道(最后一个)阵列轴)以获得强度。 不需要对灰度图像执行此操作(例如 .pgm):

def to_grayscale(arr):
    "If arr is a color image (3D array), convert it to grayscale (2D array)."
    if len(arr.shape) == 3:
        return average(arr, -1)  # average over the last axis (color channels)
    else:
        return arr

标准化很简单,您可以选择标准化为 [0,1] 而不是 [0,255]。 arr 在这里是一个 SciPy 数组,因此所有操作都是按元素进行的:

def normalize(arr):
    rng = arr.max()-arr.min()
    amin = arr.min()
    return (arr-amin)*255/rng

运行 main 函数:

if __name__ == "__main__":
    main()

现在您可以将这一切放入脚本中并针对两个图像运行。 如果我们将图像与其本身进行比较,则没有区别:

$ python compare.py one.jpg one.jpg
Manhattan norm: 0.0 / per pixel: 0.0
Zero norm: 0 / per pixel: 0.0

如果我们模糊图像并与原始图像进行比较,则存在一些差异:

$ python compare.py one.jpg one-blurred.jpg 
Manhattan norm: 92605183.67 / per pixel: 13.4210411116
Zero norm: 6900000 / per pixel: 1.0

PS Entire compare.py 脚本。

更新:相关技术

由于问题是关于视频序列,其中帧可能几乎相同,并且您寻找不寻常的东西,我想提一些可能相关的替代方法:

  • 背景减法和分割(以检测前景物体)
  • 稀疏光流(检测运动)
  • 比较直方图或其他一些统计数据而不是图像

我强烈建议看一下“学习 OpenCV”一书,第 9 章(图像部分和分割)和第 10 章(跟踪和运动)。 前者教导使用背景扣除方法,后者给出一些有关光流方法的信息。 所有方法均在 OpenCV 库中实现。 如果你使用Python,我建议使用OpenCV ≥ 2.3,及其cv2 Python模块。

背景减法最简单的版本:

  • 学习背景每个像素的平均值 μ 和标准差 σ
  • 将当前像素值与 (μ-2σ,μ+2σ) 或 (μ-σ,μ+σ) 范围进行比较)

更高级的版本考虑每个像素的时间序列并处理非静态场景(例如移动的树木或草地)。

光流的思想是获取两个或更多帧,并将速度矢量分配给每个像素(密集光流)或其中一些像素(稀疏光流)。 要估计稀疏光流,您可以使用 Lucas-Kanade 方法(它也在 OpenCV 中实现)。 显然,如果存在大量流动(速度场的平均值高于速度场的最大值),则说明帧中存在某些物体正在移动,并且后续图像会更加不同。

比较直方图可能有助于检测连续帧之间的突然变化。 Courbon 等人,2010 使用了这种方法:

连续帧的相似度。测量两个连续帧之间的距离。 如果太高,则意味着第二帧已损坏,从而图像被消除。 直方图上的 Kullback–Leibler 距离,或互熵,在两帧:

$$ d(p,q) = \sum_i p(i) \log (p(i)/q( i)) $$

其中pq是使用帧的直方图。 阈值固定为0.2。

General idea

Option 1: Load both images as arrays (scipy.misc.imread) and calculate an element-wise (pixel-by-pixel) difference. Calculate the norm of the difference.

Option 2: Load both images. Calculate some feature vector for each of them (like a histogram). Calculate distance between feature vectors rather than images.

However, there are some decisions to make first.

Questions

You should answer these questions first:

  • Are images of the same shape and dimension?

    If not, you may need to resize or crop them. PIL library will help to do it in Python.

    If they are taken with the same settings and the same device, they are probably the same.

  • Are images well-aligned?

    If not, you may want to run cross-correlation first, to find the best alignment first. SciPy has functions to do it.

    If the camera and the scene are still, the images are likely to be well-aligned.

  • Is exposure of the images always the same? (Is lightness/contrast the same?)

    If not, you may want to normalize images.

    But be careful, in some situations this may do more wrong than good. For example, a single bright pixel on a dark background will make the normalized image very different.

  • Is color information important?

    If you want to notice color changes, you will have a vector of color values per point, rather than a scalar value as in gray-scale image. You need more attention when writing such code.

  • Are there distinct edges in the image? Are they likely to move?

    If yes, you can apply edge detection algorithm first (e.g. calculate gradient with Sobel or Prewitt transform, apply some threshold), then compare edges on the first image to edges on the second.

  • Is there noise in the image?

    All sensors pollute the image with some amount of noise. Low-cost sensors have more noise. You may wish to apply some noise reduction before you compare images. Blur is the most simple (but not the best) approach here.

  • What kind of changes do you want to notice?

    This may affect the choice of norm to use for the difference between images.

    Consider using Manhattan norm (the sum of the absolute values) or zero norm (the number of elements not equal to zero) to measure how much the image has changed. The former will tell you how much the image is off, the latter will tell only how many pixels differ.

Example

I assume your images are well-aligned, the same size and shape, possibly with different exposure. For simplicity, I convert them to grayscale even if they are color (RGB) images.

You will need these imports:

import sys

from scipy.misc import imread
from scipy.linalg import norm
from scipy import sum, average

Main function, read two images, convert to grayscale, compare and print results:

def main():
    file1, file2 = sys.argv[1:1+2]
    # read images as 2D arrays (convert to grayscale for simplicity)
    img1 = to_grayscale(imread(file1).astype(float))
    img2 = to_grayscale(imread(file2).astype(float))
    # compare
    n_m, n_0 = compare_images(img1, img2)
    print "Manhattan norm:", n_m, "/ per pixel:", n_m/img1.size
    print "Zero norm:", n_0, "/ per pixel:", n_0*1.0/img1.size

How to compare. img1 and img2 are 2D SciPy arrays here:

def compare_images(img1, img2):
    # normalize to compensate for exposure difference, this may be unnecessary
    # consider disabling it
    img1 = normalize(img1)
    img2 = normalize(img2)
    # calculate the difference and its norms
    diff = img1 - img2  # elementwise for scipy arrays
    m_norm = sum(abs(diff))  # Manhattan norm
    z_norm = norm(diff.ravel(), 0)  # Zero norm
    return (m_norm, z_norm)

If the file is a color image, imread returns a 3D array, average RGB channels (the last array axis) to obtain intensity. No need to do it for grayscale images (e.g. .pgm):

def to_grayscale(arr):
    "If arr is a color image (3D array), convert it to grayscale (2D array)."
    if len(arr.shape) == 3:
        return average(arr, -1)  # average over the last axis (color channels)
    else:
        return arr

Normalization is trivial, you may choose to normalize to [0,1] instead of [0,255]. arr is a SciPy array here, so all operations are element-wise:

def normalize(arr):
    rng = arr.max()-arr.min()
    amin = arr.min()
    return (arr-amin)*255/rng

Run the main function:

if __name__ == "__main__":
    main()

Now you can put this all in a script and run against two images. If we compare image to itself, there is no difference:

$ python compare.py one.jpg one.jpg
Manhattan norm: 0.0 / per pixel: 0.0
Zero norm: 0 / per pixel: 0.0

If we blur the image and compare to the original, there is some difference:

$ python compare.py one.jpg one-blurred.jpg 
Manhattan norm: 92605183.67 / per pixel: 13.4210411116
Zero norm: 6900000 / per pixel: 1.0

P.S. Entire compare.py script.

Update: relevant techniques

As the question is about a video sequence, where frames are likely to be almost the same, and you look for something unusual, I'd like to mention some alternative approaches which may be relevant:

  • background subtraction and segmentation (to detect foreground objects)
  • sparse optical flow (to detect motion)
  • comparing histograms or some other statistics instead of images

I strongly recommend taking a look at “Learning OpenCV” book, Chapters 9 (Image parts and segmentation) and 10 (Tracking and motion). The former teaches to use Background subtraction method, the latter gives some info on optical flow methods. All methods are implemented in OpenCV library. If you use Python, I suggest to use OpenCV ≥ 2.3, and its cv2 Python module.

The most simple version of the background subtraction:

  • learn the average value μ and standard deviation σ for every pixel of the background
  • compare current pixel values to the range of (μ-2σ,μ+2σ) or (μ-σ,μ+σ)

More advanced versions make take into account time series for every pixel and handle non-static scenes (like moving trees or grass).

The idea of optical flow is to take two or more frames, and assign velocity vector to every pixel (dense optical flow) or to some of them (sparse optical flow). To estimate sparse optical flow, you may use Lucas-Kanade method (it is also implemented in OpenCV). Obviously, if there is a lot of flow (high average over max values of the velocity field), then something is moving in the frame, and subsequent images are more different.

Comparing histograms may help to detect sudden changes between consecutive frames. This approach was used in Courbon et al, 2010:

Similarity of consecutive frames. The distance between two consecutive frames is measured. If it is too high, it means that the second frame is corrupted and thus the image is eliminated. The Kullback–Leibler distance, or mutual entropy, on the histograms of the two frames:

$$ d(p,q) = \sum_i p(i) \log (p(i)/q(i)) $$

where p and q are the histograms of the frames is used. The threshold is fixed on 0.2.

眼中杀气 2024-07-14 05:33:40

简单的解决方案:

将图像编码为jpeg,并查找文件大小的重大变化。

我已经实现了与视频缩略图类似的功能,并取得了很大的成功和可扩展性。

A simple solution:

Encode the image as a jpeg and look for a substantial change in filesize.

I've implemented something similar with video thumbnails, and had a lot of success and scalability.

不气馁 2024-07-14 05:33:40

您可以使用 PIL 中的函数来比较两个图像。

import Image
import ImageChops

im1 = Image.open("splash.png")
im2 = Image.open("splash2.png")

diff = ImageChops.difference(im2, im1)

diff 对象是一个图像,其中每个像素都是从第一图像中减去第二图像中该像素的颜色值的结果。 使用差异图像您可以做几件事。 最简单的一个是 diff.getbbox() 函数。 它会告诉您包含两个图像之间所有变化的最小矩形。

您也可以使用 PIL 中的函数来实现此处提到的其他内容的近似值。

You can compare two images using functions from PIL.

import Image
import ImageChops

im1 = Image.open("splash.png")
im2 = Image.open("splash2.png")

diff = ImageChops.difference(im2, im1)

The diff object is an image in which every pixel is the result of the subtraction of the color values of that pixel in the second image from the first image. Using the diff image you can do several things. The simplest one is the diff.getbbox() function. It will tell you the minimal rectangle that contains all the changes between your two images.

You can probably implement approximations of the other stuff mentioned here using functions from PIL as well.

夜光 2024-07-14 05:33:40

两种流行且相对简单的方法是:(a)已经建议的欧几里德距离,或(b)归一化互相关。 归一化互相关往往比简单互相关对光照变化更稳健。 维基百科给出了归一化互相关的公式。 也存在更复杂的方法,但它们需要更多的工作。

使用类似 numpy 的语法,

dist_euclidean = sqrt(sum((i1 - i2)^2)) / i1.size

dist_manhattan = sum(abs(i1 - i2)) / i1.size

dist_ncc = sum( (i1 - mean(i1)) * (i2 - mean(i2)) ) / (
  (i1.size - 1) * stdev(i1) * stdev(i2) )

假设 i1i2 是 2D 灰度图像数组。

Two popular and relatively simple methods are: (a) the Euclidean distance already suggested, or (b) normalized cross-correlation. Normalized cross-correlation tends to be noticeably more robust to lighting changes than simple cross-correlation. Wikipedia gives a formula for the normalized cross-correlation. More sophisticated methods exist too, but they require quite a bit more work.

Using numpy-like syntax,

dist_euclidean = sqrt(sum((i1 - i2)^2)) / i1.size

dist_manhattan = sum(abs(i1 - i2)) / i1.size

dist_ncc = sum( (i1 - mean(i1)) * (i2 - mean(i2)) ) / (
  (i1.size - 1) * stdev(i1) * stdev(i2) )

assuming that i1 and i2 are 2D grayscale image arrays.

极度宠爱 2024-07-14 05:33:40

尝试一个简单的事情:

将两个图像重新采样为小缩略图(例如 64 x 64),并将缩略图逐像素与特定阈值进行比较。 如果原始图像几乎相同,则重新采样的缩略图将非常相似甚至完全相同。 此方法可以处理尤其是在低光场景中可能出现的噪点。 如果你使用灰度,效果可能会更好。

A trivial thing to try:

Resample both images to small thumbnails (e.g. 64 x 64) and compare the thumbnails pixel-by-pixel with a certain threshold. If the original images are almost the same, the resampled thumbnails will be very similar or even exactly the same. This method takes care of noise that can occur especially in low-light scenes. It may even be better if you go grayscale.

凶凌 2024-07-14 05:33:40

另一种很好的、​​简单的方法来测量两个图像之间的相似性:

import sys
from skimage.measure import compare_ssim
from skimage.transform import resize
from scipy.ndimage import imread

# get two images - resize both to 1024 x 1024
img_a = resize(imread(sys.argv[1]), (2**10, 2**10))
img_b = resize(imread(sys.argv[2]), (2**10, 2**10))

# score: {-1:1} measure of the structural similarity between the images
score, diff = compare_ssim(img_a, img_b, full=True)
print(score)

如果其他人对比较图像相似性的更强大的方法感兴趣,我会整理一个 教程 和网页 应用 使用 Tensorflow 测量和可视化类似图像。

Another nice, simple way to measure the similarity between two images:

import sys
from skimage.measure import compare_ssim
from skimage.transform import resize
from scipy.ndimage import imread

# get two images - resize both to 1024 x 1024
img_a = resize(imread(sys.argv[1]), (2**10, 2**10))
img_b = resize(imread(sys.argv[2]), (2**10, 2**10))

# score: {-1:1} measure of the structural similarity between the images
score, diff = compare_ssim(img_a, img_b, full=True)
print(score)

If others are interested in a more powerful way to compare image similarity, I put together a tutorial and web app for measuring and visualizing similar images using Tensorflow.

萌辣 2024-07-14 05:33:40

我在工作中遇到了类似的问题,我正在重写我们的图像转换端点,我想检查新版本是否产生与旧版本相同或几乎相同的输出。 所以我写了这个:

https://github.com/nicolashahn/diffimg

它对相同的图像进行操作大小,并在每个像素级别测量每个通道的值差异:R、G、B(、A),取这些通道的平均差异,然后对所有像素的差异进行平均,并返回比率。

例如,对于白色像素的 10x10 图像,同一图像但一个像素更改为红色,则该像素处的差异为 1/3 或 0.33...(RGB 0,0,0 与 255,0,0 ),所有其他像素为 0。总共 100 个像素,0.33.../100 = 图像中约 0.33% 的差异。

我相信这对于 OP 的项目来说非常适合(我意识到这是一篇非常旧的帖子,但为将来也想在 python 中比较图像的 StackOverflowers 发布)。

I had a similar problem at work, I was rewriting our image transform endpoint and I wanted to check that the new version was producing the same or nearly the same output as the old version. So I wrote this:

https://github.com/nicolashahn/diffimg

Which operates on images of the same size, and at a per-pixel level, measures the difference in values at each channel: R, G, B(, A), takes the average difference of those channels, and then averages the difference over all pixels, and returns a ratio.

For example, with a 10x10 image of white pixels, and the same image but one pixel has changed to red, the difference at that pixel is 1/3 or 0.33... (RGB 0,0,0 vs 255,0,0) and at all other pixels is 0. With 100 pixels total, 0.33.../100 = a ~0.33% difference in image.

I believe this would work perfectly for OP's project (I realize this is a very old post now, but posting for future StackOverflowers who also want to compare images in python).

水水月牙 2024-07-14 05:33:40

我正在具体解决如何计算它们是否“足够不同”的问题。 我想你能弄清楚如何一一减去像素。

首先,我会拍摄一堆没有任何变化的图像,并找出由于捕获的变化、成像系统中的噪声、JPEG 压缩伪影和时刻而导致任何像素变化的最大量。 - 光照的即时变化。 也许您会发现,即使没有任何变化,也会出现 1 或 2 位差异。

然后,对于“真实”测试,您需要这样的标准:

  • 如果最多 P 个像素的差异不超过 E,则相同。

因此,也许,如果 E = 0.02,P = 1000,这意味着(大约)它将如果任何单个像素变化超过 ~5 个单位(假设 8 位图像),或者超过 1000 个像素有任何错误,则为“不同”。

这主要是作为一种良好的“分类”技术,以快速识别足够接近而无需进一步检查的图像。 “失败”的图像可能更多地是一种更复杂/更昂贵的技术,例如,如果相机稍微晃动,或者对照明变化更稳健,则不会出现误报。

我运行一个开源项目 OpenImageIO,其中包含一个名为“idiff”的实用程序,该实用程序将差异与阈值进行比较,如下所示(实际上,甚至更详细)。 即使您不想使用该软件,您也可能需要查看源代码以了解我们是如何做到的。 它在商业上得到了相当多的使用,并且开发了这种阈值技术,以便我们可以拥有一个用于渲染和图像处理软件的测试套件,其中“参考图像”可能在不同平台之间存在微小差异,或者当我们对那个算法,所以我们想要一个“容差范围内的匹配”操作。

I am addressing specifically the question of how to compute if they are "different enough". I assume you can figure out how to subtract the pixels one by one.

First, I would take a bunch of images with nothing changing, and find out the maximum amount that any pixel changes just because of variations in the capture, noise in the imaging system, JPEG compression artifacts, and moment-to-moment changes in lighting. Perhaps you'll find that 1 or 2 bit differences are to be expected even when nothing moves.

Then for the "real" test, you want a criterion like this:

  • same if up to P pixels differ by no more than E.

So, perhaps, if E = 0.02, P = 1000, that would mean (approximately) that it would be "different" if any single pixel changes by more than ~5 units (assuming 8-bit images), or if more than 1000 pixels had any errors at all.

This is intended mainly as a good "triage" technique to quickly identify images that are close enough to not need further examination. The images that "fail" may then more to a more elaborate/expensive technique that wouldn't have false positives if the camera shook bit, for example, or was more robust to lighting changes.

I run an open source project, OpenImageIO, that contains a utility called "idiff" that compares differences with thresholds like this (even more elaborate, actually). Even if you don't want to use this software, you may want to look at the source to see how we did it. It's used commercially quite a bit and this thresholding technique was developed so that we could have a test suite for rendering and image processing software, with "reference images" that might have small differences from platform-to-platform or as we made minor tweaks to tha algorithms, so we wanted a "match within tolerance" operation.

寂寞陪衬 2024-07-14 05:33:40

给出的大多数答案都不会涉及照明级别。

在进行比较之前,我首先将图像标准化为标准光照水平。

Most of the answers given won't deal with lighting levels.

I would first normalize the image to a standard light level before doing the comparison.

诺曦 2024-07-14 05:33:40

您看过查找相似图像的算法问题吗? 检查一下以查看建议。

我建议对你的帧进行小波变换(我已经使用 Haar 变换编写了一个 C 扩展); 然后,比较两张图片之间最大(按比例)小波因子的索引,您应该得到数值相似度近似值。

Have you seen the Algorithm for finding similar images question? Check it out to see suggestions.

I would suggest a wavelet transformation of your frames (I've written a C extension for that using Haar transformation); then, comparing the indexes of the largest (proportionally) wavelet factors between the two pictures, you should get a numerical similarity approximation.

如梦初醒的夏天 2024-07-14 05:33:40

我遇到了同样的问题,并编写了一个简单的 python 模块,该模块使用枕头的 ImageChops 比较两个相同大小的图像,以创建黑白差异图像并总结直方图值。

您可以直接获得此分数,也可以获得与全黑与白色差异相比的百分比值。

它还包含一个简单的 is_equal 函数,可以在图像传递为相等的情况下(并包括)提供模糊阈值。

该方法不是很复杂,但也许对其他遇到同样问题的人有用。

https://pypi.python.org/pypi/imgcompare/

I had the same problem and wrote a simple python module which compares two same-size images using pillow's ImageChops to create a black/white diff image and sums up the histogram values.

You can get either this score directly, or a percentage value compared to a full black vs. white diff.

It also contains a simple is_equal function, with the possibility to supply a fuzzy-threshold under (and including) the image passes as equal.

The approach is not very elaborate, but maybe is of use for other out there struggling with the same issue.

https://pypi.python.org/pypi/imgcompare/

眼藏柔 2024-07-14 05:33:40

如果回复太晚了,我深表歉意,但由于我一直在做类似的事情,所以我想我可以做出一些贡献。

也许使用 OpenCV,您可以使用模板匹配。 假设您按照您所说的那样使用网络摄像头:

  1. 简化图像(可能是阈值?)
  2. 应用模板匹配并使用 minMaxLoc 检查 max_val

提示:max_val(或 min_val 取决于所使用的方法)将为您提供数字,大数字。 要获得百分比差异,请使用与相同图像匹配的模板 - 结果将是 100%。

伪代码举例:

previous_screenshot = ...
current_screenshot = ...

# simplify both images somehow

# get the 100% corresponding value
res = matchTemplate(previous_screenshot, previous_screenshot, TM_CCOEFF)
_, hundred_p_val, _, _ = minMaxLoc(res)

# hundred_p_val is now the 100%

res = matchTemplate(previous_screenshot, current_screenshot, TM_CCOEFF)
_, max_val, _, _ = minMaxLoc(res)

difference_percentage = max_val / hundred_p_val

# the tolerance is now up to you

希望有帮助。

I apologize if this is too late to reply, but since I've been doing something similar I thought I could contribute somehow.

Maybe with OpenCV you could use template matching. Assuming you're using a webcam as you said:

  1. Simplify the images (thresholding maybe?)
  2. Apply template matching and check the max_val with minMaxLoc

Tip: max_val (or min_val depending on the method used) will give you numbers, large numbers. To get the difference in percentage, use template matching with the same image -- the result will be your 100%.

Pseudo code to exemplify:

previous_screenshot = ...
current_screenshot = ...

# simplify both images somehow

# get the 100% corresponding value
res = matchTemplate(previous_screenshot, previous_screenshot, TM_CCOEFF)
_, hundred_p_val, _, _ = minMaxLoc(res)

# hundred_p_val is now the 100%

res = matchTemplate(previous_screenshot, current_screenshot, TM_CCOEFF)
_, max_val, _, _ = minMaxLoc(res)

difference_percentage = max_val / hundred_p_val

# the tolerance is now up to you

Hope it helps.

素染倾城色 2024-07-14 05:33:40

您可以计算两个图像的直方图,然后计算 Bhattacharyya 系数,这是一个非常快的算法,我用它来检测板球视频中的镜头变化(在 C 中使用 openCV)

you can compute the histogram of both the images and then calculate the Bhattacharyya Coefficient, this is a very fast algorithm and I have used it to detect shot changes in a cricket video (in C using openCV)

如梦 2024-07-14 05:33:40
import os
from PIL import Image
from PIL import ImageFile
import imagehash
  
#just use to the size diferent picture
def compare_image(img_file1, img_file2):
    if img_file1 == img_file2:
        return True
    fp1 = open(img_file1, 'rb')
    fp2 = open(img_file2, 'rb')

    img1 = Image.open(fp1)
    img2 = Image.open(fp2)

    ImageFile.LOAD_TRUNCATED_IMAGES = True
    b = img1 == img2

    fp1.close()
    fp2.close()

    return b





#through picturu hash to compare
def get_hash_dict(dir):
    hash_dict = {}
    image_quantity = 0
    for _, _, files in os.walk(dir):
        for i, fileName in enumerate(files):
            with open(dir + fileName, 'rb') as fp:
                hash_dict[dir + fileName] = imagehash.average_hash(Image.open(fp))
                image_quantity += 1

    return hash_dict, image_quantity

def compare_image_with_hash(image_file_name_1, image_file_name_2, max_dif=0):
    """
    max_dif: The maximum hash difference is allowed, the smaller and more accurate, the minimum is 0.
    recommend to use
    """
    ImageFile.LOAD_TRUNCATED_IMAGES = True
    hash_1 = None
    hash_2 = None
    with open(image_file_name_1, 'rb') as fp:
        hash_1 = imagehash.average_hash(Image.open(fp))
    with open(image_file_name_2, 'rb') as fp:
        hash_2 = imagehash.average_hash(Image.open(fp))
    dif = hash_1 - hash_2
    if dif < 0:
        dif = -dif
    if dif <= max_dif:
        return True
    else:
        return False


def compare_image_dir_with_hash(dir_1, dir_2, max_dif=0):
    """
    max_dif: The maximum hash difference is allowed, the smaller and more accurate, the minimum is 0.

    """
    ImageFile.LOAD_TRUNCATED_IMAGES = True
    hash_dict_1, image_quantity_1 = get_hash_dict(dir_1)
    hash_dict_2, image_quantity_2 = get_hash_dict(dir_2)

    if image_quantity_1 > image_quantity_2:
        tmp = image_quantity_1
        image_quantity_1 = image_quantity_2
        image_quantity_2 = tmp

        tmp = hash_dict_1
        hash_dict_1 = hash_dict_2
        hash_dict_2 = tmp

    result_dict = {}

    for k in hash_dict_1.keys():
        result_dict[k] = None

    for dif_i in range(0, max_dif + 1):
        have_none = False

        for k_1 in result_dict.keys():
            if result_dict.get(k_1) is None:
                have_none = True

        if not have_none:
            return result_dict

        for k_1, v_1 in hash_dict_1.items():
            for k_2, v_2 in hash_dict_2.items():
                sub = (v_1 - v_2)
                if sub < 0:
                    sub = -sub
                if sub == dif_i and result_dict.get(k_1) is None:
                    result_dict[k_1] = k_2
                    break
    return result_dict


def main():
    print(compare_image('image1\\815.jpg', 'image2\\5.jpg'))
    print(compare_image_with_hash('image1\\815.jpg', 'image2\\5.jpg', 7))
    r = compare_image_dir_with_hash('image1\\', 'image2\\', 10)
    for k in r.keys():
        print(k, r.get(k))


if __name__ == '__main__':
    main()
  • 输出:

    错误
    真实
    图片2\5.jpg 图片1\815.jpg
    图片2\6.jpg 图片1\819.jpg
    图片2\7.jpg 图片1\900.jpg
    图片2\8.jpg 图片1\998.jpg
    图片2\9.jpg 图片1\1012.jpg

  • 示例图片:

    • 815.jpg

      815。 jpg

    • 5.jpg

      5. jpg

import os
from PIL import Image
from PIL import ImageFile
import imagehash
  
#just use to the size diferent picture
def compare_image(img_file1, img_file2):
    if img_file1 == img_file2:
        return True
    fp1 = open(img_file1, 'rb')
    fp2 = open(img_file2, 'rb')

    img1 = Image.open(fp1)
    img2 = Image.open(fp2)

    ImageFile.LOAD_TRUNCATED_IMAGES = True
    b = img1 == img2

    fp1.close()
    fp2.close()

    return b





#through picturu hash to compare
def get_hash_dict(dir):
    hash_dict = {}
    image_quantity = 0
    for _, _, files in os.walk(dir):
        for i, fileName in enumerate(files):
            with open(dir + fileName, 'rb') as fp:
                hash_dict[dir + fileName] = imagehash.average_hash(Image.open(fp))
                image_quantity += 1

    return hash_dict, image_quantity

def compare_image_with_hash(image_file_name_1, image_file_name_2, max_dif=0):
    """
    max_dif: The maximum hash difference is allowed, the smaller and more accurate, the minimum is 0.
    recommend to use
    """
    ImageFile.LOAD_TRUNCATED_IMAGES = True
    hash_1 = None
    hash_2 = None
    with open(image_file_name_1, 'rb') as fp:
        hash_1 = imagehash.average_hash(Image.open(fp))
    with open(image_file_name_2, 'rb') as fp:
        hash_2 = imagehash.average_hash(Image.open(fp))
    dif = hash_1 - hash_2
    if dif < 0:
        dif = -dif
    if dif <= max_dif:
        return True
    else:
        return False


def compare_image_dir_with_hash(dir_1, dir_2, max_dif=0):
    """
    max_dif: The maximum hash difference is allowed, the smaller and more accurate, the minimum is 0.

    """
    ImageFile.LOAD_TRUNCATED_IMAGES = True
    hash_dict_1, image_quantity_1 = get_hash_dict(dir_1)
    hash_dict_2, image_quantity_2 = get_hash_dict(dir_2)

    if image_quantity_1 > image_quantity_2:
        tmp = image_quantity_1
        image_quantity_1 = image_quantity_2
        image_quantity_2 = tmp

        tmp = hash_dict_1
        hash_dict_1 = hash_dict_2
        hash_dict_2 = tmp

    result_dict = {}

    for k in hash_dict_1.keys():
        result_dict[k] = None

    for dif_i in range(0, max_dif + 1):
        have_none = False

        for k_1 in result_dict.keys():
            if result_dict.get(k_1) is None:
                have_none = True

        if not have_none:
            return result_dict

        for k_1, v_1 in hash_dict_1.items():
            for k_2, v_2 in hash_dict_2.items():
                sub = (v_1 - v_2)
                if sub < 0:
                    sub = -sub
                if sub == dif_i and result_dict.get(k_1) is None:
                    result_dict[k_1] = k_2
                    break
    return result_dict


def main():
    print(compare_image('image1\\815.jpg', 'image2\\5.jpg'))
    print(compare_image_with_hash('image1\\815.jpg', 'image2\\5.jpg', 7))
    r = compare_image_dir_with_hash('image1\\', 'image2\\', 10)
    for k in r.keys():
        print(k, r.get(k))


if __name__ == '__main__':
    main()
  • output:

    False
    True
    image2\5.jpg image1\815.jpg
    image2\6.jpg image1\819.jpg
    image2\7.jpg image1\900.jpg
    image2\8.jpg image1\998.jpg
    image2\9.jpg image1\1012.jpg

  • the example pictures:

    • 815.jpg

      815.jpg

    • 5.jpg

      5.jpg

往事风中埋 2024-07-14 05:33:40

地球移动距离 可能正是您所需要的。
不过,实时实施可能会有点繁重。

Earth movers distance might be exactly what you need.
It might be abit heavy to implement in real time though.

你在我安 2024-07-14 05:33:40

计算两个图像的曼哈顿距离怎么样? 这给你 n*n 个值。 然后,您可以执行诸如行平均值之类的操作来减少到 n 个值,并对其进行函数以获得一个值。

What about calculating the Manhattan Distance of the two images. That gives you n*n values. Then you could do something like an row average to reduce to n values and a function over that to get one single value.

玩心态 2024-07-14 05:33:40

我认为您可以简单地计算两个图像的亮度之间的欧几里德距离(即 sqrt(差异平方和,逐像素)),并在低于某个经验阈值时认为它们相等。 你最好将它封装成一个 C 函数。

I think you could simply compute the euclidean distance (i.e. sqrt(sum of squares of differences, pixel by pixel)) between the luminance of the two images, and consider them equal if this falls under some empirical threshold. And you would better do it wrapping a C function.

韶华倾负 2024-07-14 05:33:40

我很幸运地使用三脚架上的同一台相机拍摄了 jpg 图像
(1) 大幅简化(如从 3000 像素宽变为 100 像素宽甚至更少)
(2) 将每个 jpg 数组展平为单个向量
(3)用简单的相关算法对序列图像进行两两相关,得到相关系数
(4) 对相关系数进行平方以获得 r 平方(即一幅图像中的变异性分数由下一幅图像中的变异性解释)
(5) 一般在我的应用中,如果 r-square < 0.9,我说这两个图像是不同的,并且中间发生了一些事情。

这在我的实现中是稳健且快速的(Mathematica 7),

值得尝试一下您感兴趣的图像部分,并通过将所有图像裁剪到该小区域来重点关注该部分,否则远离相机但很重要将会错过改变。

我不知道如何使用Python,但我确信它也能进行关联,不是吗?

I have been having a lot of luck with jpg images taken with the same camera on a tripod by
(1) simplifying greatly (like going from 3000 pixels wide to 100 pixels wide or even fewer)
(2) flattening each jpg array into a single vector
(3) pairwise correlating sequential images with a simple correlate algorithm to get correlation coefficient
(4) squaring correlation coefficient to get r-square (i.e fraction of variability in one image explained by variation in the next)
(5) generally in my application if r-square < 0.9, I say the two images are different and something happened in between.

This is robust and fast in my implementation (Mathematica 7)

It's worth playing around with the part of the image you are interested in and focussing on that by cropping all images to that little area, otherwise a distant-from-the-camera but important change will be missed.

I don't know how to use Python, but am sure it does correlations, too, no?

空宴 2024-07-14 05:33:40

了解 isk-daemon 是如何实现 Haar 小波的。 您可以使用它的 imgdb C++ 代码来实时计算图像之间的差异:

isk-daemon 是一个开源数据库服务器,能够向任何图像相关网站或软件添加基于内容(视觉)的图像搜索。

该技术允许任何与图像相关的网站或软件的用户在小部件上绘制他们想要查找的图像,并让网站回复他们最相似的图像,或者只是在每个图像详细信息页面请求更多相似的照片。

Check out how Haar Wavelets are implemented by isk-daemon. You could use it's imgdb C++ code to calculate the difference between images on-the-fly:

isk-daemon is an open source database server capable of adding content-based (visual) image searching to any image related website or software.

This technology allows users of any image-related website or software to sketch on a widget which image they want to find and have the website reply to them the most similar images or simply request for more similar photos at each image detail page.

◇流星雨 2024-07-14 05:33:40

一种更有原则性的方法是使用全局描述符来比较图像,例如 GIST 或 CENTRIST。 哈希函数,如此处所述,提供了类似的解决方案。

A somewhat more principled approach is to use a global descriptor to compare images, such as GIST or CENTRIST. A hash function, as described here, also provides a similar solution.

泛滥成性 2024-07-14 05:33:40

有一个使用 numpy 计算均方误差的简单快速的解决方案:

before = np.array(get_picture())
while True:
    now = np.array(get_picture())
    MSE = np.mean((now - before)**2)

    if  MSE > threshold:
        break

    before = now

There's a simple and fast solution using numpy by calculating mean squared error:

before = np.array(get_picture())
while True:
    now = np.array(get_picture())
    MSE = np.mean((now - before)**2)

    if  MSE > threshold:
        break

    before = now
白鸥掠海 2024-07-14 05:33:40

这是我编写的一个函数,它接受 2 个图像(文件路径)作为参数,并返回两个图像像素分量之间的平均差异。 这对我来说非常有效,可以确定视觉上“相等”的图像(当它们不 == 相等时)。

(我发现 8 是确定图像是否本质上相同的一个很好的限制。)

(如果不添加任何预处理,图像必须具有相同的尺寸。)

from PIL import Image

def imagesDifference( imageA, imageB ):
    A = list(Image.open(imageA, r'r').convert(r'RGB').getdata())
    B = list(Image.open(imageB, r'r').convert(r'RGB').getdata())
    if (len(A) != len(B)): return -1
    diff = []
    for i in range(0, len(A)):
        diff += [abs(A[i][0] - B[i][0]), abs(A[i][1] - B[i][1]), abs(A[i][2] - B[i][2])]
    return (sum(diff) / len(diff))

Here is a function I wrote, which takes 2 images (filepaths) as arguments and returns the average difference between the two images' pixels' components. This worked pretty well for me to determine visually "equal" images (when they're not == equal).

(I found 8 to be a good limit to determine if images are essentially the same.)

(Images must have the same dimensions if you add no preprocessing to this.)

from PIL import Image

def imagesDifference( imageA, imageB ):
    A = list(Image.open(imageA, r'r').convert(r'RGB').getdata())
    B = list(Image.open(imageB, r'r').convert(r'RGB').getdata())
    if (len(A) != len(B)): return -1
    diff = []
    for i in range(0, len(A)):
        diff += [abs(A[i][0] - B[i][0]), abs(A[i][1] - B[i][1]), abs(A[i][2] - B[i][2])]
    return (sum(diff) / len(diff))
凡尘雨 2024-07-14 05:33:40

有许多指标可用于评估两个图像是否相似/相似程度如何。

我不会在这里讨论任何代码,因为我认为这应该是一个科学问题,而不是技术问题。

一般来说,这个问题与人类对图像的感知有关,因此每种算法都有其对人类视觉系统特征的支持。

经典方法有:

可见差异预测器:一种用于评估图像保真度的算法 (https://www.spiedigitallibrary.org/conference-proceedings-of-spie/1666/ 0000/Visible-differences-predictor--an-algorithm-for-the-assessment-of/10.1117/12.135952.short?SSO=1)

图像质量评估:从错误可见性到结构相似性 (http://www.cns.nyu.edu/pub/lcv/wang03-reprint。 pdf)

FSIM:图像质量评估的特征相似性指数 (https://www4.comp.polyu.edu.hk/~cslzhang/IQA/TIP_IQA_FSIM.pdf

其中,SSIM(图像质量评估:从错误可见性到结构相似性)最容易计算而且它的开销也很小,正如另一篇论文“基于梯度相似性的图像质量评估”(https://www.semanticscholar.org/paper/Image-Quality-Assessment-Based-on-Gradient-Liu-Lin/2b819bef80c02d5d4cb56f27b2 02535e119df988)。

还有更多其他方法。 如果您对艺术感兴趣/真正关心的话,请查看 Google Scholar 并搜索“视觉差异”、“图像质量评估”等内容。

There are many metrics out there for evaluating whether two images look like/how much they look like.

I will not go into any code here, because I think it should be a scientific problem, other than a technical problem.

Generally, the question is related to human's perception on images, so each algorithm has its support on human visual system traits.

Classic approaches are:

Visible differences predictor: an algorithm for the assessment of image fidelity (https://www.spiedigitallibrary.org/conference-proceedings-of-spie/1666/0000/Visible-differences-predictor--an-algorithm-for-the-assessment-of/10.1117/12.135952.short?SSO=1)

Image Quality Assessment: From Error Visibility to Structural Similarity (http://www.cns.nyu.edu/pub/lcv/wang03-reprint.pdf)

FSIM: A Feature Similarity Index for Image Quality Assessment (https://www4.comp.polyu.edu.hk/~cslzhang/IQA/TIP_IQA_FSIM.pdf)

Among them, SSIM (Image Quality Assessment: From Error Visibility to Structural Similarity ) is the easiest to calculate and its overhead is also small, as reported in another paper "Image Quality Assessment Based on Gradient Similarity" (https://www.semanticscholar.org/paper/Image-Quality-Assessment-Based-on-Gradient-Liu-Lin/2b819bef80c02d5d4cb56f27b202535e119df988).

There are many more other approaches. Take a look at Google Scholar and search for something like "visual difference", "image quality assessment", etc, if you are interested/really care about the art.

森林迷了鹿 2024-07-14 05:33:40

使用 SSIM 测量 2 个图像之间的结构相似性指数测量。

Use SSIM to measure the Structural Similarity Index Measure between 2 images.

我做我的改变 2024-07-14 05:33:40

如果有人需要检查图像质量指标,请查看这个非常有用的 python 包。 sewar 项目

Check out this quite useful python package if someone may need to check image quality metric. project sewar

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文