当前位置：文江博客话题详情

OPENCV同型术后删除所有黑色

发布于 2025-02-06 20:41:08 字数 389 浏览 3 评论 0原文

我正在使用OPENCV同件将图像缝合在一起。如何删除图像之间的负空间，如下所示？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

断桥再见 2025-02-13 20:41:09

如果您愿意使用非OPENCV解决方案，则可以使用ImageMagick 7 -trim（或使用ImageMagick的Python Wand）。您可以使用subprocess.call

输入：

magick stitched_panorama.png -background black -define trim:percent-background=0% -fuzz 5% -trim +repage stitched_panorama_trimmed.png

If you are willing to use a non-OpenCV solution, you can use Imagemagick 7 -trim (or Python Wand, which uses Imagemagick). You can call the Imagemagick command line below from Python using a subprocess.call

Input:

magick stitched_panorama.png -background black -define trim:percent-background=0% -fuzz 5% -trim +repage stitched_panorama_trimmed.png

回复收藏 0 原文

晒暮凉 2025-02-13 20:41:09

使用python wand函数，我们可以进行修剪到0：

from wand.image import Image
from wand.color import Color
import matplotlib.pylab as plt

stitched = plt.imread('images/AGSbD.png')
stitched2 = stitched.copy()
stitched2 = Image.from_array(stitched2)
stitched2.trim(color=Color('rgb(0,0,0)'), percent_background=0.0, fuzz=0)
stitched2 = np.array(stitched2)

plt.figure(figsize=(15,5))
plt.imshow(stitched), plt.axis('off'), plt.title('stitched', size=20)
plt.show()
plt.figure(figsize=(15,5))
plt.imshow(stitched2), plt.axis('off'), plt.title('stitched & trimmed', size=20)
plt.show()

Using python wand functions, we can do the trimming to0:

from wand.image import Image
from wand.color import Color
import matplotlib.pylab as plt

stitched = plt.imread('images/AGSbD.png')
stitched2 = stitched.copy()
stitched2 = Image.from_array(stitched2)
stitched2.trim(color=Color('rgb(0,0,0)'), percent_background=0.0, fuzz=0)
stitched2 = np.array(stitched2)

plt.figure(figsize=(15,5))
plt.imshow(stitched), plt.axis('off'), plt.title('stitched', size=20)
plt.show()
plt.figure(figsize=(15,5))
plt.imshow(stitched2), plt.axis('off'), plt.title('stitched & trimmed', size=20)
plt.show()

回复收藏 0 原文

别再吹冷风 2025-02-13 20:41:08

建议使用OPENCV的建议：

假设（x0，y0），（x1，y1）是该区域的左上和底部坐标（目标矩形，没有黑色边缘）。

将黑色边缘分成4个（或更少）的轮廓：

“顶轮廓” - 上部黑色区域。
“顶轮廓”定义y0坐标。
“底部轮廓” - 底部黑色区域。
“顶轮廓”定义y1坐标。
“右轮廓” - 右黑区域。
“右轮廓”定义x1坐标。
“左轮廓” - 上图中不存在（所以我们只有3个轮廓）。
“左轮廓”（如果存在）定义x0坐标。

这是一个例证：

在边缘构建一个二进制的“蒙版”，在“图像部分”处构建零：

我们想构建以下二进制图像：

您发布的示例图像使其有些挑战。
将黑色边缘与图像分开。
有些人工制品看起来像JPEG压缩工件，使过程更加困难。

建议使其更简单：
而不是以BGR像素格式缝制图像，而是以Bgra像素格式缝制图像。
将（原始图像集）的像素从BGR转换为BGRA，其中“ A”是一个alpha（透明）通道，填充了255（完全不透明）。
缝合图像后，输出的alpha通道应为255，其中像素是图像的一部分，在“黑色”边缘为零。
输出的Alpha通道为我们提供了所需的二进制掩码“免费”。

对于上述示例图像，我们可以使用以下阶段：

从BGR转换为灰度并应用阈值。
通过试用错误，手动将阈值设置为10（没有伪像，它应该为1）。
应用开放的形态操作以去除剩余的伪影。
找到轮廓并用白色（255）颜色填充（填充“孔” - “活动区域”以下的“活动区域”内的像素）。

裁剪矩形与最低面积：
要将背景分为4个（或更少）轮廓，我们必须确保“黑色区域”很好地分开（未连接）。
为此，我们可能会在最低区域裁剪矩形：

查找轮廓（应该只有一个）。
找到最小面积的矩形。
裁剪最小面积的矩形。

查找和分析4（或更少）轮廓：

获取逆阈值：Inv_thresh = 255-阈值。
再次查找轮廓...
现在，我们最多有4个轮廓（在我们的情况下为3个） - “顶部”，“底部”和“右”。
迭代轮廓，并识别左，紧身，顶部和底部：
对于每个轮廓，找到边界矩形（x，y，w，h）= cv2.boundingRect（c）。
如果x == 0和（h＆gt; w）我们找到了“左轮廓”。
如果x + w == image_witdh和（h＆gt; w）我们找到了“正确的轮廓”。
如果y == 0和（w＆gt; h）我们找到了“顶级轮廓”。
如果y + h == image_height和（w＆gt; h）我们找到了“底部轮廓”。
注意：当上述启发式方法失败时，可能会有很少的情况 - 对于县解决方案，我们可能需要将每个轮廓与所有其他轮廓进行比较（一种排序）。

代码样本：
以下代码示例还将“左”，“右”，“ top”，“底部”写为草图图像上的文本。

import numpy as np
import cv2

img = cv2.imread('stitched.png')  # Read input image

# Build a mask with zeros where pixels are black borders, and 255 otherwise
# Use threshold=10 instead of 1, and use opening operation due to JPEG compression artifacts.
thresh = cv2.threshold(cv2.cvtColor(img, cv2.COLOR_BGR2GRAY), 10, 255, cv2.THRESH_BINARY)[1]
thresh = cv2.morphologyEx(thresh, cv2.MORPH_OPEN, np.ones((3, 3), np.uint8))

# Find contours (there should be only one).
cnts = cv2.findContours(thresh, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_NONE)[-2] # Use index -2 for OpenCV 3 and 4 compatibility.
c = cnts[0]  # Get the first contour (get the single contour).
cv2.drawContours(thresh, [c], 0, 255, -1)  # Fill the contour with 255 (fill dark pixels under threshold inside the contour).

# Find rectangle with minimum area
rect = cv2.minAreaRect(c)
box = np.int0(cv2.boxPoints(rect)) # Find 4 corners all convert from floating point values to int

# Crop the rectangle with minimum area (crop both img and thresh)
(topy, topx) = (np.min(box[:,1]), np.min(box[:,0]))  # https://stackoverflow.com/questions/28759253/how-to-crop-the-internal-area-of-a-contour
(boty, botx) = (np.max(box[:,1]), np.max(box[:,0]))
img = img[topy:boty+1, topx:botx+1, :]
thresh = thresh[topy:boty+1, topx:botx+1]

inv_thresh = 255 - thresh  # Inverse of thresh.

cnts = cv2.findContours(inv_thresh, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_NONE)[-2]

inv_thresh_bgr = cv2.cvtColor(inv_thresh, cv2.COLOR_GRAY2BGR)  # inv_thresh_bgr is used for testing.

# Initialize top left and bottom right coordinates to be cropped (values are going apply area without black borders).
x0 = 0
y0 = 0
x1 = img.shape[1]
y1 = img.shape[0]

# Iterate contours:
for c in cnts:    
    (x, y, w, h) = cv2.boundingRect(c)  # Get contour bounding box.
    M = cv2.moments(c);cx = int(M["m10"] / M["m00"]);cy = int(M["m01"] / M["m00"]) # Compute the center of the contour (for testing)
     
    if (x == 0) and (h > w):
        # Enter here if "c" is the "Left contour" (for the given sample input, code should not reach here)
        x0 = w  # x0 is deffined by the left contour (x0 is the extreme right of the left contour).
        cv2.putText(inv_thresh_bgr, 'Left', (cx, cy), cv2.FONT_HERSHEY_DUPLEX, 1, (0, 255, 0), 2)  # Draw text for testing        
    elif ((x + w) == img.shape[1]) and (h > w):
        # Enter here if "c" is the "Right contour"
        x1 = x  # x1 is deffined by the right contour (x1 is the extreme left coordinate of the right contour)
        cv2.putText(inv_thresh_bgr, 'Right', (cx-80, cy), cv2.FONT_HERSHEY_DUPLEX, 1, (0, 255, 0), 2)  # Draw text for testing
    elif (y == 0) and (w > h):
        # Enter here if "c" is the "Top contour"
        y0 = h  # y0 is deffined by the to contour (y0 is the extreme bottom of the top contour).
        cv2.putText(inv_thresh_bgr, 'Top', (cx, cy+10), cv2.FONT_HERSHEY_DUPLEX, 1, (0, 255, 0), 2)  # Draw text for testing
    elif ((y + h) == img.shape[0]) and (w > h):
        # Enter here if "c" is the "Bottom contour"
        y1 = y  # y1 is deffined by the bottom contour (y1 is the extreme top of the bottom contour).
        cv2.putText(inv_thresh_bgr, 'Bottom', (cx, cy+10), cv2.FONT_HERSHEY_DUPLEX, 1, (0, 255, 0), 2)  # Draw text for testing

cv2.imwrite('inv_thresh_bgr.png', inv_thresh_bgr)  # Save for testing

# Crop the part without black margins.
img = img[y0:y1, x0:x1, :]

cv2.imshow('thresh', thresh)  # Show for testing
cv2.imshow('inv_thresh', inv_thresh)  # Show for testing
cv2.imshow('inv_thresh_bgr', inv_thresh_bgr)  # Show for testing
cv2.imshow('img', img)  # Show for testing
cv2.waitKey()
cv2.destroyAllWindows()

结果：

Inv_thresh_bgr（用于测试）：

输出img：

Suggested solution using OpenCV:

Assume (x0, y0), (x1, y1) are the top left and bottom right coordinates of the region to crop (target rectangle without black margins).

Split the black margins into 4 (or less) contours:

"Top contour" - upper black area.
The "Top contour" defines y0 coordinate.
"Bottom contour" - bottom black area.
The "Top contour" defines y1 coordinate.
"Right contour" - right black area.
The "Right contour" defines x1 coordinate.
"Left contour" - not exist in the above image (so we have only 3 contours).
The "Left contour" (if exists) defines x0 coordinate.

Here is an illustration:

Building a binary "mask" with zeros at the margins and white at the "image part":

We want to build the following binary image:

The sample image you have posted makes it a bit challenging.
There is no "clear cut" for separating the black margins from the image.
There are artifacts that looks like JPEG compression artifacts that makes the procedure more difficult.

Suggestion for making it simpler:
Instead of stitching images in BGR pixel format, stitch images in BGRA pixel format.
Convert the pixels (of the original set of images) from BGR to BGRA, where "A" is an alpha (transparency) channel, filled with 255 (fully opaque).
After stitching the images, the alpha channel of the output supposed to be 255 where pixels are part of an image, and zero in the "black" margins.
The alpha channel of the output gives us the desired binary mask "for free".

For the above sample image, we may use the following stages:

Convert from BGR to Grayscale and apply threshold.
The threshold was manually set to 10 by trial an error (with no artifacts it supposed to be 1).
Apply opening morphological operation for removing remaining artifacts.
Find the contour and fill it with white (255) color (fill "holes" - pixels inside the "active area" that are below the threshold).

Cropping rectangle with minimum area:
For separating the background into 4 (or less) contours, we have to make sure that the "black regions" are well split (not connected).
For that purpose we may crop the rectangle with minimum area:

Find contours (there should be only one).
Find rectangle with minimum area.
Crop the rectangle with minimum area.

Finding and analyzing the 4 (or less) contours:

Get the inverse threshold: inv_thresh = 255 - thresh.
Find contours once more...
Now we have up to 4 contours (3 in our case) - "Top", "Bottom" and "Right".
Iterate the contours, and identify the left, tight, top and bottom:
For each contour, find the bounding rectangle (x, y, w, h) = cv2.boundingRect(c).
if x == 0 and (h > w) we found the "Left Contour".
if x + w == image_witdh and (h > w) we found the "Right Contour".
if y == 0 and (w > h) we found the "Top Contour".
if y + h == image_height and (w > h) we found the "Bottom Contour".
Note: There may be rare cases when the above heuristics fails - for prefect solution we may need to compare each contour with all other contours (kind of sorting).

Code sample:
The following code sample also writes "Left", "Right", "Top", "Bottom" as text on a sketch image for testing).

import numpy as np
import cv2

img = cv2.imread('stitched.png')  # Read input image

# Build a mask with zeros where pixels are black borders, and 255 otherwise
# Use threshold=10 instead of 1, and use opening operation due to JPEG compression artifacts.
thresh = cv2.threshold(cv2.cvtColor(img, cv2.COLOR_BGR2GRAY), 10, 255, cv2.THRESH_BINARY)[1]
thresh = cv2.morphologyEx(thresh, cv2.MORPH_OPEN, np.ones((3, 3), np.uint8))

# Find contours (there should be only one).
cnts = cv2.findContours(thresh, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_NONE)[-2] # Use index -2 for OpenCV 3 and 4 compatibility.
c = cnts[0]  # Get the first contour (get the single contour).
cv2.drawContours(thresh, [c], 0, 255, -1)  # Fill the contour with 255 (fill dark pixels under threshold inside the contour).

# Find rectangle with minimum area
rect = cv2.minAreaRect(c)
box = np.int0(cv2.boxPoints(rect)) # Find 4 corners all convert from floating point values to int

# Crop the rectangle with minimum area (crop both img and thresh)
(topy, topx) = (np.min(box[:,1]), np.min(box[:,0]))  # https://stackoverflow.com/questions/28759253/how-to-crop-the-internal-area-of-a-contour
(boty, botx) = (np.max(box[:,1]), np.max(box[:,0]))
img = img[topy:boty+1, topx:botx+1, :]
thresh = thresh[topy:boty+1, topx:botx+1]

inv_thresh = 255 - thresh  # Inverse of thresh.

cnts = cv2.findContours(inv_thresh, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_NONE)[-2]

inv_thresh_bgr = cv2.cvtColor(inv_thresh, cv2.COLOR_GRAY2BGR)  # inv_thresh_bgr is used for testing.

# Initialize top left and bottom right coordinates to be cropped (values are going apply area without black borders).
x0 = 0
y0 = 0
x1 = img.shape[1]
y1 = img.shape[0]

# Iterate contours:
for c in cnts:    
    (x, y, w, h) = cv2.boundingRect(c)  # Get contour bounding box.
    M = cv2.moments(c);cx = int(M["m10"] / M["m00"]);cy = int(M["m01"] / M["m00"]) # Compute the center of the contour (for testing)
     
    if (x == 0) and (h > w):
        # Enter here if "c" is the "Left contour" (for the given sample input, code should not reach here)
        x0 = w  # x0 is deffined by the left contour (x0 is the extreme right of the left contour).
        cv2.putText(inv_thresh_bgr, 'Left', (cx, cy), cv2.FONT_HERSHEY_DUPLEX, 1, (0, 255, 0), 2)  # Draw text for testing        
    elif ((x + w) == img.shape[1]) and (h > w):
        # Enter here if "c" is the "Right contour"
        x1 = x  # x1 is deffined by the right contour (x1 is the extreme left coordinate of the right contour)
        cv2.putText(inv_thresh_bgr, 'Right', (cx-80, cy), cv2.FONT_HERSHEY_DUPLEX, 1, (0, 255, 0), 2)  # Draw text for testing
    elif (y == 0) and (w > h):
        # Enter here if "c" is the "Top contour"
        y0 = h  # y0 is deffined by the to contour (y0 is the extreme bottom of the top contour).
        cv2.putText(inv_thresh_bgr, 'Top', (cx, cy+10), cv2.FONT_HERSHEY_DUPLEX, 1, (0, 255, 0), 2)  # Draw text for testing
    elif ((y + h) == img.shape[0]) and (w > h):
        # Enter here if "c" is the "Bottom contour"
        y1 = y  # y1 is deffined by the bottom contour (y1 is the extreme top of the bottom contour).
        cv2.putText(inv_thresh_bgr, 'Bottom', (cx, cy+10), cv2.FONT_HERSHEY_DUPLEX, 1, (0, 255, 0), 2)  # Draw text for testing

cv2.imwrite('inv_thresh_bgr.png', inv_thresh_bgr)  # Save for testing

# Crop the part without black margins.
img = img[y0:y1, x0:x1, :]

cv2.imshow('thresh', thresh)  # Show for testing
cv2.imshow('inv_thresh', inv_thresh)  # Show for testing
cv2.imshow('inv_thresh_bgr', inv_thresh_bgr)  # Show for testing
cv2.imshow('img', img)  # Show for testing
cv2.waitKey()
cv2.destroyAllWindows()

Results:

inv_thresh_bgr (used for testing):