将图像分割成更小的图像的算法，减少空白量并指定最大矩形量

发布于 2024-11-07 04:52:06 字数 926 浏览 3 评论 0原文

我正在寻找一种算法，可以将图像分割成更小的图像，但有一些限制。一个限制是使用最少量的“空白”，即空像素。另一个是指定将其分割成的最大图像数量。

例如，让我们看一下下图。其中有很多“空白”。我想将此图像分成几个其他图像，这样我就可以减少该图像占用的内存量，并减少该图像所需的“绘制”量。

.=transparent pixel
x=colored pixel

....................
.xxxxxxxxxxx........
...xxxx...xxxxxx....
.............xxxxx..
...............xxx..
...............xxx..
....................
..xxxxxx............
.....xxxxxxxxxxx....
.........xxxxxxxxxx.
....................

假设我希望将图像最多分割为 4 个图像，可能的解决方案如下所示。

....................
.111111111111111....
.111111111111111....
.............22222..
.............22222.
.............22222..
....................
..3333333...........
..33333334444444444.
.........4444444444.
....................

有谁有这方面的算法，或者知道执行此操作的算法的名称？我已经寻找了一段时间并找到了一些相关的算法，但是我发现的算法没有考虑空白，例如它们将图像分割成仅覆盖非透明像素的矩形，从而产生大量的矩形。我正在处理的现实生活数据是 1024*1024 像素的图像，我更愿意将它们减少到最多 16 个部分。诀窍是使用最少的空白提取 16 张图像。

原文

I am looking for an algorithm which can split an image into smaller images, with some constraints.
One constraint is to use the least amount of "whitespace" meaning empty pixels. And the other is to specify a maximum amount of images to split it into.

For example lets look at the below image. There is a lot of "whitespace" in it. I would like to divide this images into a few other images so i can reduce the amount of memory this image occupies, and also to reduce the amount of "drawing" this image will take.

.=transparent pixel
x=colored pixel

....................
.xxxxxxxxxxx........
...xxxx...xxxxxx....
.............xxxxx..
...............xxx..
...............xxx..
....................
..xxxxxx............
.....xxxxxxxxxxx....
.........xxxxxxxxxx.
....................

Let's assume i want the image to be split into a maximum of 4 images, a possible sollution would be as drawn below.

....................
.111111111111111....
.111111111111111....
.............22222..
.............22222.
.............22222..
....................
..3333333...........
..33333334444444444.
.........4444444444.
....................

Does anyone have an algorithm for this, or knows the name of an algorithm which does this?
I have been looking for a while and found some related algorithms, but the algorithms i found don't account for the whitespace e.g. they split the image into rectangles covering only the non transparent pixels, resulting in a huge amount of rectangles.
The real life data i am working with are images of 1024*1024 pixels and i would prefer to reduce them into a maximum of 16 parts. the trick is to extract the 16 images using the least amount of whitespace.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

誰ツ都不明白 2024-11-14 04:52:06

我会使用与 ravloony，但有一个轻微而重要的修改，使用“裁剪”操作来查找最小/最大列和行并不是完全空的，而是丢弃其余的。

实际上，裁剪操作将获取一个 X*Y 区域作为输入，并输出 4 个整数 - 包含该区域所有已使用像素的最小矩形的坐标。这也可用于检测和丢弃空白区域。

示例

....................
.xxxxxxxxxxx........     xxxxxxxxxxx.......
...xxxx...xxxxxx....     ..xxxx...xxxxxx...
.............xxxxx..     ............xxxxx.
...............xxx.. =>  ..............xxx. (first crop)
...............xxx..     ..............xxx.
....................     ..................
..xxxxxx............     .xxxxxx...........
.....xxxxxxxxxxx....     ....xxxxxxxxxxx...
.........xxxxxxxxxx.     ........xxxxxxxxxx
....................

现在将图像分为 NxN 部分（此处使用 N=4）并对每个部分使用裁剪操作：

xxxxx|xxxxx|x....|
..xxx|x...x|xxxxx|
---------------------
     |     |  xxx|xx
     |     |  ..x|xx
---------------------
     |     |    x|xx
     |     |     |
---------------------
 xxxx|xx...|     |
 ...x|xxxxx|xxxxx|
     |...xx|xxxxx|xxx

对于此示例，我们得到 10+10+10+6+4+1+2+8+15+ 10+3=79 像素，而不是 21*11=231，仅为 34.2%。请注意，这恰好与您手工制作的 4 部分分割的数量相同 (30+15+14+20=79)！

结论

当然，会有一些额外的数据来跟踪每个部件的 16 个部件的位置和大小，并且它并不总是给出最佳结果，但我认为这是速度和节省之间的一个很好的折衷，并且该算法很容易编写和维护。

关于附加数据：大小为 1024x1024 的图像并分为 4x4 部分，您可以使用 4 字节值来存储每个矩形，因此附加数据大小仅为 16*4 = 64 字节 - 关于这一点，您也许应该考虑增加 16 个部分的最大值，除非它会严重减慢其他部分（例如绘图）的速度。

最坏的情况

该算法的最坏情况是在边缘处或附近设置了一些像素的部分，如下所示：

x......x    xxxxxxxx    xx......
........    ........    x.......
........    ........    ........
x......x    ...x....    .......x

我想到了这些问题的几种解决方案：

再次分割区域（最终
使用四叉树实现）
使用一些额外的步骤来检测
完全空的矩形
里面。
稍微平移定义零件的网格

I'd go with the same algorithm as ravloony, but with a slight and important modification, using a "crop" operation that looks for the minimal/maximal columns and rows that aren't completely empty and discarding the rest.

In practice, the crop operation would get a X*Y region as input and would output 4 integers - the coordinates of the smallest rectangle that contains all the used pixels of the region. This can also be used to detect and discard empty regions.

Example

....................
.xxxxxxxxxxx........     xxxxxxxxxxx.......
...xxxx...xxxxxx....     ..xxxx...xxxxxx...
.............xxxxx..     ............xxxxx.
...............xxx.. =>  ..............xxx. (first crop)
...............xxx..     ..............xxx.
....................     ..................
..xxxxxx............     .xxxxxx...........
.....xxxxxxxxxxx....     ....xxxxxxxxxxx...
.........xxxxxxxxxx.     ........xxxxxxxxxx
....................

Now divide the image into NxN parts (using N=4 here) and use the crop operation on each of the parts:

xxxxx|xxxxx|x....|
..xxx|x...x|xxxxx|
---------------------
     |     |  xxx|xx
     |     |  ..x|xx
---------------------
     |     |    x|xx
     |     |     |
---------------------
 xxxx|xx...|     |
 ...x|xxxxx|xxxxx|
     |...xx|xxxxx|xxx

For this example, we get 10+10+10+6+4+1+2+8+15+10+3=79 pixels instead of 21*11=231 which is only 34,2%. Note that this happens to be the same amount as with your handcrafted 4-part segmentation (30+15+14+20=79)!

Conclusions

Of course there will be some additional data to keep track of the position and size of the 16 parts for each and it won't always give best results, but I think it's a nice compromise between speed and savings and the algorithm is easy to write and maintain.

About the additional data: Images of size 1024x1024 and splitting into 4x4 parts would give you the possibility to use 4 byte values to store each rectangle, so additional data size would be only 16*4 = 64 bytes - regarding this, you should perhaps consider to increase your 16 part maximum unless it will slow down some other part like the drawing heavily.

Worst cases

Worst cases for this algorithm would be parts with some pixels at or near the edges set, like these:

x......x    xxxxxxxx    xx......
........    ........    x.......
........    ........    ........
x......x    ...x....    .......x

Several solutions for these come to my mind:

Splitting the region again (ending up
with a quadtree implementation)
Using some additional step to detect
completely empty rectangles in the
inside.
Translating the grid that defines the parts a bit

回复收藏 0 原文

时光与爱终年不遇 2024-11-14 04:52:06

我会考虑递归地执行此操作，每次将其分成两半或四份，直到达到您想要的级别（对您来说 2 -> 4^2 = 16）。在底层检查空方块并丢弃它们。
当然，这会给你一个与原始图像的形状成比例的矩形网格，而不是最佳放置的矩形，但它可能会让你走上正确的轨道。

回复收藏 0 原文

若水般的淡然安静女子 2024-11-14 04:52:06

我的直觉告诉我，理想的解决方案类似于背包问题，因此在计算上是不切实际的。您也许可以使用某种启发式方法来生成“足够好”的解决方案。

您可以使用洪水填充算法来选择非透明像素的连接区域。作为第一次切割，这将为每个不相交的颜色区域提供一个矩形。如果您的预算中有更多可用的矩形，您可以尝试以不同的方式切割它们，看看哪种方式可以提供最高的彩色像素“密度”。

回复收藏 0 原文

嘦怹 2024-11-14 04:52:06

您想要编写运行长度或增量压缩算法。或者您想使用空间归档曲线或空间索引。 sfc 递归地将表面细分为更小的 4 个图块，并将 2 维的复杂性降低到 1 维，从而更容易识别空白。您想要查找 Nick 的希尔伯特曲线四叉树空间索引博客。您想在 phpclasses.org 下载我的 php 类希尔伯特曲线。

回复收藏 0 原文