Detecting blank page scans with PIL

Posted 2024-10-26 13:36:17


So I often run huge double-sided scan jobs on an unintelligent Canon multifunction, which leaves me with a huge folder of JPEGs. Am I insane to consider using PIL to analyze a folder of images to detect scans of blank pages and flag them for deletion?

Leaving the folder-crawling and flagging parts out, I imagine this would look something like:

  • Check whether the image is greyscale, since this can't be assumed.
  • If so, detect the dominant range of shades (background colour).
  • If not, detect the dominant range of shades, restricting to light greys.
  • Determine what percentage of the entire image is composed of said shades.
  • Try to find a threshold that adequately detects pages with type or writing or imagery.
  • Perhaps test fragments of the image at a time to increase accuracy of threshold.

I know this is sort of an edge case, but can anyone with PIL experience lend some pointers?
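For concreteness, here's the kind of minimal Pillow sketch I'm imagining for the steps above; the ±10 band around the dominant shade and the 99% cutoff are pure guesses that would need tuning:

from glob import glob
from PIL import Image

def looks_blank(path, band=10, blank_fraction=0.99):
    # Force greyscale so colour and grey scans get the same treatment
    img = Image.open(path).convert("L")
    hist = img.histogram()  # 256 pixel counts for mode "L"
    # Dominant shade = the most common grey level (the paper background)
    peak = hist.index(max(hist))
    # Fraction of pixels within +/- band of the dominant shade
    near_peak = sum(hist[max(0, peak - band):peak + band + 1])
    return near_peak / (img.size[0] * img.size[1]) >= blank_fraction

flagged = [p for p in glob('*.jpg') if looks_blank(p)]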


Comments (3)

何以心动 2024-11-02 13:36:17


Here is an alternative solution, using mahotas and milk.

  1. Start by creating two directories, positives/ and negatives/, where you will manually pick out a few examples.
  2. Assume the rest of the data is in an unlabeled/ directory.
  3. Compute features for all of the images in positives and negatives.
  4. Learn a classifier.
  5. Use that classifier on the unlabeled images.

In the code below I used jug (http://luispedro.org/software/jug) to give you the possibility of running it on multiple processors, but the code also works if you remove every line that mentions TaskGenerator:

from glob import glob
import mahotas
import mahotas.features
import milk
from jug import TaskGenerator


@TaskGenerator
def features_for(imname):
    # Texture descriptor: mean of the Haralick features over all directions
    img = mahotas.imread(imname)
    return mahotas.features.haralick(img).mean(0)

@TaskGenerator
def learn_model(features, labels):
    # milk's default classifier handles normalisation and model selection
    learner = milk.defaultclassifier()
    return learner.train(features, labels)

@TaskGenerator
def classify(model, features):
    return model.apply(features)

positives = glob('positives/*.jpg')
negatives = glob('negatives/*.jpg')
unlabeled = glob('unlabeled/*.jpg')


# Label convention: 0 = negative (blank), 1 = positive (content)
features = [features_for(im) for im in negatives + positives]
labels = [0] * len(negatives) + [1] * len(positives)

model = learn_model(features, labels)

labeled = [classify(model, features_for(u)) for u in unlabeled]
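(If you keep the jug lines, save the script as, say, blankpages.py — the name is just an example — and run it with jug execute blankpages.py; launching that same command in several shells spreads the tasks across processors.)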

This uses texture features, which is probably good enough, but you can play with other features in mahotas.features if you'd like (or try mahotas.surf, but that gets more complicated). In general, I have found it hard to do classification with the sort of hard thresholds you are looking for unless the scanning is very controlled.

笑忘罢 2024-11-02 13:36:17


Just as a first try, sort your image folder by file size. If all scans from one document have the same resolution, the blank pages will certainly result in smaller files than the non-blank ones.

I don't know how many pages you are scanning, but if the number is low enough this could be a simple quick fix.
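A minimal sketch of that idea, assuming the JPEGs sit in a single scans/ directory (the directory name and the number of files to inspect are placeholders):

import os
from glob import glob

# Smallest files first; blank pages should cluster at the top of the list
paths = sorted(glob('scans/*.jpg'), key=os.path.getsize)
for path in paths[:20]:
    print(os.path.getsize(path), path)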

月亮邮递员 2024-11-02 13:36:17


A few non-PIL-specific suggestions to consider:

Scans of printed or written material will have lots of high-contrast sharp edges; something like a median filter (to reduce noise) followed by some kind of simple edge detection might do a good job of discriminating real content from blank pages.
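For instance, a rough version of that with Pillow's built-in filters (the 64 edge-strength cutoff is an arbitrary assumption to tune):

from PIL import Image, ImageFilter

def edge_score(path):
    img = Image.open(path).convert("L")
    # Median filter knocks out scanner speckle before edge detection
    img = img.filter(ImageFilter.MedianFilter(3))
    edges = img.filter(ImageFilter.FIND_EDGES)
    hist = edges.histogram()
    # Fraction of pixels with a strong edge response; near zero on blanks
    return sum(hist[64:]) / (img.size[0] * img.size[1])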

Testing fragments at a time is useful not only because it might increase your accuracy, but because it might help you to give up early on many pages. Presumably most of your scans are not blank, so you should begin with a simple-minded check that usually identifies non-blank pages as non-blank; only if it says the page might be blank do you need to look more closely.
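A sketch of that early-exit structure, where tile_has_content stands in for whatever cheap per-tile test you settle on (a hypothetical helper, not a real function):

def probably_blank(img, tile=256):
    w, h = img.size
    for top in range(0, h, tile):
        for left in range(0, w, tile):
            box = img.crop((left, top, min(left + tile, w), min(top + tile, h)))
            if tile_has_content(box):  # hypothetical cheap per-tile test
                return False  # bail out as soon as any tile has content
    return True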

In case either the illumination or the page itself is nonuniform, you might want to begin by doing something like image = image - filter(image), where filter does a very broad smoothing of some kind. That reduces the need to identify the dominant shades, and it copes with cases where the dominant shade isn't quite uniform across the page.
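With Pillow, that image - filter(image) step might look like this, using a large-radius Gaussian blur as the broad smoothing (radius 50 is a guess):

from PIL import Image, ImageChops, ImageFilter

def flatten_background(path):
    img = Image.open(path).convert("L")
    # Heavy blur approximates the page background / illumination field
    background = img.filter(ImageFilter.GaussianBlur(radius=50))
    # Pixelwise |img - background|: near zero except where there is content
    return ImageChops.difference(img, background)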
