Android FaceDetector底层技术

发布于 2024-09-12 04:13:42 字数 1780 浏览 6 评论 0原文

我正在 Android 上实现一个面部跟踪器,作为一项文献研究,我想确定 Android FaceDetector 的底层技术。

简而言之:我想了解 android.media.FaceDetector 分类器是如何工作的。

简短的谷歌搜索没有产生任何信息,所以我想我应该看一下代码。

通过查看Java源代码, FaceDetector.java,没有太多需要学习的:FaceDetector 只是一个提供以下功能的类图像尺寸和面部数量,然后返回面部数组。

Android 源代码包含此类的 JNI 代码。我跟踪了函数调用,在精简到最基本的部分后,我了解到:

  1. “FaceFinder”是在 FaceFinder.c:75< /a>
  2. 在第 90 行,bbs_MemSeg_alloc 返回一个 btk_HFaceFinder 对象(其中包含实际查找面孔的函数),本质上是将其复制到 hsdkA->contextE。 memTblE.espArrE 原始 btk_HSDK 对象的数组在initialize()中初始化(FaceDetector_jni.cpp:145) 通过 btk_SDK_create()
  3. 看起来,迷宫般的函数相互提供了 btk_HSDK 的指针和实例,但我找不到 sdk->contextE 的具体实例。 memTblE.espArrE[0] 据说包含魔法。

我发现了一点线索:JNI 代码引用了一个 FFTEm 库,但我找不到它的源代码。然而,从表面上看,FFT 是快速傅里叶变换,它可能与预训练的神经网络一起使用。我能找到的唯一与这一理论相符的文献是 Ben-Yacoub 的论文等人。

我什至不知道我是否走在正确的道路上,所以任何建议无疑都会有帮助。

编辑:我为任何能够提供任何见解的人添加了 +100 赏金。

I'm implementing a face tracker on Android, and as a literature study, would like to identify the underlying technique of Android's FaceDetector.

Simply put: I want to understand how the android.media.FaceDetector classifier works.

A brief Google search didn't yield anything informative, so I thought I'd take a look at the code.

By looking at the Java source code, FaceDetector.java, there isn't much to be learned: FaceDetector is simply a class that is provided the image dimensions and number of faces, then returns an array of faces.

The Android source contains the JNI code for this class. I followed through the function calls, where, reduced to the bare essentials, I learned:

  1. The "FaceFinder" is created in FaceFinder.c:75
  2. On line 90, bbs_MemSeg_alloc returns a btk_HFaceFinder object (which contains the function to actually find faces), essentially copying it the hsdkA->contextE.memTblE.espArrE array of the original btk_HSDK object initialized within initialize() (FaceDetector_jni.cpp:145) by btk_SDK_create()
  3. It appears that a maze of functions provide each other with pointers and instances of btk_HSDK, but nowhere can I find a concrete instantiation of sdk->contextE.memTblE.espArrE[0] that supposedly contains the magic.

What I have discovered, is a little clue: the JNI code references a FFTEm library that I can't find the source code for. By the looks of it, however, FFT is Fast Fourier Transform, which is probably used together with a pre-trained neural network. The only literature I can find that aligns with this theory is a paper by Ben-Yacoub et al.

I don't even really know if I'm set on the right path, so any suggestions at all would undoubtedly help.

Edit: I've added a +100 bounty for anybody who can give any insight.

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(3

朦胧时间 2024-09-19 04:13:42

我正在打电话,所以无法广泛回应,但谷歌关键字“neven Vision Algorithm”提取了一些有用的论文......

另外,美国专利 6222939 相关。

可能还有 http:// /peterwilliams97.blogspot.com/2008/09/google-picasa-to-have-face-recognition.html 可能很方便...

I'm on a phone, so can't respond extensively, but Google keywords "neven vision algorithm" pull up some useful papers...

Also, US patent 6222939 is related.

Possibly also some of the links on http://peterwilliams97.blogspot.com/2008/09/google-picasa-to-have-face-recognition.html might be handy...

忘东忘西忘不掉你 2024-09-19 04:13:42

看看这个:
http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber= 1562271

我想我曾经在演示文稿中看到过一些 matlab 代码执行此操作。
也许它在网上的某个地方。

问候,
拉尔斯

have a look at this:
http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=1562271

I think I once saw some matlab code doing this in a presentation.
Maybe it's somewhere online.

Greetings,
Lars

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文