I'm implementing a face tracker on Android, and as a literature study, would like to identify the underlying technique of Android's FaceDetector. Simply put: I want to understand how the <code>android.media.FaceDetector</code> classifier works. A brief Google search didn't yield anything informative, so I thought I'd take a look at the code. By looking at the Java source code, <code>FaceDetector.java</code>, there isn't much to be learned: <code>FaceDetector</code> is simply a class that is provided the image dimensions and number of faces, then returns an array of faces. The Android source contains the JNI code for this class. I followed through the function calls, where, reduced to the bare essentials, I learned: <ol> <li>The "FaceFinder" is created in <code>FaceFinder.c:75</code> </li> <li>On line 90, <code>bbs_MemSeg_alloc</code> returns a <code>btk_HFaceFinder</code> object (which contains the function to actually find faces), essentially copying it the <code>hsdkA->contextE.memTblE.espArrE</code> array of the original <code>btk_HSDK</code> object initialized within initialize() (<code>FaceDetector_jni.cpp:145</code>) by <code>btk_SDK_create()</code> </li> <li>It appears that a maze of functions provide each other with pointers and instances of <code>btk_HSDK</code>, but nowhere can I find a concrete instantiation of <code>sdk->contextE.memTblE.espArrE[0]</code> that supposedly contains the magic.</li> </ol> What I have discovered, is a little clue: the JNI code references a FFTEm library that I can't find the source code for. By the looks of it, however, FFT is Fast Fourier Transform, which is probably used together with a pre-trained neural network. The only literature I can find that aligns with this theory is a paper by Ben-Yacoub et al. I don't even really know if I'm set on the right path, so any suggestions at all would undoubtedly help. Edit: I've added a +100 bounty for anybody who can give any insight.

I Found a couple of links too...Not sure if it would help you... http://code.google.com/p/android-playground-erdao/source/browse/#svn/trunk/SnapFace http://code.google.com/p/jjil/ http://benosteen.wordpress.com/2010/03/03/face-recognition-much-easier-than-expected/

I'm on a phone, so can't respond extensively, but Google keywords "neven vision algorithm" pull up some useful papers... Also, US patent 6222939 is related. Possibly also some of the links on http://peterwilliams97.blogspot.com/2008/09/google-picasa-to-have-face-recognition.html might be handy...

Underlying technique of Android's FaceDetector

Tags:

android

java-native-interface

fft

face-detection

I'm implementing a face tracker on Android, and as a literature study, would like to identify the underlying technique of Android's FaceDetector.

Simply put: I want to understand how the android.media.FaceDetector classifier works.

A brief Google search didn't yield anything informative, so I thought I'd take a look at the code.

By looking at the Java source code, FaceDetector.java, there isn't much to be learned: FaceDetector is simply a class that is provided the image dimensions and number of faces, then returns an array of faces.

The Android source contains the JNI code for this class. I followed through the function calls, where, reduced to the bare essentials, I learned:

The "FaceFinder" is created in FaceFinder.c:75
On line 90, bbs_MemSeg_alloc returns a btk_HFaceFinder object (which contains the function to actually find faces), essentially copying it the hsdkA->contextE.memTblE.espArrE array of the original btk_HSDK object initialized within initialize() (FaceDetector_jni.cpp:145) by btk_SDK_create()
It appears that a maze of functions provide each other with pointers and instances of btk_HSDK, but nowhere can I find a concrete instantiation of sdk->contextE.memTblE.espArrE[0] that supposedly contains the magic.

What I have discovered, is a little clue: the JNI code references a FFTEm library that I can't find the source code for. By the looks of it, however, FFT is Fast Fourier Transform, which is probably used together with a pre-trained neural network. The only literature I can find that aligns with this theory is a paper by Ben-Yacoub et al.

I don't even really know if I'm set on the right path, so any suggestions at all would undoubtedly help.

Edit: I've added a +100 bounty for anybody who can give any insight.

605

asked Jul 28 '10 14:07

Paul Lammertsma

2 Answers

I Found a couple of links too...Not sure if it would help you...

http://code.google.com/p/android-playground-erdao/source/browse/#svn/trunk/SnapFace

http://code.google.com/p/jjil/

http://benosteen.wordpress.com/2010/03/03/face-recognition-much-easier-than-expected/

139

answered Oct 19 '22 19:10

DeRagan

I'm on a phone, so can't respond extensively, but Google keywords "neven vision algorithm" pull up some useful papers...

Also, US patent 6222939 is related.

Possibly also some of the links on http://peterwilliams97.blogspot.com/2008/09/google-picasa-to-have-face-recognition.html might be handy...

answered Oct 19 '22 21:10

Stobor

Related questions
                            
                                Action bar displayed incorrectly when returning from immersive mode
                            
                                Caught a RuntimeException from the binder stub implementation when swap data in arrayadapter
                            
                                Android build get error message "Emulator: OpenGL backend 'angle' without OpenGL ES 1.x library detected. Using GLESv2 only."
                            
                                Are there any spreadsheet widgets for Android?
                            
                                Is it possible to dump a device hardware profile to create an equivalent AVD?
                            
                                How do I stop hogging the microphone
                            
                                Huawei device killing my foreground service, even with dontkillmyapp.com's solution
                            
                                Send auto email programmatically [duplicate]
                            
                                webviewglue nativedestroy view
                            
                                VideoView getCurrentPosition() irregularity on Acer Iconia A200
                            
                                Cross compiling GCC with newlib for ARM: how to specify GCC options like -march?
                            
                                Can I integrate Microsoft Lens into my application?
                            
                                save keystore password in android studio 3
                            
                                How to make Flutter work on WSL2 using host's emulator?
                            
                                How do I make my Android ContentObserver for ContactsContract detect a added, updated or deleted contact?
                            
                                IntelliJ new android gradle plugin (0.14.+)
                            
                                Getting more hardware profiles in to Android Studio
                            
                                ConstraintLayout: set height of all views in row to match the tallest one
                            
                                Android: Is it better to create a new fragment every time a navigation drawer item is clicked, or load up previously created fragments?
                            
                                Make Espresso wait for WebView to finish loading

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With