Research Journal of Recent Sciences ________________________________________________ ISSN 2277-2502
Vol. 2(5), 10-14, May (2013)
Res.J.Recent Sci.
Sub-Holistic Hidden Markov Model for Face Recognition
Muhammad Sharif, Jamal Hussain Shah, Sajjad Mohsin, Mudassar Raza
Department of Computer Sciences, COMSATS Institute of Information Technology, Wah Cantt., 47040 PAKISTAN
Available online at: www.isca.in
Received 10th October 2012, revised 14th November 2012, accepted 2nd February 2013
Abstract
In this paper, a face recognition technique “Sub-Holistic Hidden Markov Model” has been proposed. The technique
divides the face image into three logical portions. The proposed technique, which is based on Hidden Markov Model
(HMM), is then applied to these portions. The recognition process involves three steps i.e. pre-processing, template
extraction and recognition. The experiments were conducted on images with different resolutions of the two standard
databases (YALE and ORL) and the results were analyzed on the basis of recognition time and accuracy. The accuracy of
proposed technique is also compared with SHPCA algorithm, which shows better recognition rates.
Keywords: Face, recognition, 2D, image division, hidden Markov model, sub-holistic.
Introduction
Biometrics is usually used in pattern recognition system where a
person is recognized by his/her unique identity. This unique
identity may derive from physiological or behavioural traits of
any individual. The collection of feature vector data of unique
identity is stored in the database. At this step the feature are
being extracted. Biometric significance increases because of its
emerging applications in different fields of relevance. Face
recognition is one of the many applications of Biometrics under
image processing. But it is difficult to make a system that is
comparable to human intelligence. Irregular lighting, density,
depth of the image, unwanted objects and the angle at which the
image is captured are some of the factors that make image
processing difficult.
Many techniques have been developed in the past half century
for image recognition. Every technique has its own pros and
cons, as few are fast but less accurate, whereas the others are
very accurate but time consuming.
Face recognition algorithms and techniques were initially
developed in the early 1960s. The main emphasis of the image
processing and recognition approaches of that time was the
geometry of the facial parts i.e. eyes, nose, ears and mouth. The
distance between each facial part, and the angle at which it has
been captured, were also catered while processing an image.
This kind of system proposed by Kanade1 in 1973 was one of
the first approaches towards automated face recognition.
Another traditional technique in face recognition is Template
Matching2.
The modern techniques include 2-Dimentional (2D) and 3Dimentional (3D) image processing and recognition. Most
common used in 2D techniques are Principal Component
Analysis (PCA)3, Eigen face method4, Linear Discriminant
International Science Congress Association
Analysis (LDA)5, Hidden Markov Model (HMM)6-12 and the
Dynamic Link Architecture (DLA)13, whereas the 3D
techniques use the 3D Face Recognition and 3D Morphological
Operations14-15 etc. In real time applications a lot of challenges
arises such as occlusion, illumination, expression in face
recognition many authors developed their own algorithms to
dealt with these challenges such as Hu J. et al6, Gernoth T. et
al16, Vu N.S. et al.17, Heo J.18, Prabhu U.H. et al19, Gross R.
et al20 and Sharif M. et al21.
But face recognition is subjected to many great challenges when
it comes to its use in real time applications.
The paper introduces a technique based on Hidden Markov
Model and Named “Sub Holistic Hidden Markov Model
(SHMM)”. In this technique, the face image is divided into three
portions and then recognized through Hidden Markov Model.
Methodology
Proposed Technique: The proposed system is divided into
three major steps. Figure 1 shows the steps of proposed system.
Preprocessing Module: The image which is to be tested should
be similar to the one stored in the database. It will make sure
that all images that are used for face recognition are entered in
the system in a uniform manner (normalization). It means that
whenever a face image is to be matched with the database for
recognition, it has to be preprocessed first. The most important
property of preprocessing module is face detection, so that the
resulting face image is in its proper form for the next module
that is extracting templates. The reason is that the extraction of
horizontal strips, containing left eye, right eye and lips can
easily be determined. Accurate and consistent preprocessing is
one of the most important steps in developing a good face
recognition system.
10
Research Journal of Recent Sciences ______________________________________________________________ ISSN 2277-2502
Vol. 2(5), 10-14, May (2013)
Res. J. Recent Sci.
Figure-1
Flow Diagram of Proposed System
Taking pixel value as observation vectors could be a bit risky
because of its sensitivity against image noise, image rotation
and large dimension of observation vector, which results in a
high computational complexity7. In preprocessing, the most
important feature is training the database. The trained database
is divided in such a way that all the top left quadrants of the
image are kept in one database, top right quadrant and lips
portion are in the other databases.
Extracting the Template Module: This module receives a
normalized image from the preprocessed module. This module
splits the input face into three portions (Top Left, Top Right and
Lips portion). We have not used other sub-sets of the face which
include unnecessary portions like background, nose etc as
shown in figure 4.
The quadrant of the same images stored in the trained database
contains same index number. This feature has been shown in
figure 2:
Figure-4
Extracted Template of Image
Figure-2
The extracted features are saved with same index number
The training process of top left, top right and lips portion is
depicted in figure 3.
Figure-3
Training process of Database images
International Science Congress Association
The main idea of face division comes from Hidden Markov
Model (HMM) where HMM training scheme is used as an
iterative process for clustering data. Initial parameters are
obtained by using training data and prototype model. Main goal
is to obtain the observation probability, for this, data is
segmented uniformly and is matched with each model to extract
the initial model parameter. Then by using Viterbi Algorithm,
training observation cycles are segmented into states. This
procedure is used by Takahashi Y. et al10 for face recognition.
Recognition Module: The third and last module of the
proposed technique is recognition process. In this, searching
process starts by taking the top left quadrant of test image and
match with the templates of corresponding quadrant in the
trained images. If the match does not give satisfactory results, it
would be considered as an error. If the match gives satisfactory
results then it will move to next quadrant of the same image and
hence the third quadrant will be matched in the same manner.
11
Research Journal of Recent Sciences ______________________________________________________________ ISSN 2277-2502
Vol. 2(5), 10-14, May (2013)
Res. J. Recent Sci.
Standard databases (ORL and YALE) were used to obtain the
Face Recognition Rate (FRR) against varying image resolution.
Table 1 shows both databases resolutions that were used to
obtain the results:
Table-1
Images Resolution of ORL and YALE Database
Image Resolution
S.No.
ORL
YALE
1
112 X 92
163 X 240
2
56 X 46
128 X 192
3
37 X 23
100 X 100
4
22 X 18
56 X 46
5
18 X 15
30 X 30
Experiment and Results based on Timing: The analysis was
taken on the bases of recognition time per image (in seconds) as
depicted in figure 5. As the resolution per image decreases, the
recognition time varies accordingly, table 2 shows the varying
image resolution of ORL and YALE database:
Table-3
Recognition Time (seconds) against varying resolution.
(from table I)
Recognition Time in Seconds
Resolution
ORL Database
YALE Database
1
29.616
15.654
2
27.354
11.048
3
20.059
8.362
4
19.73
5.452
5
19.395
5.304
35
30
FRR Time (Seconds)
Results and Discussion
20
YALE
15
ORL
10
5
0
1
2
3
4
5
Dimension
Figure-6
ORL and YALE Recognition Time (seconds) against
varying resolution
Table-2
Recognition Time Per image (seconds) against varying
resolution (from table I)
Recognition Time Per image(seconds)
Resolution
ORL
YALE
1
0.124
0.172
2
0.11
0.144
3
0.093
0.11
4
0.089
0.109
5
0.07
0.091
25
Experiment and Results based on Recognition Rate: In order
to calculate the recognition rate images were categorized under
different resolutions, 112 X 92, 56 X 46, 37 X 23, 22 X 18 and
18 X 15 for ORL database. Each resolution consists of 8 sets of
400 images, each set is divided in such a way that first set
contains first 50 images, second set contains 100, and third set
contains 150 and so on up till 400 images. The combined
Percentage recognition rate of each resolution is shown in table
4 and its respective figure 7.
0.35
Recognition Time Per
image(seconds)
0.3
0.25
0.2
YALE
0.15
ORL
0.1
0.05
0
1
2
3
4
5
Table-4
FRR (Percentage) of ORL at different resolution
ORL FRR (Percentage)
Resolution
No. of Failure
FRR (Percentage)
112 X 92
2
99.5%
56 X 46
3
99.25%
37 X 23
10
98.75%
22 X 18
15
97.5%
18 X 15
19
95.25%
Dimension
ORL
In order to calculate the recognition time, 24 images of various
resolutions, as mentioned above in table 1 were taken from ORL
database and 10 images were taken from YALE database. The
resolution used is also mentioned in table 1 under the heading of
image resolution. The combined results obtained from both the
databases for recognition time are shown in table 3 and figure 6.
International Science Congress Association
100.00%
99.00%
98.00%
% FRR
Figure-5
ORL and YALE Recognition Time Per image (seconds)
against varying resolution. (from table I)
97.00%
ORL
96.00%
95.00%
94.00%
93.00%
112 X 92
56 X 46
37 X 23
22 X 18
18 X 15
Image Resolution
Figure-7
FRR (Percentage) of ORL at different resolution graph
12
Research Journal of Recent Sciences ______________________________________________________________ ISSN 2277-2502
Vol. 2(5), 10-14, May (2013)
Res. J. Recent Sci.
YALE
100.00%
99.00%
% FRR
98.00%
97.00%
96.00%
YALE
100.00%
99.00%
98.00%
97.00%
% FRR
Table 5 shows the dataset of YALE database of varying
resolution listed ahead 163 X 240, 128 X 192, 100 X 100, 56 X
46 and 30 X 30. In this case 3 sets of images are formed, each
sets is divided in such a way that first set contains first 5 images,
second set contains 110, and third set contains 165. The
combined Percentage recognition rate of each resolution is
shown in figure 8.
Table-5
FRR (Percentage) of YALE at different resolution
YALE FRR (Percentage)
Resolution
No. of Failure
FRR
(Percentage)
163 X 240
1
99.39%
128 X 192
1
99.39%
100 X 100
2
98.78%
56 X 46
6
96.36%
30 X 30
9
94.54%
96.00%
SHHMM
95.00%
SHPCA
94.00%
93.00%
92.00%
91.00%
112 X 92
56 X 46
37 X 23
22 X 18
18 X 15
Resolution
Figure-9
FRR (Percentage) of SHHMM and SHPCA at different
resolution of ORL Database
Conclusion
The results were taken by using the proposed technique on
different resolutions of ORL and YALE database. The process
consists of dividing the face image into three quadrants namely
top left, top right and lip portion. This division is beneficial to
reduce the recognition time. Through analysis, it is concluded
that when the resolution is reduce, the face has been losing its
original data which is difficult to recognize and hence it
generates much error rate in face recognition. The proposed
technique handled these issues to some extent and produces the
better results.
95.00%
94.00%
References
93.00%
92.00%
163 X 240
128 X 192
100 X 100
56 X 46
30 X 30
1.
Kanade T., Picture processing by computer complex and
recognition of human faces, doctoral dissertation, Kyoto
University, (1973)
2.
Levada A.L.M. Correa D.C., Salvadeo D., Saito J.H.,
Mascarenhas N., Novel approaches for face recognition:
Template-matching using Dynamic Time Warping and
LSTM neural network supervised classification, 15th
International Conference on Systems, Signals and Image
Processing, 241-244 (2008)
3.
Turk M. and Pentland A., Face recognition using eigenfaces
Proc. IEEE Conf. Comput. Vis. Pattern Recogn, 586-591
(1991)
4.
Matthew A. Turk and Alex P. Pentland, Face recognition
using eigenfaces. Proc. CVPR, 586-591 IEEE, June (1991)
5.
Daniel L. Swets John Weng, Using Discriminant
Eigenfeatures for Image Retrieval, IEEE Transactions on
Pattern Analysis and Machine Intelligence Volume
18, Issue 8 (1996)
6.
Hu J., Deng W. and Guo J., 2D projective transformation
based active shape model for facial feature location, IEEE
Conference Proceedings, 2442-2446 (2011)
7.
Ara V., Nefian and Monson H., Hayes III, Hidden Markov
Models for face recognition, IEEE International Conference
on Acoustic Speech and Signal Processing (1998)
image Resolution
Figure-8
FRR (Percentage) of YALE at different resolution graph
Comparison of Proposed Technique with SHPCA: The
proposed face recognition technique was tested and compared
with Sub-holistic PCA (SHPCA)16 under the condition of FRR
(Percentage) and with timing. Five different resolutions of same
database (ORL) were used for the comparison of proposed
technique with the SHPCA. The results indicate that the
proposed technique provides recognition rates of 99.5% at
image resolution of 112 X 92 whereas SHPCA7 achieved results
of 94.5% at same resolution, remaining results are shown in
table 6.
Table-6
FRR (Percentage) of SHHMM and SHPCA at different
resolution of ORL Database
Resolution
112 X 92
56 X 46
37 X 23
22 X 18
18 X 15
ORL Database with
SHHMM
No. of
FRR
Failure (Percentage)
2
3
10
15
19
99.5%
99.25%
98.75%
97.5%
95.25%
ORL Database with
SHPCA
No. of
FRR
Failure (Percentage)
22
14
17
21
23
International Science Congress Association
94.5%
96.5%
95.75%
94.75%
94.25%
13
Research Journal of Recent Sciences ______________________________________________________________ ISSN 2277-2502
Vol. 2(5), 10-14, May (2013)
Res. J. Recent Sci.
8.
Wang H. and Cao Y., An HMM-Based Face Recognition
Model under Variable Pose in Videos, Chinese Conference
on Pattern Recognition (CCPR), 1-7 (2010)
9.
Hu Y. and Liu B., Face Recognition Based on PLS and
HMM, Chinese Conference on Pattern Recognition CCPR,
1-4 (2009)
10. Takahashi Y., Tamamori A., Nankaku Y. and Tokuda K.,
Face recognition based on separable lattice 2-D HMM with
state duration modeling, IEEE International Conference on
Acoustics Speech and Signal Processing (ICASSP), (2010)
11. Chien J. and Liao C., Maximum Confidence Hidden
Markov Modeling for Face Recognition, IEEE Transactions
on Pattern Analysis and Machine Intelligence, 606-616
(2008)
12. Kotropoulos C.L. Tefas A. and Pitas I., Frontal face
authentication
using
discriminating
grids
with
morphological feature vectors, IEEE Transactions on
Multimedia, 2, 14-26 (2000)
13. Berretti S., Del Bimbo A. and Pala P., 3D Face
Recognition Using Isogeodesic Stripes, IEEE Transactions
on Pattern Analysis and Machine Intelligence, 32, 21622177 (2010)
14. Zaeri N., Feature extraction for 3D face recognition system,
International Conference on Multimedia Computing and
Systems (ICMCS), 1-4 (2011)
International Science Congress Association
15. Muhammad Almas Anjum, An Improved Face recognition
using Image Resolution Reduction and Optimization of
Feature vector PhD Thesis, College of E and ME Nust
RawalPindi-Pakistan (2008)
16. Gernoth T., Gooßen A. and Grigat R.R., Face recognition
under pose variations using shape-adapted texture features,
IEEE Conference Proceedings, 4525-4528 (2010)
17. Vu N.S. and Caplier A., Efficient statistical face
recognition across pose using local binary patterns and
gabor wavelets, IEEE Conference Proceedings, 1-5 (2009)
18. Heo J., Generic elastic models for 2d pose synthesis and
face recognition, Ph. D. dissertation, CMU, (2009)
19. Prabhu U.H. and Savvides J.M., Unconstrained Pose
Invariant Face Recognition Using 3D Generic Elastic
Models, IEEE Transactions on Pattern Analysis and
Machine Intelligence, 1-1 (2011)
20. Gross R., Matthews I. and Baker S., Appearance-based
face recognition and light-fields, IEEE Transactions on
Pattern Analysis and Machine Intelligence, 26, 449-465
(2004)
21. Sharif M., Mohsin S. and Javed M.Y., A Survey: Face
Recognition Techniques, Research Journal of Applied
Sciences, Engineering and Technology, 4, 4979-4990
(2012)
14