Resume With Research Statement Saurabh
Resume With Research Statement Saurabh
Resume With Research Statement Saurabh
SPONSORED PROJECT
TITLE : Brain Fingerprinting: Detection of Physiological Based Concealed Information in the Brain of
Culprit using Machine Learning Techniques
FUNDING AGENCY : DRDO-INMAS (2016)
AMOUNT OF GRANT : Rs. 9.89 Lacs
ROLE : Co- Investigator
TITLE : Development of Speaker and Dialect Recognition models for Forensic Applications*
FUNDING AGENCY : DeitY
AMOUNT OF GRANT : Rs. 45,00,000
ROLE : Principal Investigator
• Working Group Recommended; Final sanction letter awaited
pg. 1
Saurabh Bhardwaj, Ph.D.
PUBLICATIONS
SCI JOURNALS
[1] S. Srivastava, Gopal, S. Bhardwaj. "Multi-scenario dataset for speaker recognition." Journal of
Intelligent & Fuzzy Systems 34, no. 3 (2018): 1385-1392.
(IMPACT FACTOR: 1.261)
[2] Gopal, S. Srivastava, S. Bhardwaj. "Feature Extraction Methods for Speaker Recognition: A Review."
International Journal of Pattern Recognition and Artificial Intelligence 31, no. 12 (2017): 1750041.
(IMPACT FACTOR: 0.660)
[3] G. Chaudhary, S. Srivastava, S. Bhardwaj and S. Bhargava, “Fusion of palm-phalanges print with
Palmprint and dorsal hand vein,” Applied Soft Computing, Elsevier, vol. 47, pp. 12-20, 2016.
(IMPACT FACTOR: 2.857)
[4] S. Bhardwaj, S. Srivastava, M. Hanmandlu, J.R.P. Gupta, “GFM Based Methods for Speaker
Identification,” IEEE Transactions on Cybernetics, vol.43, pp. 1047-1058, 2013.
(IMPACT FACTOR: 4.943)
[5] S. Bhardwaj, V. Sharma, S. Srivastava, O.S. Sastri, J.R.P. Gupta, S.S. Chandel, B. Bandyopadhyay,
“Estimation of solar radiation using a combination of Hidden Markov Model and Generalized Fuzzy
Model,” Solar Energy, Elsevier, vol. 93, pp. 43-54, 2013.
(IMPACT FACTOR: 3.685)
[6] S. Bhardwaj, S. Srivastava, J.R.P Gupta, “Pattern Similarity based model for Time Series Prediction,”
Computational Intelligence, Wiley, vol. 31, pp. 106-131, 2013.
(IMPACT FACTOR: 0.722)
INTERNATIONAL CONFERENCES:
[1] P. Bhola, S. Bhardwaj. "Solar energy estimation techniques: A review." In Power Electronics
(IICPE), 2016 7th India International Conference on, pp. 1-5. IEEE, 2016.
[2] Gopal, S. Srivastava, S. Bhardwaj, S. Srivastava. "Information Fusion in Animal Biometric
Identification." In Proceedings of the 5th International Conference on Frontiers in Intelligent
Computing: Theory and Applications, pp. 609-617. Springer, Singapore, 2017.
[3] G. Chaudhary, S. Srivastava, S. Bhardwaj, “Multi-level Fusion of Palmprint and Dorsal Hand Vein,”
in Proceedings of 3rd International Conference on Information Systems Design and Intelligent
Applications, Springer, pp. 321-330, 2016. (BEST PAPER AWARD)
[4] G. Chaudhary, S. Srivastava, S. Bhardwaj, P. Kiran, “Gaussian Membership Function-Based
Speaker Identification Using Score Level Fusion of MFCC and GFCC,” in Proceedings of the
International Congress on Information and Communication Technology: ICICT 2015, Springer, pp.
283—291, 2016
[5] S. Bhardwaj, S. Srivastava, “Choquet Fuzzy Integral Based Controller,” IEEE Conference on
Intelligent Systems and Control, Coimbatore, 2013
[6] S. Srivastava, S. Bhardwaj, J.R.P Gupta, “A novel clustering approach using shape based similarity,”
Intelligent Informatics, springer, pp. 17–27, 2013.
[7] S. Srivastava, S. Bhardwaj, A. Bhandari, K. Gupta, H. Bahl, J.R.P. Gupta, “Wavelet packet based mel
frequency cepstral features for text independent speaker identification,” Intelligent Informatics,
springer, pp. 237–247 2013.
[8] S. Srivastava, S. Bhardwaj, “A Novel Hybrid Model for Solar Radiation Prediction,” IEEE
Conference on Emerging Trends in Electrical Engineering and Energy Management, Chennai, 2012.
pg. 2
Saurabh Bhardwaj, Ph.D.
[9] S. Bhardwaj, S. Srivastava, J.R.P Gupta, A. Madhvan, “A novel shape based batching and prediction
approach for sunspot data using HMMs and ANNs,” IEEE Conference on Power Electronics, NSIT,
New Delhi, pp. 1-5, 28-30 Jan., 2011
[10] S. Bhardwaj, S. Srivastava, S. Vaishnavi, J.R.P Gupta, “Chaotic time series prediction using
combination of Hidden Markov Model and Neural Nets,” IEEE Conference on Computer Information
Systems and Industrial Management Applications, Gwalior, pp. 585-589, 8-10 Oct., 2010.
[11] S. Srivastava, S. Bhardwaj, A. Madhavan, J.R.P Gupta, “A novel shape based batching and
prediction approach for time series using HMMs and FISs,” IEEE Conference on Intelligent Systems
Design and Applications, Cairo, Egypt, pp. 929-934, Nov. 29- Dec 1, 2010
EXPERIENCE
2002-2016 TEACHING
June 2014 – THAPAR UNIVERSITY, PATIALA
Till Now ELECTRICAL AND INSTRUMENTATION ENGINEERING DEPARTMENT
Associate Professor
• Coordinated the Self-Assessment Report of NBA (Washington Accord Signatory) for UG EIC
• Member Self-Study Report ABET for UG EE
• UG Coordinator (EIC)
Feb. 2009 – GALGOTIAS UNIVERSITY, GREATER NOIDA
June 2014 SCHOOL OF ELECTRICAL, ELECTRONICS AND COMMUNICATION ENGINEERING
Assistant Professor
• Program Chair (Electronics & Communication Engineering) for B. Tech & M. Tech Programme
• Research Coordinator (School of Electrical, Electronics & Communication Engineering)
July 2009– NETAJI SUBHAS INSTITUTE OF ENGINEERING AND TECHNOLOGY (NSIT), NEW DELHI
Jan. 2013 DEPARTMENT OF INSTRUMENTATION & CONTROL ENGINEERING
Teaching Cum Research Fellow
June 2008 – DEWAN V.S INSTITUTE OF ENGINEERING & TECHNOLOGY, MEERUT
July2009 ELECTRONICS & INSTRUMENTATION ENGINEERING DEPARTMENT
Assistant Professor
• Head (Electronics & Instrumentation Engineering Department)
• Assistant Centre Superintendent (UPTU-SEE)
Aug. 2003 – RADHA GOVIND ENGINEERING COLLEGE, MEERUT
July 2006 ELECTRONICS DEPARTMENT (ECE & EIE)
Senior Lecturer
• Department Coordinator (ECE & EIC)
• Coordinator: Embedded System Training Program
Aug. 2002– IIMT ENGINEERING COLLEGE , MEERUT
Jul. 2003 ELECTRONICS AND COMMUNICATION ENGINEERING DEPARTMENT
Lecturer
2001– 2002 INDUSTRY
June 2001– HOTLINE SWITCHGEAR & CONTROLS, DELHI
July 2002 R&D Engineer
• Design and Development of Microcontroller based Single Phase Energy Meter
• Best performance award in R & D by the Managing Director of Hotline Switchgear & Controls
pg. 3
Saurabh Bhardwaj, Ph.D.
AWARDS
• Best Paper Award in Springer Conference: G. Chaudhary, S. Srivastava, S. Bhardwaj, “Multi-level fusion of palm
print and dorsal hand vain,” 3rd International Conference on Information Systems Design and Intelligent Applications,”
on Jan 9, 2016, Visakhapatnam, India.
• Best Presentation Award in IEEE workshop: S. Bhardwaj, S. Srivastava, Sanidhya, Mani, “Multi-Environment
Dataset for Speaker Identification,” Computational Intelligence: Theories, Applications and Future Directions on July
14, 2013 at IIT Kanpur, India.
SAURABH BHARDWAJ
pg. 4
Saurabh Bhardwaj, Ph.D.
Research Statement
My research interests are focused on the development and improvement of statistical and machine
learning based solutions for the problems that involve recognition, classification, clustering, modelling
and information retrieval.
As part of my Ph.D. thesis, I have investigated different statistical and soft-computing models
and their combinations for different applications including speaker recognition, clustering, solar
radiation estimation, control and for time series forecasting. The idea of combining different models is
that their individual weaknesses can be compensated by their individual merits while solving complex
problems. During this research, I have used the statistical techniques such as Hidden Markov Model
(HMM) and Gaussian Mixture Model (GMM) and soft-computing techniques such as Artificial Neural
Network (ANN), Fuzzy Logic (FL) and Choquet Fuzzy Integral(CFI).
In the paragraphs that follow, I will describe my contributions to each of the areas stated above
and give the related and other research directions that I am planning to pursue in the future.
Early Research
Speaker Recognition.
There are a number of situations in which correct recognition of persons is required. The use of biometric-
based recognition is the most “natural” way of recognizing a person. This is also very safe as these
characteristics cannot be stolen or forgotten. One of the benefits of using speech over other biometric
traits is that it can be used remotely and it contains both the physiological and behavioural
characteristics of humans.
I have developed three approaches for the text independent speaker identification of which two
methods utilize both the continuous density hidden Markov model (HMM) and the generalized fuzzy
model (GFM), which has the advantages of both Mamdani and Takagi–Sugeno models.
In the first approach, the HMM is utilized for the extraction of pattern similarity-based batch
feature vector that is fitted with the GFM to identify the speaker. Next HMM is applied on the feature
data of all users to yield the HMM parameters assuming a certain number of states and Gaussian
mixtures. The feature data are clustered into batches by sorting the Log Likelihood (LL) values obtained
from the forward algorithm. The sorted LL values are thresholded into batches by finding the correlation
coefficients. When this is done, the features of different users fall into different batches. Hence, each
feature is identified with the associated LL and the user number. These batches are then fitted with
GFM. When the test feature vector arrives, its LL is found by the learned HMM parameters. This
indicates which model it belongs to. The output of the GFM gives the identity of the speaker.
In the second approach, the equivalence between the defuzzified output of the GFM and the
conditional mean of the GMM under certain conditions is used for the identification of speakers. In this
the parameters of the GFM are calculated with the help of GMM. Here, GMM–GFMs as many as the
number of users are found using the feature data of all users with the difference that the output values
corresponding to the user are taken as unity with all other output values remaining zero. When the test
feature arrives, the outputs of all the speaker models are evaluated, and the model with the highest
output gives the user identity.
Finally, the third method has been inspired by the way humans cash in on the mutual
acquaintances while identifying a speaker. In this, several sets of HMM models are created with different
initial parameters such as the number of states and the number of Gaussian mixtures. Out of these, any
two sets of HMM models that identify most of the speakers correctly but give different misidentified
speakers are selected. If all the misidentified speakers are different, then we term it as zero similarity in
the misidentification. If one of the misidentified speakers is the same, then we have one similarity in the
misidentification and so on. When the test feature vector arrives, the outputs are evaluated with the two
sets of HMMs. If both the sets of models provide the same output for a particular speaker, then we get
the speaker’s identity. If not, we calculate the output from the GFM to distinguish between the two.
pg. 5
Saurabh Bhardwaj, Ph.D.
Solar Radiation Estimation.
Owing to considerable advancement during last two decades, solar photovoltaic has become a reliable
technology for power production worldwide. For solar radiation estimation, a model is developed which
uses HMM with Pearson R model for the extraction of shape-based clusters from input meteorological
parameters and it is then processed by GFM to accurately estimate the solar radiation. The estimation
method used in this work exploits the pattern identification prowess of HMM for cluster selection and
generalization and nonlinear modelling capabilities of GFM to predict the solar radiation.
Present Research:
pg. 6
Saurabh Bhardwaj, Ph.D.
samples of the individual in English and Hindi in read speech styles under two environments i.e. a
controlled recording chamber and library. To study the aging effect, an aging speaker database of 53
famous personalities was collected from online source varying over a period of 10–20 years.
Speaker Recognition.
As part of my early work in speaker recognition I have worked on the development of models for biometric
speaker recognition but as a part of my future research plan I would like to extend this work for forensic
speaker recognition. The models developed for biometric speaker recognition are not suitable for forensic
case. For the law enforcement agencies, a ‘confidence measure’ is required for speaker recognition. In
this, probabilistic certainty level of correctness is addressed for every decision based on statistics with
known error rates generated from large sample populations.
Normally, the speech samples which are received for forensic examination and comparison are in
conversational form so it is my plan to develop models for identification and verification in forensic
scenario for the speaker present in a conversational speech along with the conventional task of speaker
detection where speech by only a single speaker is present. For the development of models in forensic
scenario my plan is to apply the following techniques:
a. To model the feature vectors with the help of non-additive fuzzy systems in which the consequent
part of fuzzy rule in T-S model will be replaced by Choquet Integral.
b. To use the information set theory for the extraction of features.
c. To use deep learning for the fuzzy and neural networks.
Dialect Identification.
My further plan is to develop models for dialect recognition for forensic applications. As the speech of a
person belonging to a dialectal group always has its specific dialectal accent and this shows the potential
in the investigation of crime as accented speech carries linguistic information regarding the regional
dialect of an individual. Classification of speakers by their dialects is of particular interest if a suspect is
classified according to his/her dialect then this would assist in police investigation in narrowing down
the search region and can help in the person identification tasks. Normally, the speech samples which
are received for forensic examination and comparison are in conversational form in which multiple
speakers are present with channel variability. Therefore, a model for the identification of dialects is
proposed for single and conversational speech. Approximately, 600 million people across the globe speak
Hindi with about 200 different regional dialects. It is impossible to collect a single database having all
possible speaking styles, channel variation, noises and dialectal information. So, keeping in view the
crime investigation and law enforcement cases in the Indian scenario it is also in my plan to develop a
database of dialects of Hindi language in various forensic situations. The work will be helpful at any
phase of justice, for police investigation, for investigating agencies, and for researchers working in the
area of dialect recognition.
pg. 7
Saurabh Bhardwaj, Ph.D.