Handwritten Character Recognition Using Multiscale Neural Network Training Technique
Handwritten Character Recognition Using Multiscale Neural Network Training Technique
Handwritten Character Recognition Using Multiscale Neural Network Training Technique
images. Secondly selective thresholding using minimum distance with modifications in the input training vectors.
technique is proposed to be used to increase the level of accuracy of The recognition accuracy of the handwritten characters
character recognition. A simulator program (a GUI) is designed in depends a lot on the exemplars that are used for recognition.
such a way that the characters can be located on any spot on the
In general, the overall recognition process can be divided into
blank paper in which the characters are written. The results show that
such methods with moderate level of training epochs can produce 3 main sections, namely segmentation, preprocessing, and
accuracies of at least 85% and more for handwritten upper case classification [6]. Segmentation requires isolating the
English characters and numerals. characters individually before they are fed to the
preprocessing unit where the important features of characters
Keywords—Character recognition, multiscale, backpropagation, (feature extraction) are identified. Finally, classification
neural network, minimum distance technique. process is done by determining the category or the group of
each character used during the recognition process.
I. INTRODUCTION
International Scholarly and Scientific Research & Innovation 2(3) 2008 638
World Academy of Science, Engineering and Technology
International Journal of Computer and Information Engineering
Vol:2, No:3, 2008
h
RH = (3)
hmax
Fig. 1 Sample of character A
Relative-height ratio is defined as object’s bounding box
Note that the scanned or captured images are in RGB scale. height, h, over maximum bounding box height among all
These images have to be converted into grayscale format objects, hmax.
before further processing can be done. Using appropriate For example, if the RW of one object exceeds RWmin,
grayscale thresholding, binary images are to be created. (where RWmin is the threshold value for comparison), that
object will be captured. Similar analogy goes for RH and
Digital Open Science Index, Computer and Information Engineering Vol:2, No:3, 2008 waset.org/Publication/2037
International Scholarly and Scientific Research & Innovation 2(3) 2008 639
World Academy of Science, Engineering and Technology
International Journal of Computer and Information Engineering
Vol:2, No:3, 2008
in the hidden layer (layer 2) is taken arbitrarily by trial and neural network simulation.
error to be 1500 [7].
International Scholarly and Scientific Research & Innovation 2(3) 2008 640
World Academy of Science, Engineering and Technology
International Journal of Computer and Information Engineering
Vol:2, No:3, 2008
G
reconsidering the vector P of all G these characters. The The calculated dfr1,2 will be compared against the threshold
character with the next smallest P will be the second th1,2 where the subscripts 1 and 2 refer to the output values
character (first row, second column) and so on until all of the used (1 means highest output, 2 means second highest output).
characters in the first row have been considered. Once this is Note that th1,2 is not fixed (different pair of characters has
G
different th1,2) and is determined based on certain algorithm
done, the vector magnitude, P of the remaining characters (T,
(selective thresholding). If th1,2 ≥ dfr1,2, then minimum
H, E, R, and E) will be computed to determine the first
character of the second row. Again, the one with the smallest distance would be applicable. Minimum distance, MD, simply
P
G means the sum of the squared differences of the corresponding
will be the first character of second row. Similar procedure pixel intensity values between a pair of image set (template
is repeated to determine the remaining characters in the image and the input sample image).
second row.
m n
VII. NEURAL NETWORK SIMULATION USING SELECTIVE MD = ∑ ∑ (aij − bij ) 2
THRESHOLDING MINIMUM DISTANCE TECHNIQUE (MDT) i =1 j =1
Prior to network simulation, the captured character images Equation 7: Given template image and input image samples with m
will need to be converted into input vectors and this step is number of rows and n number of columns, the MD is computed as
shown, where ai,j and bi,j are pixel intensity values for template
Digital Open Science Index, Computer and Information Engineering Vol:2, No:3, 2008 waset.org/Publication/2037
International Scholarly and Scientific Research & Innovation 2(3) 2008 641
World Academy of Science, Engineering and Technology
International Journal of Computer and Information Engineering
Vol:2, No:3, 2008
Discussion
It is shown that MST training allows faster convergence.
Digital Open Science Index, Computer and Information Engineering Vol:2, No:3, 2008 waset.org/Publication/2037
International Scholarly and Scientific Research & Innovation 2(3) 2008 642
World Academy of Science, Engineering and Technology
International Journal of Computer and Information Engineering
Vol:2, No:3, 2008
REFERENCES
[1] Wu, P.H. (2003), Handwritten Character Recognition, B.Eng (Hons)
Thesis, the School of Information Technology and Electrical
Engineering, the University of Queensland.
[2] Liou, C.Y. & Yang, H.C. (1996), “Hand printed Character Recognition
Based on Spatial Topology Distance Measurement”, IEEE Transactions
On Pattern Analysis and Machine Intelligence, Vol. 18. No. 9, pp 941-
945.
[3] Didaci, L. & Giacinto, G. (2004), Dynamic Classifier Selection by
Adaptive k-Nearest-Neighbourhood Rule, Available:
http://ce.diee.unica.it/en/publications/papers-prag/MCS-Conference-
th
19.pdf (Accessed: 2007, October 11 ).
[4] Brown, E.W. (1993), Applying Neural Networks to Character
Recognition, Available:
http://www.ccs.neu.edu/home/feneric/charrecnn.html (Accessed: 2007,
Digital Open Science Index, Computer and Information Engineering Vol:2, No:3, 2008 waset.org/Publication/2037
th
October 11 ).
[5] Robinson, G. (1995), The Multiscale Technique, Available:
http://www.netlib.org/utk/lsi/pcwLSI/text/node123.html (Accessed:
2007, October 11th).
[6] Handwritten Character Recognition, Available:
http://tcts.fpms.ac.be/rdf/hcrinuk.htm (Accessed: 2007, October 11th).
[7] Rivals I. & Personnaz L. A statistical procedure for determining the
optimal number of hidden neurons of a neural model. Second
International Symposium on Neural Computation (NC.2000), Berlin,
May 23-26 2000.
International Scholarly and Scientific Research & Innovation 2(3) 2008 643