Research Article
Research Article
Research Article
Abstract
INTRODUCTION
In today's digital age, handwriting recognition and interpretation software as well as the
digitalization of handwritten documents are essential. Applications for handwriting recognition
We will use a dataset of handwritten digits from Kaggle for this project, which includes photos
of different handwritten numbers. The dataset will be preprocessed in order to standardise it.
These stages include shrinking the photographs to a constant dimension, turning them to
grayscale in order to reduce the amount of data they generate, and normalising the pixel values to
fall within a common range. Additionally, to help the neural network successfully analyse the
data, we will flatten the 2D images into a 1D array. The set of handwritten digits in the dataset
consists of about 30,000 training photos and 3,000 testing images.
Designing an Artificial Neural Network (ANN) model for handwriting detection is the strategy
we've decided to take. The input layer will include 784 neurons that represent the flattened image
data, and the architecture will have two hidden layers with 128 neurons each. The hidden layers
will get the Rectified Linear Unit (ReLU) activation function, while the
The Softmax activation feature will be used by the output layer. The categorical cross-entropy
loss function will be used to train the model, and the Adam optimizer with a learning rate of
0.001 will be used to optimise it.
After the model has been trained on the supplied dataset, we will analyse its accuracy and overall
recognition capabilities by evaluating its performance on a different testing dataset. We will
create a Graphical User Interface (GUI) using the Streamlit library to guarantee usability and
accessibility. End users will be able to communicate with the system through the GUI and draw a
digit with a mouse or touchscreen input. The interface will take a picture of the sketch and send
it to the trained model for analysis. The user will see the recognised digit and a confidence score
Literature Review:
Y. LeCun, Bottou, and others. This foundational book provided the first discussion of
Convolutional Neural Networks (CNNs) for document recognition, including handwriting
detection. The authors demonstrated how highly accurate and robustly CNNs can recognise
handwritten characters. Later advancements in deep learning-based handwriting recognition
systems benefited from their study.It was recommended that offline handwriting identification be
carried out using recurrent neural networks (RNNs), more specifically Bidirectional Long Short-
Term Memory (BLSTM) networks, in a study report by Graves, A., Liwicki, et al. The
researchers demonstrated how the memory cells in the LSTM units and the bidirectional nature
of the network improved recognition accuracy, particularly for cursive handwriting.Simard et al.
wrote the text. In this study, recommended training procedures for Convolution Neural Networks
(CNNs) for visual document analysis, including handwriting recognition, were given. The
authors discussed a number of elements, such as network architecture, training procedures, and
data pretreatment, that can enhance CNN models' performance and increase their
generalizability. Their work contributed to the advancement of CNN-based handwriting
recognition systems.This significant book chapter by Graves, et al. provides a complete overview
of sequence labelling issues using recurrent neural networks (RNNs), including handwriting
recognition. Various RNN topologies, training methods, and strategies for handling with input
sequences of different lengths were all explored by the author. The chapter was a helpful tool for
academics and professionals working on handwriting recognition using RNNs.The first article by
Bluche, et al. to discuss the Multi-Dimensional LSTM (MDLSTM) with attention mechanisms
for end-to-end handwritten paragraph detection. By selectively paying attention to different parts
The current system has taken a variety of approaches to handwriting recognition, including rule-
based methods, statistical methods, and conventional machine learning algorithms. To recognise
handwritten language, these techniques frequently rely on intricate algorithms and manually
constructed features. The speed and precision of recognition are constrained, and they struggle to
deal with a wide range of handwriting styles and intricate patterns.
Advantages of the Proposed System:
Unstructured and non-numeric data formats like image, text, and speech are accepted by
convolution neural networks (CNN) and recursive neural networks (RNN). The only subject of
this article is artificial neural networks. Architecture of artificial neural networks 1. The input
layer, hidden layer (more than one), and output layer are the three layers that make up the
network architecture. They are sometimes referred to as MLPs (Multi-Layer Perceptrons)
because of their many layers.
Pre-Processing: Pre-processing takes an image as input and cleans it. It effectively improves the
image by removing disturbance. In addition, images may be required to be in grayscale or binary
format, which is accomplished at this stage [10].
Segmentation Using a segmentation technique, individual characters are separated after the
input images have been preprocessed. These characters are then stored as an image sequence.
Then, if available, boundaries within each character image are eliminated. Next, individual
character sizes are determined [10].
Feature Extraction:
Character segmentation is followed by the extraction of features. In our case, CNN with the
ReLU activation function is used to extract the features, as shown in Figure 1. CNN reduces the
size of each character image matrix through convolution and pooling. Using the ReLU function,
the reduced matrix is compacted to vector form. This vector is considered to be a feature vector
[5].
Classification and Recognition: The derived feature vector is used as individual input to
formulate corresponding class. During the training phase, the parameters, biases, and weights are
calculated. The calculated parameters, biases, and weights are used in the testing phase for
classification and recognition purposes [11].
Using the MNIST dataset, handwritten digit recognition is a significant effort that was created
with the use of neural networks. In essence, it recognises the scanned copies of handwritten
numbers.
Our handwritten digit identification technology goes a step further in that it can now recognise
handwritten numbers typed directly on the screen with the aid of an integrated GUI in addition
to detecting them in scanned photos.
Approach:
We will approach this project by using a three-layered Neural Network.
The input layer: It distributes the features of our examples to the next layer for calculation
of activations of the next layer.
The hidden layer: They are made of hidden units called activations providing nonlinear
ties for the network. A number of hidden layers can vary according to our requirements.
The output layer: The nodes here are called output units. It provides us with the final
prediction of the Neural Network on the basis of which final predictions can be made.
Sample screens
CONCLUSION
[1] Plamondon R and Srihari S N 2000 IEEE Trans. on Patt. Anal. and Mach. Intelligence 22 63
[2] Kato N, Suzuki M, Omachi S I, Aso H and Nemoto Y 1999 IEEE Trans. on Patt. Anal. and Mach.
Intelligence 21 258
[3] Islam M M, Tayan O, Islam M R, Islam M S, Nooruddin S, Kabir M N and Islam M R 2020
IEEE Access 8 166117
[4] Gu J, Wang Z, Kuen J, Ma L, Shahroudy A, Shuai B, Liu T, Wang X, Wang G, Cai J and
Chen T 2018 Patt. Recognition 77 354
[5] Ptucha R, Such F P, Pillai S, Brockler F, Singh V and Hutkowski P 2019 Patt. Recognition
88 604
[7] Gan J, Wang W and Lu K 2020 Patt. Recog. Letters 129 190
[8] Fardous A and Afroge S 2019 Proc. Int. Conf. on Elect. Comp. and Comm. Engineering 4 1
[9] Das D, Nayak D R, Dash R and Majhi B 2020 Multi. Tools and Applications 79 33023
[10] Khoon L L and Chuin L S 2016 Int. J. of Soft. Eng. and Comp. Systems 2 10
[11] Sallam A A, Kabir M N, Alginahi Y M, Jamal A and Esmeel T K 2020 Proc. Int. Colloq.
on Sign. Proc. & Its Applications 3 255
12 Graves, A., Liwicki, M., Fernández, S., Bertolami, R., Bunke, H., & Schmidhuber, J.
(2009). Unconstrained On-Line Handwriting Recognition with Recurrent Neural Networks.
14 Bluche, T., Ney, H., & Kermorvant, C. (2017). Joint Line Segmentation and
Transcription for End-to-End Handwritten Paragraph Recognition.
15 Wang, K., Belaid, A., & Ahmed, A. B. (2018). Writer Independent Handwritten Text
Line Recognition with Hybrid Recurrent Neural Networks.
16 Graves, A., Fernández, S., Gomez, F., & Schmidhuber, J. (2006). Connectionist
Temporal Classification: Labelling Unsegmented Sequence Data with Recurrent Neural
Networks.
17 Jaeger, S., & Manmatha, R. (2001). A Hierarchical Neural Network for Handwritten Text
Recognition.
18 Breuel, T. M., Ul-Hasan, A., Al-Azawi, M. A., & Shafait, F. (2013). High-Performance
OCR for Printed English and Fraktur using LSTM Networks.
19 Zhang, X., Yin, F., Zhang, Y., Liu, C. L., & Bengio, S. (2019). Online and Offline
Handwriting Recognition: AComprehensive Survey.