Auto Correction
Auto Correction
Auto Correction
Abstract
The current way of checking subjective paper is adverse.
Evaluating the Subjective Answers is a critical task to per-
form. When human being evaluates anything, the quality of
evaluation may vary along with the emotions of Person. In
Machine Learning, all result is only based on the input data
provided by the user. Our proposed system uses machine
learning and NLP to solve this problem. Our Algorithm
performs a task like Tokenizing words and sentences, Part
of Speech tagging, Chunking, Chinking, Lemmatizing words
and Wordnetting to evaluate the subjective answer. Along
with it, our proposed algorithm provides the semantic mean-
ing of the context. Our System is divided into two modules.
The first one is extracting the data from the scanned images
and organizing it in the proper manner and the second is
applying ML and NLP to the text retrieved from the above
step and giving marks to them.
Key Words:Nave bayes, Cosine Similarity, Classifier,
Semantic Checking, Machine Learning
1
International Journal of Pure and Applied Mathematics Special Issue
1 Introduction
The manual system for evaluation of Subjective Answers for tech-
nical subjects involves a lot of time and effort of the evaluator.
Subjective answers have various parameters upon which they can
be evaluated such as the question specific content and writing style.
Evaluating subjective answers is a critical task to Perform. When
human being evaluates anything, the quality of evaluation may
vary along with the emotions of the person. Performing evaluation
through computers using intelligent techniques ensures uniformity
in marking as the same inference mechanism is used for all the stu-
dents. In Machine Learning, all result is only based on the input
data provided by the user. Our Proposed System uses machine
learning and NLP to solve this problem. Our Algorithm performs
a task like Tokenizing words and sentences, Part of Speech tag-
ging, Chunking, chinking, Lemmatizing words and Wordnetting to
evaluate the subjective answer. Along with it, our proposed algo-
rithm provides the semantic meaning of the context. Our System
is divided into two modules, Extracting the data from the scanned
images and organizing it in the proper manner and Applying ML
and NLP to the text retrieved from the above step and giving marks
to them. The software will take a scanned copy of the answer as
an input and then after the preprocessing step, it will extract the
test of the answer. This text will again go through processing to
build a model of keywords and feature sets. Model answer sets
and keywords categorized as mentioned will be the input as well.
The classifier will then, based on the training will give marks to
the answers. Marks to the answer will be the final output. The
need for online examination aroused mainly to overcome the draw-
backs of the existing system. The main aim of the project is to
ensure user-friendly and more interactive software to the user. The
online evaluation is a much faster and clear method to define all
the relevant marking schemes. It brings much transparency to the
present method of answer checking The answers to all the questions
after the extraction would be stored in a database. The database is
designed as such that it is very easily accessible. Automating repet-
itive tasks has been the main aim of the industrial and technological
revolution. The work of checking hundreds of answer sheets which
more or less contains the same answer can be quite a boring task
2
International Journal of Pure and Applied Mathematics Special Issue
for the teachers. This system can be used instead in order to re-
duce their burden. It will save a lot of effort and time on teachers
part. The human efforts applied in this repetitive task can be saved
and spent more in other academic endeavors. The obvious human
mistakes can be reduced to obtain an unbiased result. The system
calculates the score and provides results fairly quickly. This sys-
tem can be widely used in academic institutions such as schools,
colleges, coaching and institutes for checking answer sheets. It can
also be implemented in different organizations which conduct com-
petitive examinations.
The software will take scanned copy of the answer as an input
and then after the preprocessing step it will extract the test of the
answer. This text will again go through processing to build a model
of keywords and feature sets. Model answer sets and keywords
categorized as mentioned will be the input as well. Classifier will
then, based on the training will give marks to the answers. Marks
to the answer will be the final output.
The paper is organized as follows: Section II contains the review
of related work. Section III gives brief idea about working of system.
Section IV contains Experimental Analysis and section V contain
the conclusions of this research work.
2 LITERATURE SURVEY
Evaluation of subject answer checking isnt a new thought. It has
been in the works since a decade and a half. A large number of tech-
niques where experimented with to solve the problem efficiently.
Natural Language processing, Latent Semantic Analysis, General-
ized Latent Semantic Analysis, Bayes theorem, K- nearest neighbor,
etc. In general they can be categorized as follows : Clustering tech-
niques, classification techniques and natural language processing
techniques.Intelligent Essay evaluator developed by Landauer[3],[4-
7] in 2003 using a technique known as Latent Semantic Analysis. It
gives results in the accuracy range of 60-90 %. A slightly better ver-
sion of using the probabilistic LSA technique[8-10] used to develop
automatic essay evaluator tool by Kakkonen. Generalized LSA[11]
technique extends the LSA approach by working on vectors(n-gram,
bag of vectors) instead of the dual document-term representation.
3
International Journal of Pure and Applied Mathematics Special Issue
3 WORKING
This system can be widely used in academic institutions such as
schools, colleges, coaching and institutes for checking answer sheets.
It can also be implemented in different organizations which conduct
competitive examinations. Our Algorithm performs a task like To-
kenizing words and sentences, Part of Speech tagging, Chunking,
chinking, Lemmatizing words and Wordnetting to evaluate the sub-
jective answer. Along with it, our proposed algorithm provides the
semantic meaning of the context.
4
International Journal of Pure and Applied Mathematics Special Issue
5
International Journal of Pure and Applied Mathematics Special Issue
6
International Journal of Pure and Applied Mathematics Special Issue
7
International Journal of Pure and Applied Mathematics Special Issue
We have trained our model using above dataset. The values that
we have defined in the yable are set according to the requirement of
the answer. The evalautor/moderator/teacher of the answersheet
can define these values for themselves to suit their needs.
8
International Journal of Pure and Applied Mathematics Special Issue
4 EXPERIMENTAL ANALYSIS
We have given 3 questions to each student. Total number of stu-
dent was 20.Each question carries 5 Marks .All answers are evalu-
ated firstly by 10 Professors then our algorithm will evaluate them
.Then the similarity betweenProfessorEvaluation and our algorith-
mevaluation is taken into consideration .we have found :-
We have made python flask web app for experiment purpose, where
students will write the subjective question answersand we also have
made an android application to show the results.
9
International Journal of Pure and Applied Mathematics Special Issue
5 CONCLUSION
The techniques discussed and implemented in this project should
have a high agreement (up to 90 percent) with Human Performance.
The project works with the same factors which an actual human be-
ing considers while evaluation such as length of the answer, presence
of keywords, and context of key-words. Use of Natural Language
Processing coupled with robust classification techniques, checks for
not only keywords but also the question specific things. Students
will have certain degree of freedom while writing the answer as the
system checks for the presence of keywords, synonyms, right word
context and coverage of all concepts. It is concluded that using ML
techniques will give satisfactory results due to holistic evaluation.
The accuracy of the evaluation can be increased by feeding it a
huge and accurate training dataset. As the technicality of the sub-
ject matter changes different classifiers can be employed. Further
improvement by taking feedback from all the stakeholders such as
students and teachers can improve the system meticulously.
10
International Journal of Pure and Applied Mathematics Special Issue
References
[1] B. Rujiang and L. Junhua, Improving documents classifica-
tion with semantic features, 2nd Int.Symp. Electron. Commer.
Secur. ISECS 2009, vol. 1, pp. 640643, 2009.
11
International Journal of Pure and Applied Mathematics Special Issue
12
International Journal of Pure and Applied Mathematics Special Issue
13