ISSN: 2350-0328
International Journal of Advanced Research in Science,
Engineering and Technology
Vol. 5, Issue 5 , May 2018
Studying of Educational Data Mining
Techniques
Saja Taha Ahmed Prof. Dr. Rafah shihab Al-hamdani Dr. Muayad Sadik Croock
Informatics Institute of Postgraduate studies,University Of Information Technology and
Communication, Baghdad, Iraq
Informatics Institute of Postgraduate studies University Of Information Technology and
Communication, Baghdad, Iraq
Computer Engineering Department,University of Technology ,Baghdad, Iraq
ABSTRACT: The Electronic (E) - Learning attracts the attention of researchers in the recent years. This for different
reasons, such as easing the board studying and guarantee the education for busy people. Different methods and
algorithm have been adopted in e-learning systems to offer more flexible services for students. In addition, the recent
smart systems consider the prediction strategies for expecting the logical results of different categories in e-learning.
The researcher goes further with decision making for students, presented as a recommendation for each type of
classification. Moreover, the e-learning systems use the classification and clustering methods for classifying the
investigated dataset. In this paper, a comprehensive study of the recent e-learning decision making and prediction is
presented. It offers a wide information regarding the subject of decision making and prediction in e-learning that can
improve them efficiently. Discussion and recommendations have been included in this paper.
KEYWORDS: Educational Data Mining; Student Performance Prediction; Hybrid Technique.
1. INTRODUCTION
Data mining (DM) is defined as Knowledge Discovery in Databases (KDD), used for discovering hidden information
from different data sources, multiple data mining techniques can be applied in various fields including marketing,
business, medical, etc [1].
In recent years there was an intense attention in the field of educational data mining (EDM) as a result of the
development of e-learning systems, the massive volume of educational data, and even evolving traditional education.
Educational data mining is a subdomain from data mining which has been developed to extract and analyze new
information from educational data sources [2].
Early Student academic performance (SAP) prediction can support educational institutes, universities and schools in
improving their education quality by forecasting student level activities, student' subject interests, factors (economic,
educational, behavioral, social, etc.) that affect student performance and identifying dropout points [3].
For the purpose of early notifying student activity level, machine learning utilized along with EDM. This can improve
the acquisition of knowledge or skills on how to treat weak points at student's achievement. In the future, machine
learning techniques can be considered as an integrated part of any educational institution instead of manual
investigation in related data which has great effort and time-consuming [4,5].
There are numerous machine learning and DM techniques that can be categorized into two types; either traditional DM
methods and compare accuracy among them or a combination of two or more methods (hybrid) to improve accuracy
and overcome drawbacks of a single approach. This paper aims to illustrate and study the main prediction techniques,
in addition, to discuss their findings, (pros& cons) in case of presenting, state on which features classification is
performed and suggest some recommended beneficial to the learning process.
II. LITERATURE REVIEW
Several research studies have dealt with the prediction of students’ performance in different levels. In the present
literature, a number of complementary approaches are offered with taking into consideration distinct sizes and sources
of datasets, as well as factors affecting student performance assessment.
Copyright to IJARSET
www.ijarset.com
5742
ISSN: 2350-0328
International Journal of Advanced Research in Science,
Engineering and Technology
Vol. 5, Issue 5 , May 2018
In [6], the authors utilized Apriori algorithm to extract similar patterns come from a different set of records.
Additionally, they employed the k-means cluster analysis to generate a group of students based on their personal and
academic fields.This study gave greater priority to psychological attributes than an academic one. Students were
grouped into two clusters good and poor.Dataset consisted of 100 students using 45 parameters of Government Higher
Secondary School and prediction was satisfactory.
The authors of [7] performed the preprocessing to the data that was collected from three different colleges of 300
students with 24 attributes. After employing the data cleaning by removing incomplete data, 22 features were selected.
Four types of classifier J48, PART, BayesNet and Random Forest were applied. Based on accuracy and classification
errors, a confusion matrix was found to show that Random Forest was more suitable than other classification methods,
Apriori algorithm also executed to extract association rule among all attributes and find the best-suited rules.
Another study of predicting the performance of students in [8] introduced two new features family expenditures and
student personal information. The authors analyzed these feature with other feature to determine the impact of the
selected attribute. Two generative (Bayes Network (BN), Naive Bayes (NB)) and three discriminative (SVM, C4.5,
CART), SVM was the most effective for finding the new feature among other methods.
In addition, soft computing methods played a major role in EDM. They were applied as a single or hybrid with other
methods. The authors of [9,10,11,12] analyzed student's academic level using the Neural Network (NN) trained by
decision tree learning or applied various types of NN to classify the students into the predicted success and failure
categories.
III. TRADITIONAL DM TECHNIQUES
The Prediction can be applied using DM that consider the following techniques:
classification.
clustering.
association rule mining.
Sequential and Pattern prediction.
Classification is the most popular methods used for SAP prediction in EDM [13][14]. This paper preview the most
frequently used methods applied for forecasting SAP, including classification and clustering.
a. CLASSIFICATION
Classification is a process of describing a model (or function) that distinguishes data classes. The model is built based
on data objects for which the class labels are known. After that, the model is used to predict the class label of unknown
label objects [13]. Classification is supervised learning method. It consists of two steps [15]:
Model construction by analyzing the training data.
Test data used for model evaluation.
The most popular method for data classification is the decision tree that includes many types.
Decision Tree
A decision tree is a tree-like structure, which starts from root attributes, each branch of the internal node being a
possible value of the attribute and ends with leaf nodes (class's names). Decision tree classifiers produce simple way
used to interpret the classification rules [9].
Many forms of decision tree search related student personal and social factors assumed to influence students’ academic
performance. Four kinds of decision tree classifiers: ID3, C4.5, CART, and CHAID have been used on a dataset of 270
records collected through classes and Google forms. This research found that the performance of students does not
depend entirely on academic factors, although there are other factors that have the same or greater impact on the level
of performance students [16].
Next research of [17] was used the same DT types on a different dataset. Predicted student performance can be found
by comparing between different decision trees classifiers. These classifiers are built based on various attributes belong
Copyright to IJARSET
www.ijarset.com
5743
ISSN: 2350-0328
International Journal of Advanced Research in Science,
Engineering and Technology
Vol. 5, Issue 5 , May 2018
to 466 students from an engineering college. The accuracy of each classifier was computed and it was shown that
CHAID has the highest accuracy of 76.11%, C4.5 with 73.13%. Table 1. shows accuracy results for four decision tree
types, dataset sizes, and attributes used for prediction in each research.
TABLE 1: DECISION TREE TYPE'S ACCURACY
RESEARCH
Attributes
[16]
Datasets size(record)
ID3
C4.5
CART
CHAID
270
33.33%.
35.19%.
40%.
34.07%.
466
70.15%
73.13%
71.64%
76.11%
Academic and social features
[17]
Previous Semester Marks
Internal Tests Grades
Project Marks
Attendance
Final Semester Marks (Semester 8)
Addition to that, the numerous forms of the decision tree can be compared with a different classifier (Neural Network,
Naïve Bayes, and KNN) in term of their accuracy. Table 2 shows the decision tree accuracy as compared to another
classifier. It can be said that decision tree has higher accuracy than other methods in investigated papers based on the
various attribute.
Table 2: Comparison accuracy of Decision Tree with other classification methods
Research
[20]
Methods
accuracy
J48
82.2%
Neural Network
[21]
[22]
Attribute
Gender, Nationality, Place of Birth, Stages,
80.0%
J48
75.8333 %
Random Forest
76.6667 %
Naive Bayes Naive
67.7083 %
Bayes Multinomial
43.9583 %
DT
92.96%
Level, Orphan, SHC, BirthYear, Camp,FatherWork,
Naïve Bayes
91.79
FailYears, PrevYear, first term, FinalResult
KNN
88.86%
Copyright to IJARSET
Gender, Nationality,
Place of birth…
www.ijarset.com
5744
ISSN: 2350-0328
International Journal of Advanced Research in Science,
Engineering and Technology
Vol. 5, Issue 5 , May 2018
Performance accuracy for each classification method for researches explained in Table 2 in recent two years (20172018) can be visualized in Figure 1.
Figure 1: DM Classification Technique's highest accuracy in 2017-2018
95
90
85
80
75
70
DT
NN
K-NN
Naïve Bayes
B. Clustering
Clustering is an unsupervised method that divides data into distinct categories, in which the data in the same set have
similar properties and differ from other data in other aggregates. Clustering has an important role in data mining
applications, for example, marketing, text mining, medical diagnostics, and many others [21].
There are various clustering techniques can be utilized in data mining. In [22], the authors provided a comprehensive
review of different clustering techniques in data mining. Figure II states the main categorization of clustering approach
in DM.
K-mean
Several research studies were reviewed EDM with various clustering methods. The authors of [23] presented EDM
clustering methods in an overall review by stating objective of presented paper, algorithm, and datasets. Most of the
goals are focused on analysis and definition of educational data so that universities can know who can be failed in
advance. In [24], authors showed that the K-mean clustering algorithm was the major type in the analysis of student
data due to finding common characteristics that enable educational institutions group to help their students for faster
performance prediction.
Figure 2: Major Clustering Methods [22]
Clustering
Techniques
Partioning
Methods
Herarchical
Methods
Agglome
rative
Copyright to IJARSET
Divisive
KMedoids
KMean
www.ijarset.com
Density
5745
ISSN: 2350-0328
International Journal of Advanced Research in Science,
Engineering and Technology
Vol. 5, Issue 5 , May 2018
Table 3 summarizes some researches that predict students performance based on clustering techniques which use kmean method along with other clustering and classification technique.
Table 3: EDM Clustering/Classification Techniques
research
[10]
#instance
Attribute
Methods
#cluster
Findings
58
Total marks
Kmean/ Hierarchical
3
Influential
Grade attendance
Clustering
for instances using k
Algorithm(single
mean.
Major
/
minor1/minor2
clustering
The model was built
linkage)
with less time using kmean(0.12 sec) while
taking (0.49 sec) using
thehieratical method
[11]
118
Grades
from
semester
III
k-mean
4
to
Present
performance
evaluation
semester VII
for
semesters
based
on
deterministic model.
[12]
10
th
12th
percentage,
Kmean
3
percentage,
Decision tree
Decision tree obtain more
accuracy than
semester marks
k-mean,
therefore
,Preferred
classification
on
the
clustering
[13]
79
Student’s scores of
k-mean
3,4,5
9 courses
Evaluate
clustering
performance
for
each
cluster number
[14]
200
Grade
(
Internal,
Quiz,
K-mean/ decision tree
3
Lab,
using
K-Means
in
clustering algorithms and
Attendance
decision tree technique to
analyze and improve the
quality
of
engineering
education
[15]
270
No. of students,
k-mean
4
Placements
.
faster
k-mean
implementat-ion
produce tighter clusters
Copyright to IJARSET
www.ijarset.com
5746
ISSN: 2350-0328
International Journal of Advanced Research in Science,
Engineering and Technology
Vol. 5, Issue 5 , May 2018
[19]
94
Gender, Category,
Grade division in
X class, Grade
division in XII
class,…..
PCA
2,3,4,5
K-MEAN
uses extraction method
PCA
to
analyze
variables and perform
ANOVA
contrast
test
the
to
group’s
means
IV. HYBRID TECHNIQUES
The major traditional/classic approaches may have certain drawbacks, to overcome weak points in elementary methods.
This section highlights the outcomes of the researches that used a combination of two or more soft computing/DM
methods. This section can be explained in different categories as follows:
A. Fuzzy Decision Tree
DT is built by splitting the nodes depending on the best features, as it influences DT accuracy, and finally interpreting
its rule. Intuitionist fuzzy decision tree classifier (FDT) uses intuitionist fuzzy entropy to compute information gain,
which is based on a soft decision tree that is an extension of ID3.In [36]authors presented a comparison between the
FDT classifier and other well-known methods (J48, Logistic Model Tree, NBTree, RandomForest,
MultilayerPerceptron, Logistic) using two criteria ROC curve and identification of instances. The obtained results
showed that the standard deviation of the accuracy of FDT method is lower than other methods. Additionally,
classification accuracy is slightly better than other methods of 0.39.
B. Genetic Decision Tree
In e-learning, the numbers of dropout students are absolutely greater than those in traditional educational institutions.
Furthermore, dropout rates depend on the e-learning system provided by the educational institution, and on the chosen
topic.Authors of [37] predicted student failure by improving decision trees (populations) size vs. accuracy using a
fitness function.Genetic algorithms generated a number of DT population sorted according to fitness value. Genetic
operators have performed on DT population result in a new population. The accuracy of the proposed method
(GATREE) performed well as compared with other methods, such as J48, MLP, Logistic regression, Naïve Bayes, 3NN.
C. Fuzzy Genetic
Fuzzy logic represents real world due to uncertainty. Authors of [38] exploited this concept through the use of fuzzy
with genetic algorithm(FGA)made classification more flexible for a student, classified as at a low level (fail). This
student could be in safe (success) by combining the attribute that has high efficiency with other attributes with low
effect to obtain more successful students. In contrast to DT that follows hard classification rules, the truth is they were
on the edge of success, this affected the degree attention of the lecturers to those students due to categorizing more
students at risk.
D. Fuzzy Probabilistic Neural Network
The probabilistic neural network is a neural network with deterministic classification and has fast building/training
time, while the fuzzy neural network has the ability to solve nonlinear and uncertain equations. In [39], the authors used
Fuzzy probabilistic Neural Network (FNN) to fix misclassification problem when input can belong to more than one
class at the same time with a certain degree of the membership. Some methods were used for predicting SAP without
considering the most relevant factors due to their ambiguity. In order to improve the obtained results, FNN can be used.
The proposed method was applied to two datasets 300, 570 students into three classes (good, average, and poor). The
correct prediction percentage for average class of the first dataset is 100%, and the processing time is about 47 mins as
compared with a typical back propagation network which takes about 120 mins.
Copyright to IJARSET
www.ijarset.com
5747
ISSN: 2350-0328
International Journal of Advanced Research in Science,
Engineering and Technology
Vol. 5, Issue 5 , May 2018
E. Adaptive Neuro-Fuzzy Inference (ANFIS)
The combination of more than one methods can deal with cons of two approaches when they are used separately.
Neuro-fuzzy inference system (NFIS) is a soft computing tool which merges the FL with the NN. In [40] three kinds of
ANFS models have been used: Gaussian, Triangular, and Generalized Bell membership functions. The preprocessing
step (normalization) was done first to the degrees of the previous exam to enhance the accuracy of SAP prediction due
to minimizing error during the learning process.The normalization was achieved by using the mean and standard
deviation of the data samples. The second step applied ANFIS to the forecasted future outcome. The result showed that
ANFIS-GbellMFhad the lowest Root Mean Square Error as compared with two other models.
V. DISCUSSION
From Table 1, we conclude that the two types of research used different characteristics to predict the level of students.
Research work of [17] was based on the evaluation of students' grades with the consideration of grades in the previous
semester marks, tests, project scores, and attendance. In contrast to the research work of [16], the authors considered
the academic and social factors of the student without adopting the previous and current grades of the students.
Therefore, the accuracy obtained from the classification was higher in [17], even though the above research worked on
two different datasets.
Research work of [35] considered single UCI dataset that consisted of attributes of two courses (Portugal
Language course and Math Course). The making of prediction was performed using three decision trees algorithms:
J48, RepTree, Hoeffding Tree. In each algorithm, different student variables were considered. The first algorithm
selected all attributes except G1Grade, G2Grade, G3Grade used as end class, Absence attribute deleted and replaced by
Absence Rate. The second algorithm was run based on all attributes except G1 and G3 with the goal student want
higher education or not. The third algorithm used all attributes. From all tests can be inferred that the accuracy affected
by two factors: the number of input attributes and the goal class attributes and the goal class attributes.
Clustering analysis is affected by some properties, such as high dimensionality, attributes, shape, size, etc. Cluster
analysis can be used when data label is not known well, thus relevant data can be grouped together and noisy data can
be excluded. Among many DM methods, prediction and clustering provide accurate results in forecasting student
performance as conducted by the work, presented in [31]. For example, the decision tree can be used for predicting
student's performance by categorizing critical courses based on social/demographic feature and marks at early stages.
Then clustering can be used for identifying attributes of outgoing students [32]. But the decision tree does not take all
the variables for the students into consideration when building the model and k-mean that has a divergence problem in
computing cluster, led to infinite iterations. Thus, the research work of [33] overrides limitation of two algorithms
using enhanced K-strange point clustering algorithm, Naïve Bayes, and multiple linear regressions.
The most important issue in predicting student’s early failure or success is to determine and identify the most
influencing factors that affect student during their study. This is done by analyzing the relationship between students
behavioral and their scores. From Table III, the authors of [34] conducted that feature of parent’s occupation has a vital
effect on prediction and ‘type of school’ does not influence student performance.
The research works, considered hybrid techniques, demonstrated that reasonable accuracy could be achieved when
some or all drawbacks at ingredients of hybrid methods are addressed. For example, using fuzzy logic with DT or with
neural networks can ride of crisp classification results (either success or failure).
It is important to note that the use of hybrid methods cannot gain the highest accuracy as explained in [41] showed that
the combination of the neural network multilayer perception, DT (C4.5), NB classifier has a lower accuracy (94.47)
than NB classifier that obtains greater accuracy (95.09) on the selected case study.
Copyright to IJARSET
www.ijarset.com
5748
ISSN: 2350-0328
International Journal of Advanced Research in Science,
Engineering and Technology
Vol. 5, Issue 5 , May 2018
VI. RECOMMENDATIONS
The existences of some issues are implicitly / explicitly stated in the processes of predicting the future achievements of
students. Making the motivation for some recommendations that can be taken when considering different EDM
techniques to have better results:
1. The algorithm can deal with the certain type of features, utilize influencing factors that affect classification
accuracy, therefore, the preprocessing steps are time-consuming and effort-intensive.
2. Feeding algorithm with students' a priori knowledge (i.e, previous semester grades, daily exams, and tests) can
improve accuracy.
3. Using an algorithm that has the ability to handle outliers.
4. Taking into consideration dataset scalability.
5. The result of the algorithm must be interpreted to understandable rules.
6. Using of methods that can deal with misclassification problem resulted from using an ambiguous feature, for
example, fuzzy reasonable.
7. Data collected manually through questionnaire is time-consuming thus student admission data in learning system
is recommended.
REFRENCES
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
Amala J.M, Swathi.S, Tharakai.R, "Data Mining– A Survey", International Journal of Advanced Research in Computer Science and
Software Engineering, Vol.6, Issue. 4, 2016.
John J., Kavya J., Paarth K., Shubha P., " Educational Data Mining Techniques and their Applications", IEEE, 20 15.
Oyerinde O. D., Chia P. A, "Predicting Students’ Academic Performances – A Learning Analytics Approach using Multiple Linear
Regression", International Journal of Computer Applications, Vol.157, No.4, 2017.
Anal A., Devadatta S., " Early Prediction of Students Performance using Machine Learning Techniques", International Journal of
Computer Applications, Vol.107, No. 1, 2014 .
Fedor D., Anders G., "Machine Learning-Based App for Self-Evaluation of Teacher-Specific Instructional Style and Tools",
www.mdpi.com/journal/education, 2018.
G.ShiyamalaGowri, Ramasamy T., and Mahindra A. B., "Educational Data Mining Application for Estimating Students Performance in
Weka Environment", ICSET, IOP Conference Series: Materials Science and Engineering, 2017.
Sadiq H., Neama A. D., FadlMutaher B. N. ,"Educational Data Mining and Analysis of Students’ Academic Performance Using
WEKA", Indonesian Journal of Electrical Engineering and Computer Science, Vol. 9, No. 2, 2018.
Ali D., Naif R. A. and Rabeeh A. ,,Miltiadis D. L., Farhat A., Jalal S. A., " Predicting Student Performance using Advanced Learning
Analytics", International World Wide Web Conference Committee (IW3C2), 2017.
Pimpa Ch.," Predicting Student Academic Achievement by Using the Decision Tree and Neural Network Techniques", Institute Press,
Vol. 12, No. 2 , ISSN: 2408-137X, 2015.
R.Ruhin kouser, J.Daphney Joann, K.Suganya, "Forecasting Student Academic Performance by Decision Tree learning using Artificial
Neural Networks", International Research Journal of Engineering and Technology (JET), ISSN: 2395 -0056, Vol.03, Issue: 12 , 2016.
Ioannis E. Liveris, Konstantina D., Panagiotis P.,"Predicting students' performance using artificial neural networks", 2012.
Richa Sh. A.l, Prof. MitulaHirenkumar P., "Survey of papers for data mining with neural networks to predict the student's academic
achievements", International Journal of Computer Science Trends and Technology (IJCST), Vol.3 Issue 5, 2015.
Shruthi P, Chaitra BP, "Student Performance Prediction in Education Sector Using Data Mining", International Journal of Advanced
Research in Computer Science and Software Engineering, 2016.
[14]
A.Dinesh Kumar, Dr.V.Radhika, "A Survey on Predicting Student Performance", (IJCSIT) International Journal of Computer Science
[15]
JyotiBansode, " Mining Educational Data to Predict Student‘s Academic Performance", International Journal on Recent and Innovation
Trends in Computing and Communication ISSN: 2321-8169, Vol.4, Issue 1, 2016.
Amjad Abu Saa, " Educational Data Mining & Students’ Performance Prediction", International Journal of Advanced Computer
Science and Applications (IJACSA), Vol. 7, No. 5, 2016.
Bevinda A. P., Anusha P., Cassandra F., "A
Comparative Analysis of Decision Tree Algorithms for Predicting Student’s Performance", Vol.7, No.4, 2017.
and Information Technologies, 2014.
[16]
[17]
Copyright to IJARSET
www.ijarset.com
5749
ISSN: 2350-0328
International Journal of Advanced Research in Science,
Engineering and Technology
Vol. 5, Issue 5 , May 2018
[18]
[19]
[20]
[21]
[22]
[23]
[24]
[25]
[26]
[27]
[28]
[29]
[30]
[31]
[32]
[33]
[34]
[35]
[36]
[37]
[38]
[39]
[40]
[41]
Bendangnu k. and Dr. Prabu P, "Students' Performance Prediction Using Deep Neural Network ", International Journal of Applied
Engineering Research, ISSN 0973-4562, Vol.13, No.2, pp. 1171-1176, 2018.
Bhrigu K., Nakin A., Sathyaraj R," Comparative Study on Marks Prediction using Data Mining and Classification Algorithms",
International Journal of Advanced Research in Computer Science, No. 0976-5697, 2017.
Hafez M., Ashraf M., "School Students' Performance Predication Using Data Mining Classification", International Journal of Advanced
Research in Computer and Communication Engineering, Vol. 6, Issue 8, 2017.
Mahendra T., Randhir S., "An Empirical Study of Applications of Data Mining Techniques for Predicting Student Performance in
Higher Education", International Journal of Computer Science and Mobile Computing, Vol. 2, Issue. 2, 2013.
Pavel B., "Survey of Clustering Data Mining Techniques", Technical Report, Accrue Software Inc., 2002.
Ashish D., Saeed A., Maizatul A. I., and Hamidreza M., "Clustering Algorithms Applied in Educational Data Mining", International
Journal of Information and Electronics Engineering, Vol. 5, No. 2, 2015.
C.Anuradha, T.Velmurugan, R. Anandavally, "Clustering
Algorithms in Educational Data Mining: a Review", International Journal of Power Control and Computation(IJPCSC), Vol 7.
No.1, Pp.47-52, 2015.
Shiwani Rana, Roopali Garg, "Evaluation of Student’s Performance of an Institute Using Clustering Algorithms", International Journal
of Applied Engineering Research, ISSN 0973-4562 Vol.11, No.5, 2016.
Rakesh Kumar Arora, Dr. Dharmendra Badal, "Evaluating Student’s Performance Using k-Means Clustering", IJCST Vol. 4, Issue 2,
2013.
Ritika S.,"Educational Data Mining: Performance Evaluation of Decision Tree and Clustering Techniques Using WEKA Platform",
International Journal of Computer Science and Business Informatics, ISSN:1694, Vol. 15, No. 2, 2015.
Oyelade, O. J, Oladipupo, O. O, Obagbuwa, I. C, "Application of k-Means Clustering algorithm for prediction of Students’ Academic
Performance", (IJCSIS) International Journal of Computer Science and Information Security, Vol. 7, No. 1, 2010.
Sreedevi K., Chandra S. P.," Analyzing the Student’s Academic Performance by using Clustering Methods in Data Mining",
International Journal of Scientific & Engineering Research, ISSN 2229-5518, Vol.5, Issue 6, 2014.
Alok S. Ch., Varun S. Ch., Narendra K. Sh.," Application of K-means Clustering Algorithm for Mining Educational Data", IJCST, Vol.
3, Issue 4, 2012
S. Padmapriya, Dr. L. Jayasimman, Dr. Nisha Jebaseeli, B. Senthil Kumar, " Mining Educational Data Using Data Mining Techniques
and Algorithms – a Review", International Research Journal of Engineering and Technology (IRJET), Vol. 02 Issue: 09 , 2015.
Abdullah A., Mohamed H., "On the application of data mining algorithms for predicting student performance: a case study",IJCSNS
International Journal of Computer Science and Network Security, VOL.17 No.10, 2017.
Snehal B., Kedar S., Purva N., Rubana Sh., Odelia D., Saylee D., "Predicting Student Performance Based on Clustering and
Classification", Journal of Computer Engineering (IOSR-JCE), ISSN: 2278-0661, Vol.19, Issue .3, 2017.
Prashant S. S., Govil M. C., "Prediction of Student’s Academic Performance using Clustering", IJANA, Special Conference Issue:
National Conference on Cloud Computing & Big Data, 2017.
Alaa Kh., "Selection of Best Decision Tree Algorithm for Prediction and Classification of Students’ Action", American International
Journal of Research in Science, Technology, Engineering & Mathematics (AIJRSTEM), 2016
Paweł B., Eulalia S., Janusz K.,"An Approach to Intuitionistic Fuzzy Decision Trees", 16th World Congress of the International Fuzzy
Systems Association (IFSA), 9th Conference of the European Society for Fuzzy Logic and Technology (EUSFLAT), 2015.
Dimitris K., Christos P., "Analyzing student performance in distance learning with genetic algorithms and decision trees", International
Journal of Applied Artificial Intelligence, Vol.20, Issue 8, 2006.
Hashmia H., Simi I., Jubilant J. K., "Student Academic Performance Prediction Model Using Decision Tree and Fuzzy Genetic
Algorithm", RAREST, 2016.
Nidhi A., Jatinder K., R. Saini,"A Fuzzy Probabilistic Neural Network for Student’s Academic Performance Prediction", International
Journal of Innovative Research in Science, Engineering and Technology, International Journal of Innovative Research in Science,
Engineering, and Technology, Vol. 2, Issue 9, 2013.
Altyeb A. and Omer B., "Prediction of Student's Academic Performance Based on Adaptive Neuro-Fuzzy Inference", IJCSNS
International Journal of Computer Science and Network Security, Vol.17 No.1, 2017.
Carol F. R., Yuliana F. Z., Daived M., Armando F, " Prediction the University Desertion through Hybridization of Classification
Algorithm", CEUR-WS.org, 2014.
Copyright to IJARSET
www.ijarset.com
5750