Academia.eduAcademia.edu

Studying of Educational Data Mining Techniques

2018

The Electronic (E)-Learning attracts the attention of researchers in the recent years. This for different reasons, such as easing the board studying and guarantee the education for busy people. Different methods and algorithm have been adopted in e-learning systems to offer more flexible services for students. In addition, the recent smart systems consider the prediction strategies for expecting the logical results of different categories in e-learning. The researcher goes further with decision making for students, presented as a recommendation for each type of classification. Moreover, the e-learning systems use the classification and clustering methods for classifying the investigated dataset. In this paper, a comprehensive study of the recent e-learning decision making and prediction is presented. It offers a wide information regarding the subject of decision making and prediction in e-learning that can improve them efficiently. Discussion and recommendations have been included in this paper.

ISSN: 2350-0328 International Journal of Advanced Research in Science, Engineering and Technology Vol. 5, Issue 5 , May 2018 Studying of Educational Data Mining Techniques Saja Taha Ahmed Prof. Dr. Rafah shihab Al-hamdani Dr. Muayad Sadik Croock Informatics Institute of Postgraduate studies,University Of Information Technology and Communication, Baghdad, Iraq Informatics Institute of Postgraduate studies University Of Information Technology and Communication, Baghdad, Iraq Computer Engineering Department,University of Technology ,Baghdad, Iraq ABSTRACT: The Electronic (E) - Learning attracts the attention of researchers in the recent years. This for different reasons, such as easing the board studying and guarantee the education for busy people. Different methods and algorithm have been adopted in e-learning systems to offer more flexible services for students. In addition, the recent smart systems consider the prediction strategies for expecting the logical results of different categories in e-learning. The researcher goes further with decision making for students, presented as a recommendation for each type of classification. Moreover, the e-learning systems use the classification and clustering methods for classifying the investigated dataset. In this paper, a comprehensive study of the recent e-learning decision making and prediction is presented. It offers a wide information regarding the subject of decision making and prediction in e-learning that can improve them efficiently. Discussion and recommendations have been included in this paper. KEYWORDS: Educational Data Mining; Student Performance Prediction; Hybrid Technique. 1. INTRODUCTION Data mining (DM) is defined as Knowledge Discovery in Databases (KDD), used for discovering hidden information from different data sources, multiple data mining techniques can be applied in various fields including marketing, business, medical, etc [1]. In recent years there was an intense attention in the field of educational data mining (EDM) as a result of the development of e-learning systems, the massive volume of educational data, and even evolving traditional education. Educational data mining is a subdomain from data mining which has been developed to extract and analyze new information from educational data sources [2]. Early Student academic performance (SAP) prediction can support educational institutes, universities and schools in improving their education quality by forecasting student level activities, student' subject interests, factors (economic, educational, behavioral, social, etc.) that affect student performance and identifying dropout points [3]. For the purpose of early notifying student activity level, machine learning utilized along with EDM. This can improve the acquisition of knowledge or skills on how to treat weak points at student's achievement. In the future, machine learning techniques can be considered as an integrated part of any educational institution instead of manual investigation in related data which has great effort and time-consuming [4,5]. There are numerous machine learning and DM techniques that can be categorized into two types; either traditional DM methods and compare accuracy among them or a combination of two or more methods (hybrid) to improve accuracy and overcome drawbacks of a single approach. This paper aims to illustrate and study the main prediction techniques, in addition, to discuss their findings, (pros& cons) in case of presenting, state on which features classification is performed and suggest some recommended beneficial to the learning process. II. LITERATURE REVIEW Several research studies have dealt with the prediction of students’ performance in different levels. In the present literature, a number of complementary approaches are offered with taking into consideration distinct sizes and sources of datasets, as well as factors affecting student performance assessment. Copyright to IJARSET www.ijarset.com 5742 ISSN: 2350-0328 International Journal of Advanced Research in Science, Engineering and Technology Vol. 5, Issue 5 , May 2018 In [6], the authors utilized Apriori algorithm to extract similar patterns come from a different set of records. Additionally, they employed the k-means cluster analysis to generate a group of students based on their personal and academic fields.This study gave greater priority to psychological attributes than an academic one. Students were grouped into two clusters good and poor.Dataset consisted of 100 students using 45 parameters of Government Higher Secondary School and prediction was satisfactory. The authors of [7] performed the preprocessing to the data that was collected from three different colleges of 300 students with 24 attributes. After employing the data cleaning by removing incomplete data, 22 features were selected. Four types of classifier J48, PART, BayesNet and Random Forest were applied. Based on accuracy and classification errors, a confusion matrix was found to show that Random Forest was more suitable than other classification methods, Apriori algorithm also executed to extract association rule among all attributes and find the best-suited rules. Another study of predicting the performance of students in [8] introduced two new features family expenditures and student personal information. The authors analyzed these feature with other feature to determine the impact of the selected attribute. Two generative (Bayes Network (BN), Naive Bayes (NB)) and three discriminative (SVM, C4.5, CART), SVM was the most effective for finding the new feature among other methods. In addition, soft computing methods played a major role in EDM. They were applied as a single or hybrid with other methods. The authors of [9,10,11,12] analyzed student's academic level using the Neural Network (NN) trained by decision tree learning or applied various types of NN to classify the students into the predicted success and failure categories. III. TRADITIONAL DM TECHNIQUES The Prediction can be applied using DM that consider the following techniques:  classification.  clustering.  association rule mining.  Sequential and Pattern prediction. Classification is the most popular methods used for SAP prediction in EDM [13][14]. This paper preview the most frequently used methods applied for forecasting SAP, including classification and clustering. a. CLASSIFICATION Classification is a process of describing a model (or function) that distinguishes data classes. The model is built based on data objects for which the class labels are known. After that, the model is used to predict the class label of unknown label objects [13]. Classification is supervised learning method. It consists of two steps [15]:  Model construction by analyzing the training data.  Test data used for model evaluation. The most popular method for data classification is the decision tree that includes many types. Decision Tree A decision tree is a tree-like structure, which starts from root attributes, each branch of the internal node being a possible value of the attribute and ends with leaf nodes (class's names). Decision tree classifiers produce simple way used to interpret the classification rules [9]. Many forms of decision tree search related student personal and social factors assumed to influence students’ academic performance. Four kinds of decision tree classifiers: ID3, C4.5, CART, and CHAID have been used on a dataset of 270 records collected through classes and Google forms. This research found that the performance of students does not depend entirely on academic factors, although there are other factors that have the same or greater impact on the level of performance students [16]. Next research of [17] was used the same DT types on a different dataset. Predicted student performance can be found by comparing between different decision trees classifiers. These classifiers are built based on various attributes belong Copyright to IJARSET www.ijarset.com 5743 ISSN: 2350-0328 International Journal of Advanced Research in Science, Engineering and Technology Vol. 5, Issue 5 , May 2018 to 466 students from an engineering college. The accuracy of each classifier was computed and it was shown that CHAID has the highest accuracy of 76.11%, C4.5 with 73.13%. Table 1. shows accuracy results for four decision tree types, dataset sizes, and attributes used for prediction in each research. TABLE 1: DECISION TREE TYPE'S ACCURACY RESEARCH Attributes [16] Datasets size(record) ID3 C4.5 CART CHAID 270 33.33%. 35.19%. 40%. 34.07%. 466 70.15% 73.13% 71.64% 76.11% Academic and social features [17] Previous Semester Marks Internal Tests Grades Project Marks Attendance Final Semester Marks (Semester 8) Addition to that, the numerous forms of the decision tree can be compared with a different classifier (Neural Network, Naïve Bayes, and KNN) in term of their accuracy. Table 2 shows the decision tree accuracy as compared to another classifier. It can be said that decision tree has higher accuracy than other methods in investigated papers based on the various attribute. Table 2: Comparison accuracy of Decision Tree with other classification methods Research [20] Methods accuracy J48 82.2% Neural Network [21] [22] Attribute Gender, Nationality, Place of Birth, Stages, 80.0% J48 75.8333 % Random Forest 76.6667 % Naive Bayes Naive 67.7083 % Bayes Multinomial 43.9583 % DT 92.96% Level, Orphan, SHC, BirthYear, Camp,FatherWork, Naïve Bayes 91.79 FailYears, PrevYear, first term, FinalResult KNN 88.86% Copyright to IJARSET Gender, Nationality, Place of birth… www.ijarset.com 5744 ISSN: 2350-0328 International Journal of Advanced Research in Science, Engineering and Technology Vol. 5, Issue 5 , May 2018 Performance accuracy for each classification method for researches explained in Table 2 in recent two years (20172018) can be visualized in Figure 1. Figure 1: DM Classification Technique's highest accuracy in 2017-2018 95 90 85 80 75 70 DT NN K-NN Naïve Bayes B. Clustering Clustering is an unsupervised method that divides data into distinct categories, in which the data in the same set have similar properties and differ from other data in other aggregates. Clustering has an important role in data mining applications, for example, marketing, text mining, medical diagnostics, and many others [21]. There are various clustering techniques can be utilized in data mining. In [22], the authors provided a comprehensive review of different clustering techniques in data mining. Figure II states the main categorization of clustering approach in DM. K-mean Several research studies were reviewed EDM with various clustering methods. The authors of [23] presented EDM clustering methods in an overall review by stating objective of presented paper, algorithm, and datasets. Most of the goals are focused on analysis and definition of educational data so that universities can know who can be failed in advance. In [24], authors showed that the K-mean clustering algorithm was the major type in the analysis of student data due to finding common characteristics that enable educational institutions group to help their students for faster performance prediction. Figure 2: Major Clustering Methods [22] Clustering Techniques Partioning Methods Herarchical Methods Agglome rative Copyright to IJARSET Divisive KMedoids KMean www.ijarset.com Density 5745 ISSN: 2350-0328 International Journal of Advanced Research in Science, Engineering and Technology Vol. 5, Issue 5 , May 2018 Table 3 summarizes some researches that predict students performance based on clustering techniques which use kmean method along with other clustering and classification technique. Table 3: EDM Clustering/Classification Techniques research [10] #instance Attribute Methods #cluster Findings 58 Total marks Kmean/ Hierarchical 3  Influential Grade attendance Clustering for instances using k Algorithm(single mean. Major / minor1/minor2 clustering  The model was built linkage) with less time using kmean(0.12 sec) while taking (0.49 sec) using thehieratical method [11] 118 Grades from semester III k-mean 4 to  Present performance evaluation semester VII for semesters based on deterministic model. [12] 10 th 12th percentage, Kmean 3 percentage, Decision tree Decision tree obtain more accuracy than semester marks k-mean, therefore ,Preferred classification on the clustering [13] 79 Student’s scores of k-mean 3,4,5 9 courses Evaluate clustering performance for each cluster number [14] 200 Grade ( Internal, Quiz, K-mean/ decision tree 3 Lab, using K-Means in clustering algorithms and Attendance decision tree technique to analyze and improve the quality of engineering education [15] 270 No. of students, k-mean 4 Placements . faster k-mean implementat-ion  produce tighter clusters Copyright to IJARSET www.ijarset.com 5746 ISSN: 2350-0328 International Journal of Advanced Research in Science, Engineering and Technology Vol. 5, Issue 5 , May 2018 [19] 94 Gender, Category, Grade division in X class, Grade division in XII class,….. PCA 2,3,4,5 K-MEAN  uses extraction method PCA to analyze variables and perform ANOVA contrast test the to group’s means IV. HYBRID TECHNIQUES The major traditional/classic approaches may have certain drawbacks, to overcome weak points in elementary methods. This section highlights the outcomes of the researches that used a combination of two or more soft computing/DM methods. This section can be explained in different categories as follows: A. Fuzzy Decision Tree DT is built by splitting the nodes depending on the best features, as it influences DT accuracy, and finally interpreting its rule. Intuitionist fuzzy decision tree classifier (FDT) uses intuitionist fuzzy entropy to compute information gain, which is based on a soft decision tree that is an extension of ID3.In [36]authors presented a comparison between the FDT classifier and other well-known methods (J48, Logistic Model Tree, NBTree, RandomForest, MultilayerPerceptron, Logistic) using two criteria ROC curve and identification of instances. The obtained results showed that the standard deviation of the accuracy of FDT method is lower than other methods. Additionally, classification accuracy is slightly better than other methods of 0.39. B. Genetic Decision Tree In e-learning, the numbers of dropout students are absolutely greater than those in traditional educational institutions. Furthermore, dropout rates depend on the e-learning system provided by the educational institution, and on the chosen topic.Authors of [37] predicted student failure by improving decision trees (populations) size vs. accuracy using a fitness function.Genetic algorithms generated a number of DT population sorted according to fitness value. Genetic operators have performed on DT population result in a new population. The accuracy of the proposed method (GATREE) performed well as compared with other methods, such as J48, MLP, Logistic regression, Naïve Bayes, 3NN. C. Fuzzy Genetic Fuzzy logic represents real world due to uncertainty. Authors of [38] exploited this concept through the use of fuzzy with genetic algorithm(FGA)made classification more flexible for a student, classified as at a low level (fail). This student could be in safe (success) by combining the attribute that has high efficiency with other attributes with low effect to obtain more successful students. In contrast to DT that follows hard classification rules, the truth is they were on the edge of success, this affected the degree attention of the lecturers to those students due to categorizing more students at risk. D. Fuzzy Probabilistic Neural Network The probabilistic neural network is a neural network with deterministic classification and has fast building/training time, while the fuzzy neural network has the ability to solve nonlinear and uncertain equations. In [39], the authors used Fuzzy probabilistic Neural Network (FNN) to fix misclassification problem when input can belong to more than one class at the same time with a certain degree of the membership. Some methods were used for predicting SAP without considering the most relevant factors due to their ambiguity. In order to improve the obtained results, FNN can be used. The proposed method was applied to two datasets 300, 570 students into three classes (good, average, and poor). The correct prediction percentage for average class of the first dataset is 100%, and the processing time is about 47 mins as compared with a typical back propagation network which takes about 120 mins. Copyright to IJARSET www.ijarset.com 5747 ISSN: 2350-0328 International Journal of Advanced Research in Science, Engineering and Technology Vol. 5, Issue 5 , May 2018 E. Adaptive Neuro-Fuzzy Inference (ANFIS) The combination of more than one methods can deal with cons of two approaches when they are used separately. Neuro-fuzzy inference system (NFIS) is a soft computing tool which merges the FL with the NN. In [40] three kinds of ANFS models have been used: Gaussian, Triangular, and Generalized Bell membership functions. The preprocessing step (normalization) was done first to the degrees of the previous exam to enhance the accuracy of SAP prediction due to minimizing error during the learning process.The normalization was achieved by using the mean and standard deviation of the data samples. The second step applied ANFIS to the forecasted future outcome. The result showed that ANFIS-GbellMFhad the lowest Root Mean Square Error as compared with two other models. V. DISCUSSION From Table 1, we conclude that the two types of research used different characteristics to predict the level of students. Research work of [17] was based on the evaluation of students' grades with the consideration of grades in the previous semester marks, tests, project scores, and attendance. In contrast to the research work of [16], the authors considered the academic and social factors of the student without adopting the previous and current grades of the students. Therefore, the accuracy obtained from the classification was higher in [17], even though the above research worked on two different datasets. Research work of [35] considered single UCI dataset that consisted of attributes of two courses (Portugal Language course and Math Course). The making of prediction was performed using three decision trees algorithms: J48, RepTree, Hoeffding Tree. In each algorithm, different student variables were considered. The first algorithm selected all attributes except G1Grade, G2Grade, G3Grade used as end class, Absence attribute deleted and replaced by Absence Rate. The second algorithm was run based on all attributes except G1 and G3 with the goal student want higher education or not. The third algorithm used all attributes. From all tests can be inferred that the accuracy affected by two factors: the number of input attributes and the goal class attributes and the goal class attributes. Clustering analysis is affected by some properties, such as high dimensionality, attributes, shape, size, etc. Cluster analysis can be used when data label is not known well, thus relevant data can be grouped together and noisy data can be excluded. Among many DM methods, prediction and clustering provide accurate results in forecasting student performance as conducted by the work, presented in [31]. For example, the decision tree can be used for predicting student's performance by categorizing critical courses based on social/demographic feature and marks at early stages. Then clustering can be used for identifying attributes of outgoing students [32]. But the decision tree does not take all the variables for the students into consideration when building the model and k-mean that has a divergence problem in computing cluster, led to infinite iterations. Thus, the research work of [33] overrides limitation of two algorithms using enhanced K-strange point clustering algorithm, Naïve Bayes, and multiple linear regressions. The most important issue in predicting student’s early failure or success is to determine and identify the most influencing factors that affect student during their study. This is done by analyzing the relationship between students behavioral and their scores. From Table III, the authors of [34] conducted that feature of parent’s occupation has a vital effect on prediction and ‘type of school’ does not influence student performance. The research works, considered hybrid techniques, demonstrated that reasonable accuracy could be achieved when some or all drawbacks at ingredients of hybrid methods are addressed. For example, using fuzzy logic with DT or with neural networks can ride of crisp classification results (either success or failure). It is important to note that the use of hybrid methods cannot gain the highest accuracy as explained in [41] showed that the combination of the neural network multilayer perception, DT (C4.5), NB classifier has a lower accuracy (94.47) than NB classifier that obtains greater accuracy (95.09) on the selected case study. Copyright to IJARSET www.ijarset.com 5748 ISSN: 2350-0328 International Journal of Advanced Research in Science, Engineering and Technology Vol. 5, Issue 5 , May 2018 VI. RECOMMENDATIONS The existences of some issues are implicitly / explicitly stated in the processes of predicting the future achievements of students. Making the motivation for some recommendations that can be taken when considering different EDM techniques to have better results: 1. The algorithm can deal with the certain type of features, utilize influencing factors that affect classification accuracy, therefore, the preprocessing steps are time-consuming and effort-intensive. 2. Feeding algorithm with students' a priori knowledge (i.e, previous semester grades, daily exams, and tests) can improve accuracy. 3. Using an algorithm that has the ability to handle outliers. 4. Taking into consideration dataset scalability. 5. The result of the algorithm must be interpreted to understandable rules. 6. Using of methods that can deal with misclassification problem resulted from using an ambiguous feature, for example, fuzzy reasonable. 7. Data collected manually through questionnaire is time-consuming thus student admission data in learning system is recommended. REFRENCES [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] Amala J.M, Swathi.S, Tharakai.R, "Data Mining– A Survey", International Journal of Advanced Research in Computer Science and Software Engineering, Vol.6, Issue. 4, 2016. John J., Kavya J., Paarth K., Shubha P., " Educational Data Mining Techniques and their Applications", IEEE, 20 15. Oyerinde O. D., Chia P. A, "Predicting Students’ Academic Performances – A Learning Analytics Approach using Multiple Linear Regression", International Journal of Computer Applications, Vol.157, No.4, 2017. Anal A., Devadatta S., " Early Prediction of Students Performance using Machine Learning Techniques", International Journal of Computer Applications, Vol.107, No. 1, 2014 . Fedor D., Anders G., "Machine Learning-Based App for Self-Evaluation of Teacher-Specific Instructional Style and Tools", www.mdpi.com/journal/education, 2018. G.ShiyamalaGowri, Ramasamy T., and Mahindra A. B., "Educational Data Mining Application for Estimating Students Performance in Weka Environment", ICSET, IOP Conference Series: Materials Science and Engineering, 2017. Sadiq H., Neama A. D., FadlMutaher B. N. ,"Educational Data Mining and Analysis of Students’ Academic Performance Using WEKA", Indonesian Journal of Electrical Engineering and Computer Science, Vol. 9, No. 2, 2018. Ali D., Naif R. A. and Rabeeh A. ,,Miltiadis D. L., Farhat A., Jalal S. A., " Predicting Student Performance using Advanced Learning Analytics", International World Wide Web Conference Committee (IW3C2), 2017. Pimpa Ch.," Predicting Student Academic Achievement by Using the Decision Tree and Neural Network Techniques", Institute Press, Vol. 12, No. 2 , ISSN: 2408-137X, 2015. R.Ruhin kouser, J.Daphney Joann, K.Suganya, "Forecasting Student Academic Performance by Decision Tree learning using Artificial Neural Networks", International Research Journal of Engineering and Technology (JET), ISSN: 2395 -0056, Vol.03, Issue: 12 , 2016. Ioannis E. Liveris, Konstantina D., Panagiotis P.,"Predicting students' performance using artificial neural networks", 2012. Richa Sh. A.l, Prof. MitulaHirenkumar P., "Survey of papers for data mining with neural networks to predict the student's academic achievements", International Journal of Computer Science Trends and Technology (IJCST), Vol.3 Issue 5, 2015. Shruthi P, Chaitra BP, "Student Performance Prediction in Education Sector Using Data Mining", International Journal of Advanced Research in Computer Science and Software Engineering, 2016. [14] A.Dinesh Kumar, Dr.V.Radhika, "A Survey on Predicting Student Performance", (IJCSIT) International Journal of Computer Science [15] JyotiBansode, " Mining Educational Data to Predict Student‘s Academic Performance", International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169, Vol.4, Issue 1, 2016. Amjad Abu Saa, " Educational Data Mining & Students’ Performance Prediction", International Journal of Advanced Computer Science and Applications (IJACSA), Vol. 7, No. 5, 2016. Bevinda A. P., Anusha P., Cassandra F., "A Comparative Analysis of Decision Tree Algorithms for Predicting Student’s Performance", Vol.7, No.4, 2017. and Information Technologies, 2014. [16] [17] Copyright to IJARSET www.ijarset.com 5749 ISSN: 2350-0328 International Journal of Advanced Research in Science, Engineering and Technology Vol. 5, Issue 5 , May 2018 [18] [19] [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] [31] [32] [33] [34] [35] [36] [37] [38] [39] [40] [41] Bendangnu k. and Dr. Prabu P, "Students' Performance Prediction Using Deep Neural Network ", International Journal of Applied Engineering Research, ISSN 0973-4562, Vol.13, No.2, pp. 1171-1176, 2018. Bhrigu K., Nakin A., Sathyaraj R," Comparative Study on Marks Prediction using Data Mining and Classification Algorithms", International Journal of Advanced Research in Computer Science, No. 0976-5697, 2017. Hafez M., Ashraf M., "School Students' Performance Predication Using Data Mining Classification", International Journal of Advanced Research in Computer and Communication Engineering, Vol. 6, Issue 8, 2017. Mahendra T., Randhir S., "An Empirical Study of Applications of Data Mining Techniques for Predicting Student Performance in Higher Education", International Journal of Computer Science and Mobile Computing, Vol. 2, Issue. 2, 2013. Pavel B., "Survey of Clustering Data Mining Techniques", Technical Report, Accrue Software Inc., 2002. Ashish D., Saeed A., Maizatul A. I., and Hamidreza M., "Clustering Algorithms Applied in Educational Data Mining", International Journal of Information and Electronics Engineering, Vol. 5, No. 2, 2015. C.Anuradha, T.Velmurugan, R. Anandavally, "Clustering Algorithms in Educational Data Mining: a Review", International Journal of Power Control and Computation(IJPCSC), Vol 7. No.1, Pp.47-52, 2015. Shiwani Rana, Roopali Garg, "Evaluation of Student’s Performance of an Institute Using Clustering Algorithms", International Journal of Applied Engineering Research, ISSN 0973-4562 Vol.11, No.5, 2016. Rakesh Kumar Arora, Dr. Dharmendra Badal, "Evaluating Student’s Performance Using k-Means Clustering", IJCST Vol. 4, Issue 2, 2013. Ritika S.,"Educational Data Mining: Performance Evaluation of Decision Tree and Clustering Techniques Using WEKA Platform", International Journal of Computer Science and Business Informatics, ISSN:1694, Vol. 15, No. 2, 2015. Oyelade, O. J, Oladipupo, O. O, Obagbuwa, I. C, "Application of k-Means Clustering algorithm for prediction of Students’ Academic Performance", (IJCSIS) International Journal of Computer Science and Information Security, Vol. 7, No. 1, 2010. Sreedevi K., Chandra S. P.," Analyzing the Student’s Academic Performance by using Clustering Methods in Data Mining", International Journal of Scientific & Engineering Research, ISSN 2229-5518, Vol.5, Issue 6, 2014. Alok S. Ch., Varun S. Ch., Narendra K. Sh.," Application of K-means Clustering Algorithm for Mining Educational Data", IJCST, Vol. 3, Issue 4, 2012 S. Padmapriya, Dr. L. Jayasimman, Dr. Nisha Jebaseeli, B. Senthil Kumar, " Mining Educational Data Using Data Mining Techniques and Algorithms – a Review", International Research Journal of Engineering and Technology (IRJET), Vol. 02 Issue: 09 , 2015. Abdullah A., Mohamed H., "On the application of data mining algorithms for predicting student performance: a case study",IJCSNS International Journal of Computer Science and Network Security, VOL.17 No.10, 2017. Snehal B., Kedar S., Purva N., Rubana Sh., Odelia D., Saylee D., "Predicting Student Performance Based on Clustering and Classification", Journal of Computer Engineering (IOSR-JCE), ISSN: 2278-0661, Vol.19, Issue .3, 2017. Prashant S. S., Govil M. C., "Prediction of Student’s Academic Performance using Clustering", IJANA, Special Conference Issue: National Conference on Cloud Computing & Big Data, 2017. Alaa Kh., "Selection of Best Decision Tree Algorithm for Prediction and Classification of Students’ Action", American International Journal of Research in Science, Technology, Engineering & Mathematics (AIJRSTEM), 2016 Paweł B., Eulalia S., Janusz K.,"An Approach to Intuitionistic Fuzzy Decision Trees", 16th World Congress of the International Fuzzy Systems Association (IFSA), 9th Conference of the European Society for Fuzzy Logic and Technology (EUSFLAT), 2015. Dimitris K., Christos P., "Analyzing student performance in distance learning with genetic algorithms and decision trees", International Journal of Applied Artificial Intelligence, Vol.20, Issue 8, 2006. Hashmia H., Simi I., Jubilant J. K., "Student Academic Performance Prediction Model Using Decision Tree and Fuzzy Genetic Algorithm", RAREST, 2016. Nidhi A., Jatinder K., R. Saini,"A Fuzzy Probabilistic Neural Network for Student’s Academic Performance Prediction", International Journal of Innovative Research in Science, Engineering and Technology, International Journal of Innovative Research in Science, Engineering, and Technology, Vol. 2, Issue 9, 2013. Altyeb A. and Omer B., "Prediction of Student's Academic Performance Based on Adaptive Neuro-Fuzzy Inference", IJCSNS International Journal of Computer Science and Network Security, Vol.17 No.1, 2017. Carol F. R., Yuliana F. Z., Daived M., Armando F, " Prediction the University Desertion through Hybridization of Classification Algorithm", CEUR-WS.org, 2014. Copyright to IJARSET www.ijarset.com 5750