Parkinson'S Brain Disease Prediction Using Descriptive Analysis
Parkinson'S Brain Disease Prediction Using Descriptive Analysis
Parkinson'S Brain Disease Prediction Using Descriptive Analysis
DESCRIPTIVE ANALYSIS
A PROJECT REPORT
Submitted by
AISHWARYA S (211614104005)
ANUSHA S (211614104019)
BACHELOR OF ENGINEERING
IN
COMPUTER SCIENCE AND ENGINEERING
SIGNATURE SIGNATURE
Dr. P. KUMAR Mr.B.BHUVANESWARAN
Professor and Head Supervisor
Assistant Professor (SG)
Department of Computer Science and Department of Computer Science and
Engineering Engineering
Rajalakshmi Engineering College Rajalakshmi Engineering College
Chennai – 602 105 Chennai – 602 105
Big data is a term that describes the large volume of data, both structured and
unstructured, that flows into a business on a regular basis. In health care domain,
the medical data of numerous patients collected, forms Big Data. This Big Data can
be analyzed to build better health profiles and better predictive models around
individual patients so that efficient diagnosis and treatment can be provided.
Previously the analysis was difficult due to the manual collection of patient records
but due to digitization of records this process is made simple.
Existing technologies, do not take into consideration large dataset for
prediction of Parkinson’s Disease (PD). This leads to shortcomings in terms of
accuracy and execution time. The idea of the proposed system is to generate Big
Data on PD by taking into parameters that fall within specified ranges which is
used as input. The data is preprocessed to eliminate records with missing values
and outliers. This preprocessed data is given as an input to the classifier consisting
of DecisionTree (DT), Artificial Neural Network (ANN) and Support Vector
Machine (SVM) which compares the accuracy and execution time.
iii
TABLE OF CONTENTS
ABSTRACT ................................................................................................................................... iii
LIST OF TABLES ...................................................................................................................... vi
REFERENCES ……………………………………………………………...52
APPENDIX 1…………………………………………………………………………………….53
APPENDIX 2…………………………………………………………………………………….59
LIST OF TABLES
5.1 Values in each of the attribute that decides the range for populating
the dataset. ............................. 21
5.2 Conditions of combinations of the attributes that help in populating
the dataset. ............................. 22
6.1 Comparison of accuracy and execution time of ANN algorithm. . 47
6.2 Comparison of accuracy and execution time of DT algorithm. . . . 48
6.3 Comparison of accuracy and execution time of SVM algorithm. . 50
4
LIST OF FIGURES
3.1 Competent Predictor of Parkinson’s Disease(CPPD) . . . . . . . . 15
5.1 The Generated Dataset ....................... 24
5.2 Dataset with Outliers . . . . . . . . . . . . . . . . . . . . . . . . 25
5.3 Dataset without Outliers ...................... 26
5.4 Execution Time and Accuracy of Decision Tree . . . . . . . . . . 36
5.5 Execution Time and Accuracy of ANN . . . . . . . . . . . . . . . 40
5.6 Execution Time and Accuracy of SVM . . . . . . . . . . . . . . . 44
6.1 Accuracy and execution time before Preprocessing for
ANN Algorithm ………………………………………………. 46
6.2 Accuracy and execution time after Preprocessing for ANN 46
Algorithm…………………………………………………….
6.3 Accuracy and execution time before Preprocessing for DT 47
Algorithm………………………………………………………..
6.4 Accuracy and execution time after Preprocessing for DT 48
algorithm ……………………………………………………….
6.5 Accuracy and execution time before Preprocessing for SVM 49
Algorithm……………………………………………………….
6.6 Accuracy and execution time after Preprocessing for SVM 49
Algorithm………………………………………………………