CHAPTER 4 Diabetes
4.1 Overview
Module 2: Pre-processing (feature selection) and splitting into train and test datasets
Pre-processing refers to the transformations applied to our data before
it is fed to the algorithm.
Data pre-processing is a technique used to convert raw data into a clean
data set. Whenever data is gathered from different sources, it arrives in a
raw format that is not suitable for analysis, so it must be cleaned first.
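The split and one common pre-processing transformation can be sketched as follows. This is a minimal illustration using only the Python standard library, not the project's actual pipeline: the feature columns (glucose, BMI, age), the sample rows, the 25% test ratio, and the choice of min-max scaling are all assumptions made for the example.

```python
import random

def split_train_test(features, labels, test_ratio=0.25, seed=0):
    """Shuffle row indices and hold out test_ratio of the rows for testing."""
    indices = list(range(len(features)))
    random.Random(seed).shuffle(indices)
    n_test = int(round(len(indices) * test_ratio))
    test_idx, train_idx = indices[:n_test], indices[n_test:]
    return ([features[i] for i in train_idx], [features[i] for i in test_idx],
            [labels[i] for i in train_idx], [labels[i] for i in test_idx])

def fit_min_max(rows):
    """Learn the minimum and maximum of each feature column."""
    columns = list(zip(*rows))
    return [min(c) for c in columns], [max(c) for c in columns]

def apply_min_max(rows, lows, highs):
    """Rescale each column with previously learned minimum and maximum values."""
    return [[(v - lo) / (hi - lo) if hi > lo else 0.0
             for v, lo, hi in zip(row, lows, highs)] for row in rows]

# Made-up rows: [glucose, bmi, age]; label 1 = diabetic, 0 = healthy.
raw_features = [[148, 33.6, 50], [85, 26.6, 31], [183, 23.3, 32], [89, 28.1, 21],
                [137, 43.1, 33], [116, 25.6, 30], [78, 31.0, 26], [197, 30.5, 53]]
raw_labels = [1, 0, 1, 0, 1, 0, 0, 1]

X_train, X_test, y_train, y_test = split_train_test(raw_features, raw_labels)

# The scaling parameters come from the training rows only, so the test rows
# do not influence the transformation that is applied to them.
lows, highs = fit_min_max(X_train)
X_train_scaled = apply_min_max(X_train, lows, highs)
X_test_scaled = apply_min_max(X_test, lows, highs)
```

Splitting before scaling keeps the test dataset unseen during pre-processing. Scaled test values can therefore fall slightly outside the range 0 to 1.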
Module 4: Prediction
“Prediction” refers to the output of an algorithm after it has been trained
on a historical dataset and applied to new data to forecast the likelihood of
a particular outcome, such as whether a patient has the disease or not.
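The idea of training on historical data and then predicting for new data can be sketched as below. The source does not name the algorithm at this point, so a 1-nearest-neighbour rule is used purely as a stand-in; the function name `predict_one` and all feature values are hypothetical.

```python
import math

def predict_one(train_features, train_labels, new_row):
    """Return the label of the training row closest to new_row (1-nearest neighbour)."""
    distances = [math.dist(row, new_row) for row in train_features]
    nearest = distances.index(min(distances))
    return train_labels[nearest]

# Historical (training) data: two scaled features per patient, made up for illustration.
train_features = [[0.9, 0.8], [0.8, 0.6], [0.2, 0.3], [0.1, 0.2]]
train_labels = [1, 1, 0, 0]  # 1 = diabetic, 0 = healthy

# New, previously unseen patients.
new_patients = [[0.88, 0.75], [0.12, 0.22]]
predictions = [predict_one(train_features, train_labels, p) for p in new_patients]
print(predictions)  # [1, 0]
```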
TP (True Positive): The number of people who actually suffer from ‘diabetes’
and were diagnosed as ‘diabetic’.
TN (True Negative): The number of people who are ‘healthy’ and were diagnosed
as ‘healthy’.
FP (False Positive): The number of people who are ‘healthy’ but were diagnosed
as ‘diabetic’.
FN (False Negative): The number of people who are ‘diabetic’ but were diagnosed
as ‘healthy’.
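The four counts can be obtained by comparing each actual label with the corresponding diagnosis. The sketch below follows the standard convention that ‘positive’ means diagnosed ‘diabetic’ (label 1) and ‘negative’ means diagnosed ‘healthy’ (label 0); the function name `confusion_counts` and the label lists are made up for illustration.

```python
def confusion_counts(actual, predicted, positive=1):
    """Count TP, TN, FP and FN by comparing actual labels with predicted ones."""
    tp = tn = fp = fn = 0
    for truth, guess in zip(actual, predicted):
        if guess == positive and truth == positive:
            tp += 1  # diabetic, diagnosed diabetic
        elif guess != positive and truth != positive:
            tn += 1  # healthy, diagnosed healthy
        elif guess == positive and truth != positive:
            fp += 1  # healthy, diagnosed diabetic
        else:
            fn += 1  # diabetic, diagnosed healthy
    return tp, tn, fp, fn

actual = [1, 1, 1, 0, 0, 0, 0, 1]     # true condition of each person
predicted = [1, 0, 1, 0, 1, 0, 0, 1]  # diagnosis produced by the classifier
tp, tn, fp, fn = confusion_counts(actual, predicted)
print(tp, tn, fp, fn)  # 3 3 1 1
```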
The performance of the classifier can be measured using the following criteria.
Sensitivity should be high (few diabetic patients are missed).
Specificity should be high (few healthy people are wrongly diagnosed as diabetic).
Accuracy should be high.
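The three criteria follow directly from the TP, TN, FP and FN counts using their standard definitions: sensitivity = TP / (TP + FN), specificity = TN / (TN + FP), and accuracy = (TP + TN) / (TP + TN + FP + FN). The sketch below computes them; the function name `classification_metrics` and the example counts are illustrative assumptions, not results from this project.

```python
def classification_metrics(tp, tn, fp, fn):
    """Compute sensitivity, specificity and accuracy from confusion-matrix counts."""
    total = tp + tn + fp + fn
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0  # share of diabetic cases detected
    specificity = tn / (tn + fp) if (tn + fp) else 0.0  # share of healthy cases recognised
    accuracy = (tp + tn) / total if total else 0.0      # share of all diagnoses that are correct
    return {"sensitivity": sensitivity, "specificity": specificity, "accuracy": accuracy}

# Made-up counts for 100 people.
metrics = classification_metrics(tp=40, tn=45, fp=5, fn=10)
print(metrics)
```

With these example counts, sensitivity is 0.8, specificity is 0.9, and accuracy is 0.85.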
Module 7.
After finding the best algorithm, we give the input to the prediction model and
obtain the corresponding output.
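This step can be sketched as follows, assuming that "best" means highest accuracy on the test dataset. The two candidate "algorithms" are hypothetical threshold rules standing in for whatever models the project compares, and their thresholds and the sample data are illustrative only.

```python
def accuracy(model, features, labels):
    """Fraction of rows for which the model's output matches the true label."""
    correct = sum(1 for row, label in zip(features, labels) if model(row) == label)
    return correct / len(labels)

# Hypothetical stand-in models operating on rows of [glucose, bmi].
def glucose_rule(row):
    return 1 if row[0] >= 126 else 0

def bmi_rule(row):
    return 1 if row[1] >= 30 else 0

candidates = {"glucose_rule": glucose_rule, "bmi_rule": bmi_rule}

# Made-up test dataset; label 1 = diabetic, 0 = healthy.
test_features = [[148, 33.6], [85, 26.6], [183, 23.3],
                 [89, 28.1], [137, 43.1], [116, 25.6]]
test_labels = [1, 0, 1, 0, 1, 0]

# Score every candidate, keep the best one, and use it on a new input.
scores = {name: accuracy(model, test_features, test_labels)
          for name, model in candidates.items()}
best_name = max(scores, key=scores.get)
best_model = candidates[best_name]

new_input = [160, 29.0]
prediction = best_model(new_input)
print(best_name, prediction)  # glucose_rule 1
```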
The following detailed architecture (workflow) diagram describes the entire
project process.
[Figure: Architecture diagram. Stages: Upload Dataset, Data Pre-Processing, Train Dataset, Test Dataset]
Conclusion