Diabetes Prediction Model
https://doi.org/10.22214/ijraset.2022.45503
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VII July 2022- Available at www.ijraset.com
Abstract: The harm that diabetes causes to the human body is widely known. In today's world, with its pollutants and adulterated
products, even slight carelessness in maintaining a healthy lifestyle can lead to serious diseases and disorders. Although medical
science has advanced and treatments for diabetes exist, detection of the disease in a patient is still slow.
This study proposes a system that uses machine learning to predict whether a person has diabetes.
The project uses a Logistic Regression model for the prediction.
Keywords: Diabetes, human body, adulterations, lifestyle, diseases, Machine learning, Logistic Regression, Prediction.
I. INTRODUCTION
Diabetes is a chronic health disorder that affects the body's natural process of converting food into energy. The body produces a
natural hormone called insulin, which moves sugar from the blood into the cells, where it is stored or used for energy. In diabetes,
the body either does not produce enough insulin or cannot use the insulin it produces effectively.
With today's fast-paced lifestyles, the number of people affected by diabetes is rising rapidly, and most diabetics know little
about the risk factors they faced before detection.
Over the past 30 years of overall development, the number of diabetics has also risen visibly. People have slowly begun to realize
how deeply diabetes affects a person's health and everyday life. The proportion of diabetics in the general population shows a
steady upward trend, and the growth rate among males is visibly higher than among females, as shown in Fig. 1. China has the
largest diabetic population in the world, followed by the United States and India.
To lower morbidity and reduce the impact of diabetes effectively, we need to focus on the high-risk groups.
According to WHO standards, the following groups are at high risk of diabetes [4]:
Age ≥ 45 with infrequent exercise
BMI ≥ 24 kg/m²
Family history of diabetes mellitus (DM)
Hypertension, or cardiovascular or cerebrovascular disease
Pregnant women aged ≥ 30
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 4158
As we learn more about the disease, it becomes clear that diabetes is incurable. With the help of modern medicine, however, it is
manageable and can be well controlled with regular treatment. Treatment is most effective when diabetes is detected at an early
stage.
To avoid or reduce its critical impact, there is an urgent need for a system that detects the presence of diabetes at low cost and
with good performance.
The proposed system aims to meet this need. It uses Logistic Regression, a machine learning method, to build the model that
predicts the presence of diabetes.
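To make the method concrete, the following is a minimal sketch of Logistic Regression trained by batch gradient descent. It is an illustration under stated assumptions, not the implementation used in this study: the function names, the learning rate, the number of epochs and the toy data are all hypothetical.

```python
import math

def sigmoid(z):
    """Logistic function: maps a real-valued score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def train_logistic_regression(X, y, lr=0.1, epochs=2000):
    """Fit weights and bias by batch gradient descent on the log loss.

    lr and epochs are illustrative defaults, not values from the paper.
    """
    n_samples = len(X)
    n_features = len(X[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for xi, yi in zip(X, y):
            p = sigmoid(sum(w * v for w, v in zip(weights, xi)) + bias)
            error = p - yi  # derivative of the log loss w.r.t. the score
            for j, v in enumerate(xi):
                grad_w[j] += error * v
            grad_b += error
        weights = [w - lr * g / n_samples for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n_samples
    return weights, bias

def predict(X, weights, bias, threshold=0.5):
    """Return 1 (diabetic) when the predicted probability reaches the threshold."""
    return [
        int(sigmoid(sum(w * v for w, v in zip(weights, xi)) + bias) >= threshold)
        for xi in X
    ]

# Hypothetical, already-scaled toy features (two columns per patient).
X_toy = [[-1.2, -0.8], [-0.9, -1.1], [-0.3, -0.5],
         [0.4, 0.6], [1.0, 0.9], [1.3, 1.2]]
y_toy = [0, 0, 0, 1, 1, 1]  # 1 = diabetic, 0 = non-diabetic

weights, bias = train_logistic_regression(X_toy, y_toy)
predictions = predict(X_toy, weights, bias)
```

In practice a library implementation would normally be preferred; the hand-written loop is shown only to make the weighted sum, the sigmoid and the decision threshold visible.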
A. Data Pre-processing
Zero or null values in the features of the dataset must be located first. Apart from the Pregnancies feature, no feature in the
dataset can validly be zero, so a zero in any other column is treated as a missing value.
Such zero values are replaced by the mean value of the corresponding feature column. This step is essential for improving
accuracy, because incorrect values increase the chance of a faulty prediction.
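The replacement step can be sketched as follows. This is a minimal illustration rather than the code used in the study: the records, the column names other than Pregnancies, and the helper name `impute_zeros_with_mean` are hypothetical. It also assumes that the mean is taken over the non-zero entries of each column, which the text does not specify.

```python
from statistics import mean

# Hypothetical toy records; only the Pregnancies column is named in the text.
rows = [
    {"Pregnancies": 0, "Glucose": 148, "BMI": 33.6},
    {"Pregnancies": 1, "Glucose": 0, "BMI": 26.6},
    {"Pregnancies": 8, "Glucose": 183, "BMI": 0.0},
    {"Pregnancies": 0, "Glucose": 89, "BMI": 28.1},
]

def impute_zeros_with_mean(records, skip=("Pregnancies",)):
    """Return a copy of records with zeros replaced by the column mean.

    Columns listed in skip may legitimately be zero and are left alone.
    Assumption: the mean is computed over the non-zero entries only.
    """
    cleaned = [dict(r) for r in records]  # leave the input untouched
    columns = [c for c in records[0] if c not in skip]
    for col in columns:
        valid = [r[col] for r in records if r[col] != 0]
        if not valid:
            continue  # nothing to average; leave the column as it is
        col_mean = mean(valid)
        for r in cleaned:
            if r[col] == 0:
                r[col] = col_mean
    return cleaned

cleaned_rows = impute_zeros_with_mean(rows)
```

Averaging over the non-zero entries keeps the placeholder zeros from pulling the replacement value down; if the mean over all entries is intended instead, only the `valid` list changes.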
V. RESULTS
Analysis of the dataset showed that 34.9% of the patients are diabetic and the remaining 65.1% are not. Feature-wise comparison
results were also obtained.
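A class split of this kind can be computed from the outcome column as sketched below. The label encoding and the counts are assumptions made for illustration: the text reports only the percentages, and the counts 268 and 500 were chosen here so that the proportions match them.

```python
from collections import Counter

# Hypothetical outcome column: 1 = diabetic, 0 = non-diabetic.
# The counts are illustrative; the text states only the percentages.
labels = [1] * 268 + [0] * 500

def class_percentages(values):
    """Return the share of each class as a percentage, rounded to one decimal."""
    counts = Counter(values)
    total = len(values)
    return {label: round(100 * n / total, 1) for label, n in counts.items()}

percentages = class_percentages(labels)
print(percentages)
```

Checking the class split before training is useful because an imbalance of roughly one to two means that plain accuracy should be read with some care.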
VI. CONCLUSION
Logistic Regression has proved to be one of the most effective algorithms for building a predictive model for diabetes.
The study also shows that the choice of algorithm is not the only requirement for higher accuracy; other factors matter as well.
These include data pre-processing, removal and replacement of null values, and training and testing on the data, among others.
Diseases detected at an earlier stage can be treated more easily and effectively [3]. The proposed system serves a pressing need
in many developing regions. It addresses one of the crucial problems by offering efficient, quick and more accurate prediction
than other algorithms, and the model is ready to process and adapt to new datasets. The system can serve as a platform for
intelligent, real-time prediction over larger volumes of data [3].
The goal of this research is to study how analysing patient data can support diabetes treatment in the healthcare industry.
The system can focus mainly on patients in rural areas [3], who can be treated at low cost because prediction takes less time
than with the current system.
The system can be developed further to estimate how likely non-diabetic patients are to become diabetic in the coming years.
REFERENCES
[1] Gauri D. Kalyankar, Shivananda R. Poojara and Nagaraj V. Dharwadkar, "Predictive Analysis of Diabetic Patient Data Using Machine Learning and Hadoop", International Conference on I-SMAC, 978-1-5090-3243-3, 2017.
[2] B. Nithya and Dr. V. Ilango, "Predictive Analytics in Health Care Using Machine Learning Tools and Techniques", International Conference on Intelligent Computing and Control Systems, 978-1-5386-2745-7, 2017.
[3] Dr Saravana kumar N M, Eswari T, Sampath P and Lavanya S, "Predictive Methodology for Diabetic Data Analysis in Big Data", 2nd International Symposium on Big Data and Cloud Computing, 2015.
[4] Han Wu, Shengqi Yang, Zhangqin Huang, Jian He, Xiaoyi Wang, "Type 2 diabetes mellitus prediction model based on data mining", Informatics in Medicine Unlocked, Volume 10, 2018, Pages 100-107, ISSN 2352-9148.
[5] Changsheng Zhu, Christian Uwa Idemudia, Wenfang Feng, "Improved logistic regression model for diabetes prediction by integrating PCA and K-means techniques", Informatics in Medicine Unlocked, Volume 17, 2019, 100179, ISSN 2352-9148.
[6] Temurtas, H., Yumusak, N., Temurtas, F., "A comparative study on diabetes disease diagnosis using neural networks", Expert Syst, Vol. 36, pp. 8610–15, 2009.
[7] Chavey, A., Kioon, M., Bailbé, D., "Programming Of Beta-Cell Disorders And Intergenerational Risk Of Type 2 Diabetes", Maternal Diabetes, Vol. 40, No. 5, pp. 323-30, 2014.
[8] Omar Kassem Khalil, Aissa Boudjella, "Analysis of Various Data Mining Techniques to Predict Diabetes Mellitus", Sixth International Conference on Developments in eSystems Engineering, 2016.
[9] Ayush Anand and Divya Shakti, "Prediction of Diabetes Based on Personal Lifestyle Indicators", 1st International Conference on Next Generation Computing Technologies, 978-1-4673-6809-4, September 2015.