Loan Prediction System

Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

Volume 8, Issue 10, October – 2023 International Journal of Innovative Science and Research Technology

ISSN No:-2456-2165

Loan Prediction System


1 2
Prem Doshi Paarth Soni
Information Technology Information Technology
Dwarkadas Jivanlal Sanghvi College of Engineering, K.J Somaiya College of Engineering,
Mumbai,India Mumbai,India

3 4
Param Shah Naman Limani
Information Technology Computer Engineering
Vasantdada Patil Pratishthan college of Engineering, Bharati Vidyapeeth College of Engineering,
Mumbai,India Mumbai,India

Abstract:- The modern financial world is marked by approval process, lowers the number of defaults, and meets
lending practices that are always getting better and a the needs of both consumers and banks as they grow. [1]
greater reliance on advanced technologies to speed up
the decision-making process. Financial companies put a People are having a harder time getting loans, but
lot of value on the Loan Prediction System because it banks are getting a lot more requests for loans. In the normal
helps them decide which applicants are creditworthy. process, qualifying is carefully checked and judged, which
This study gives an in-depth look at a sophisticated Loan is tough and takes a long time. It costs banks a lot of money
Prediction System that takes into account important when people don't pay back their loans, which makes things
factors, mainly the applicant's and co-applicant's worse. Utilizing a machine learning methodology is seen as
income. the most effective means to mitigate these issues. The
objective of this project is to enhance the efficiency of the
Keywords:- Machine Learning, Loan Prediction. loan approval process through the utilization of machine
learning tools incorporating categorization algorithms. This
I. INTRODUCTION will enhance the precision of decision-making processes and
reduce dependence on human labor. Ultimately, this
In business, it's very important to check someone's approach expedites the loan approval procedure, diminishes
creditworthiness before lending them money. Machine the incidence of defaults, and effectively addresses the
learning methods such as Random Forest and Gradient escalating demands of borrowers and financial institutions
Boosting Regression are used along with Python to make alike. In this study, we aim to investigate the impact of
this happen. This way of making decisions about loan social media usage on mental health.. These algorithms are
acceptance more accurate and automatic by looking at data subjected to rigorous comparison and evaluation using
from past loans. This data includes details about the established metrics. Notably, Logistic Regression emerges
applicant's credit background, income, and other important as the top-performing model, achieving an accuracy rate of
things. Because they use machine learning, these algorithms 92% and outshining its counterparts with an impressive F1-
learn to find complex patterns and links in the information. Score of 96%.[2]
This helps them guess how likely it is that a person who
wants a loan will not pay it back. By using prediction This initiative addresses the crucial challenge of
analysis, banks can make better loan decisions, lower risks, identifying safe loan beneficiaries in response to an increase
and speed up the loan approval process. This is good for in loan applications and limited banking assets. The goal is
traders and people who want to borrow money. These to optimize resource allocation and reduce risk by analyzing
technologies are very important in the loan business because huge data derived from prior loan records. The study hopes
they make things run more easily and save money. to accurately predict loan safety by training a machine
learning model on these prior events. The study is divided
II. LITERATURE REVIEW into four sections: data collection, machine learning model
comparison, training on the most promising model, and
A lot of people are asking for loans, even though a lot testing. To improve the accuracy of loan predictions, various
of them need more money. An important part of the process machine learning methods such as classification, logistic
is carefully checking skills, which makes it hard to do and regression, decision tree, and gradient boosting are used.[3]
takes a long time. Banks lose a lot of money when people
don't pay back their loans. This makes things worse. Making The process of loan approval holds significant
use of machine learning is the best way to fix these importance for banking institutions. The loan applications
problems. The study's goal is to find faster ways to get loans were either approved or refused by the system. The process
by using machine learning tools and sorting methods. This of loan recovery plays a substantial role in the financial
will help people make better decisions with less work on statements of a bank. Predicting the likelihood of debt
their part. On top of that, this method speeds up the loan repayment by customers poses a challenging task. In recent

IJISRT23OCT241 www.ijisrt.com 129


Volume 8, Issue 10, October – 2023 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165
years, there has been a significant amount of research Ultimately, the objective of this project is to employ a
conducted on the development and improvement of loan methodology that relies on data analysis and automation to
acceptance prediction systems. Machine Learning (ML) facilitate the process of loan approvals, thereby making a
methodologies are quite advantageous when it comes to valuable contribution to the ongoing expansion of financial
forecasting results for extensive quantities of data within the institutions. The objective of this project is to reduce the
System. This study employs two machine learning frequency of default and financial losses, improve overall
methodologies, namely Support Vector Machine (SVM) and customer happiness and efficiency in the loan business by
Random Forest (RF), to forecast the approval of customer achieving greater accuracy rates and improving the forecast
loans.The user did not provide any text to rewrite. process.

The process of loan acceptance holds significant IV. METHODOLOGY


importance within the banking sector, and accurately
predicting loan approval is a considerable challenge. The  The establishment of an efficient Loan Prediction System
system responsible for the approval or rejection of loan necessitates the implementation of a complete approach
applications is of utmost importance. The process of loan that spans many stages, including data collection,
recovery significantly affects the financial accounts of a preprocessing, feature engineering, model selection,
bank; nonetheless, accurately predicting consumer training, and evaluation. The subsequent systematic
repayment behavior continues to pose difficulties. Current procedure delineates the fundamental constituents of this
academic research has focused on the development of methodology:
algorithms for predicting loan acceptance. The utilization of
Machine Learning (ML) methodologies is gaining  Data Acquisition: The fundamental component of every
significance in effectively managing substantial amounts of predictive modeling endeavor is a reliable dataset. The
data. The present study employs the Support Vector Loan Prediction System gathers historical data pertaining
Machine (SVM) and Random Forest (RF) algorithms to to loan applications, approvals, rejections, and client
predict customer loan acceptance, acknowledging their information. The dataset should exhibit diversity by
efficacy in handling and evaluating complex data within the incorporating a range of demographic and financial
banking domain. variables, such as applicant income, credit history, loan
amount, employment particulars, and other pertinent
III. OBJECTIVE OF RESEARCH characteristics.
 Data Preprocessing: Raw data frequently exhibits
The major objective of this project is to completely disorderliness and may encompass instances of missing
change how banks approve loans. The project's goal is to values or outliers. Data preprocessing is a crucial step in
make tools that can guess how much loans will be. The main the data analysis process, since it involves the systematic
goal is to use Machine Learning (ML) algorithms to make cleaning and organization of the dataset to enhance its
lending decisions more accurate, effective, and efficient. quality and dependability.
This is because there are a lot of people who need loans and  The selection of a suitable machine learning model is a
it's hard to tell who is qualified. Looking at past loan critical step in ensuring the accuracy of predictions.
approval data is the main goal of this study. The researchers Logistic Regression, Decision Trees, Random Forest,
want to find out how age, type of income, loan annuity, Support Vector Machines, and Gradient Boosting are
credit history, job company, and length of time on the job among the frequently employed models for loan
affected loan approvals. We want to learn new things and prediction. The selection of a particular option is
find important trends in the data. By looking at a number of dependent on several factors, including the attributes of
factors, the study's goal is to make a model that can the data, the interpretability of the model, and the trade-
accurately tell which loan applicants will be good ones. The off between bias and variance..
main goal is to speed up and improve the process of  The selected model is subjected to training using the
reviewing applications, which is known for taking a long preprocessed dataset. Throughout the training procedure,
time and a lot of time and money. the model assimilates information regarding the patterns
and interrelationships inherent in the dataset, hence
.Furthermore, the main aim of this study is to improve facilitating its ability to create predictions. To assess the
the accuracy of credit approval decisions, ultimately aiming efficacy of the model and address the concern of
to reduce the financial risks faced by banks as a result of overfitting, it is typical to partition the dataset into
loan defaults.The objective of this study is to identify the distinct training and validation sets.
key factors that significantly impact the outcomes of loan  The optimization of a model's performance is contingent
acceptance through the application of several machine upon the careful adjustment of its hyperparameters. To
learning algorithms, such as Random Forest, XGBoost, enhance the precision of a model, several techniques
Adaboost, Lightgbm, Decision Tree, and K-Nearest such as grid search and random search are employed to
Neighbor. This comprehensive analysis of machine learning ascertain the optimal configuration of hyperparameters.
methodologies will facilitate the selection of the most  The evaluation of the model involves the utilization of a
appropriate model for the given problem. separate test dataset to examine its efficacy in processing
unfamiliar data, which it has not been exposed to during
training. The evaluation criteria commonly employed for

IJISRT23OCT241 www.ijisrt.com 130


Volume 8, Issue 10, October – 2023 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165
loan prediction include accuracy, precision, recall, and ensuring the effectiveness and alignment of the Loan
the F1 score. The application of a confusion matrix Prediction System with developing trends and consumer
enables the assessment of the model's efficacy in behaviors.
properly predicting loan approvals and rejections.
 Deployment refers to the process of implementing a In brief, the development of a Loan Prediction System
model in a real-world setting once it has demonstrated necessitates a meticulously designed approach that
satisfactory performance. The deployment phase encompasses the meticulous assessment of data quality, the
involves the integration of the model into the existing judicious selection of models, the comprehensive training
loan approval system of the bank, allowing it to make and evaluation of these models, and the continual
forecasts for new loan applications. maintenance of the system. The use of this comprehensive
 Monitoring and maintenance play crucial roles in methodology guarantees the system's dependability and
guaranteeing the continuous accuracy of the model's efficiency in facilitating well-informed decision-making
performance.. In the event of a modification in the data within the banking industry.
distribution or a decline in the model's performance, it
may be imperative to engage in retraining procedures.  Flow Chart
Regular updates and maintenance are essential for

Fig 1 System Architecture

 Figure Labels: individual's financial earnings, occupational situation, and


The provided flowchart illustrates the application of creditworthiness. Subsequently, the implementation
machine learning techniques in the process of assessing loan incorporates two machine learning algorithms, namely the
eligibility for users. The initial stage involves entering the Random Forest Classifier and the Gradient Boosting
user's information. This may encompass factors such as an Regressor. The algorithms utilized in this study have been

IJISRT23OCT241 www.ijisrt.com 131


Volume 8, Issue 10, October – 2023 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165
trained using a dataset consisting of loan applications. The V. RESULTS
target variable in this dataset is the outcome of each
application, specifically whether the loan was authorized or The Loan Prediction System's outputs, which used
rejected. The Random Forest Classifier method is employed Random Forest and Gradient Boosting Regressor machine
to ascertain the eligibility of a user for a loan. If the user learning algorithms, provided considerable insights into
meets the eligibility criteria, the Gradient Boosting applicants' eligibility status and proposed loan amounts.
Regressor algorithm is employed to make predictions Random Forest performed admirably, accurately
regarding the loan amount for which the user would be categorizing applications as eligible or ineligible with a high
eligible. In the event that the user does not meet the precision rate. Gradient Boosting Regressor, on the other
necessary criteria to qualify for a loan, a notification is hand, excelled at predicting specific loan amounts,
generated and presented to the user, conveying this providing a detailed understanding of the best financial
information. If the user meets the criteria for loan eligibility, support for qualified candidates. Random Forest's ensemble
a message is generated to tell the user about their eligibility nature and Gradient Boosting Regressor's boosting
status and provide an estimate of the loan amount they are technique proved effective in dealing with complex
likely to get. The presented flowchart illustrates the relationships within the data, contributing to the system's
application of machine learning techniques in automating overall accuracy and reliability in determining eligibility
the loan eligibility screening process. status and recommending appropriate loan amounts for
diverse applicants.

Fig 2 Prediction Input form

Fig 3 Example input for Prediction

IJISRT23OCT241 www.ijisrt.com 132


Volume 8, Issue 10, October – 2023 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165

Fig 4 Result

VI. CONCLUSION cases, improved data sources, scalability, user experience,


and compliance considerations. These developments hold
Finally, the creation and deployment of a sophisticated the potential to revolutionize the financial industry by
Loan Prediction System represents a big step toward facilitating smarter and more efficient lending decisions
improving financial institutions' efficiency and risk while minimizing risks and optimizing customer
management capabilities. By properly forecasting loan satisfaction.
approval outcomes, the system aids to the prudent use of
scarce funds by leveraging advanced machine learning REFERENCES
models and a careful approach. The technology delivers
valuable insights on the creditworthiness of applicants by [1]. S. Magar, N. S. Nikam, N. Taksale, and S. Hajare,
assessing historical data, preprocessing information, and "Loan Eligibility Prediction using Machine Learning
fine-tuning models. The algorithms employed, whether Algorithms," in journal of journal of emerging
Support Vector Machine, Random Forest, or others, provide technologies and innovative research, SKNCOE,
versatility and accuracy when dealing with varied datasets. SPPU, Pune, India, 2020.
As the financial landscape changes, the system's ongoing [2]. A. Shinde, Y. Patil, I. Kotian, A. Shinde, and R.
monitoring and modification ensures its relevance, allowing Gulwani, "Loan Prediction System Using Machine
banks to make educated decisions, reduce risks, and speed Learning," Department of Information Technology,
loan approval processes. RAIT, Nerul, India, and D.Y. Patil Deemed to be
University, Ramrao Adik Institute of Technology,
FUTURE SCOPE Nerul, Navi Mumbai, India, 2022
[3]. M. A. Mamun, A. Farjana, and M. Mamun, "Predicting
The project on loan prediction systems exhibits Bank Loan Eligibility Using Machine Learning Models
significant potential for future expansion and refinement, and Comparison Analysis," in Proceedings of the IEEE
offering a promising avenue for further research and International Conference on Data Science and Machine
development..Firstly, the incorporation of advanced Learning, Department of Economics & Decision
machine learning algorithms and artificial intelligence Sciences, The University of South Dakota, Vermillion,
techniques can elevate the accuracy and robustness of loan South Dakota, USA, 2022
predictions. The integration of deep learning models, neural [4]. S. Supriya, P. Myneedi, S. Nagarapu, V. K. Namburi,
networks, and natural language processing (NLP) algorithms and K. Vikas, "Loan Prediction by using Machine
can provide a more nuanced analysis of applicant data, Learning Models," B-Tech, Dept of CSE, VVIT,
including unstructured data sources like text, enabling Guntur, AP, 2023.
financial institutions to make even more precise lending [5]. D. I. Gouda, A. K. A, A. M. Madivala, D. K. R., and
decisions. Dr. R. Kumar, "Loan Approval Prediction Based on
Machine Learning," in journal International Research
Secondly, the project can be extended to encompass a Journal of Engineering and Technology (IRJET), 2021
broader range of financial products and services beyond [6]. A. S. Kadam, S. R. Nikam, A. A. Aher, G. V. Shelke,
traditional loans, such as credit cards, mortgages, and and A. S. Chandgude,"Prediction for Loan Approval
microloans. This expansion would require adapting the using Machine Learning Algorithm," in journal
existing prediction model to accommodate different risk International Research Journal of Engineering and
assessment criteria and customer profiles, thereby Technology (IRJET), 2021
diversifying the applicability of the system. n conclusion, [7]. Y. Diwate, P. Rana, and P. Chavan, "Loan Approval
the future scope of the loan prediction system project is vast, Prediction Using Machine Learning," ISB&M College
encompassing advancements in technology, expanded use of Engineering, Pune, India,2021

IJISRT23OCT241 www.ijisrt.com 133

You might also like