An Optimal Genetic Algorithm With Support Vector Machine For Cloud Based Customer Churn Prediction
An Optimal Genetic Algorithm With Support Vector Machine For Cloud Based Customer Churn Prediction
An Optimal Genetic Algorithm With Support Vector Machine For Cloud Based Customer Churn Prediction
net/publication/344897761
An Optimal Genetic Algorithm with Support Vector Machine for Cloud Based
Customer Churn Prediction
CITATIONS READS
0 58
2 authors, including:
M. Jeyakarthic
Annamalai University
65 PUBLICATIONS 124 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by M. Jeyakarthic on 27 October 2020.
Abstract— At present times, Customer Churn Prediction providers and yields a gradual profit. Usually, CRM focus on
(CCP) becomes a familiar research area and is mainly solved devoted users to make important decisions about the churn. To
by the use of machine learning models. The advancements in handle the user churn in an effective manner, it is very
Internet of Things (IoT) and cloud computing platform allows significant to create productive as well as précised user churn
collecting data about the customers to carry out CCP. This predictive method. Subsequently, statistical and DM methods
paper introduces an optimal genetic algorithm (OGA) with are applied to build these churn predictive applications.
support vector machine (SVM) model for CCP. Initially, the
OGA is derived by the use of double chain-quantum-genetic- [2] proposed a technology for CCP to find the customers
algorithm. Next, the derived OGA is applied for optimizing with churn behavior. This method has been developed on the
the parameters of SVM namely C and γ. The outcome of the basis of elegant k-means for clustering; and used JRip to
OGA-SVM model is tested against a benchmark dataset from extract rules available. But, it has a disadvantage where it
telecom industry. The experimental outcome indicated that the cannot be applied at banking sector to detect the user churns
OGA-SVM shows excellent results by offering better and offers several other procedures for client retention. [3]
sensitivity of 94.50, specificity of 66.06, accuracy of 90.27, F- presented a user classifying method according to rough set
Score of 94.30 and kappa value of 61.17 correspondingly. theory for classifying user churn. It has been claimed that
rough set classifier is capable to perform more effectively than
Keywords— Customer Churn Prediction, SVM, Genetic linear regression, J48, and Neural Network (NN). Besides, [4]
algorithm defined the dimension of general discussions of clients in a
telecommunication system. This model helps to detect the
I. INTRODUCTION user's churn behavior by observing the incoming as well as
In general, data mining (DM) is defined as the extraction outgoing calls. Based on this process, it can be easily detected
of private data from massive databases. A DM model enables that a user has relationship with alternate churn where the
the conversion of original data into business information. It is customer is previously churned. Also, it is very complex to
associated with the process of selection, identification as well assume which detective model can be fixed as a reputed
as labeling more number of data to expose hidden data technique to report CCP in an effective manner. It does not
patterns for commercial purpose. A fast development of change the study of ML approach. Though the previous
market in all domains leads to greater production of service operation [5] reveals that SVM is an optimal classifier since it
providers. Many number of participants, novel as well as is capable of handling nonlinearities but [6] addressed that
creative business techniques and good services has been churn prediction in telecommunication industry.
increasing the user acquisition in a developing market, and
several organizations has the knowledge to work hard to In addition, [7] defined that the arbitrary sampling
retains previous clients and to satisfy user needs [1]. As an framework is an optimized technique established in prior to
inclusion, cloud computing (CC) technique has been classification method. This model helps to reduce the irregular
employed which provides the dimension of primary users. data distribution which happens due to the absence of decided
Thus, the service provider has to work hard for predicting the class data from user churn. Also, it has been addressed [8] that
churn. class irregularity cannot enhance the function of prediction
technique. Apart from this, [9] applied weighted random
Nowadays, Customer Relationship Management (CRM) forests (RF) and provided the derived outcome that showcased
methods replace the conventional marketing principles by a little enhancement in CCP method. However, it has been
applying defined marketing procedures. Such kinds of underestimated for its difficulty in learning and interpretation
personalized marketing services contributes in finding a sub- [10]. On the other hand, there are no alternate studies to report
set of old clients mostly stop using good or services provided the factors involved in modeling CCP method for TCI.
by churn or a firm. Since the previous user churn often
provides the result in loss of profit and customers, a minor Here, CCP is said to be a binary classifying issue such that
alteration in the retention value tends in minimum success for users are classified as 2 feasible behaviors namely, Churn, and
Non-Churn. Furthermore, a churn behavior is divided as: functions available namely, Linear kernel, Polynomial kernel
Voluntary customer churn, where the user agree to relive from as well as RBF kernel. In this study, it prefers RBF kernel as
a company, and involuntary customer churn, where a firm major kernel function of SVM, as demonstrated in Eq. (2).
decides to stop a contract signed with a user [11]. Therefore,
the study reveals that a voluntary user churns is very complex ( ) ( ) ( )
to predict this kind of customer churn whereas it is simple to
extract the involuntary customer churn using simple queries.
Besides, this work states that previous models advertise that
there is no other optimal technique to control CCP issue. This
is due to the irregular outcome provided by various datasets.
Hence, there is a major requirement of determining the quality
of a classification model with respect to certainty or
uncertainty, while predicting the customer churn.
( ) ∑ ∑
amplitude to define possible solution as well as quantum convergence. The sign as well as size of a rotation angle,
rotating gate to update chromosomes. depends upon a gradient data for objective function which can
be estimated with the help of following technique.
When the solution of dimensional space optimization has
been considered as vectors, is the variable of optimization Let | |, where and are assumed to be
problem which has to be optimized, and the continuous
optimization issue might be defined as given in the following: respective probability amplitudes of global optimal solution,
and are corresponding probability amplitudes of recent
( ) ( ) solution, If , then the rotation direction is ( ).
Else, positive as well as negative directions should be
( ) approved.
To improve the quality of proposed solutions, it The rotation angle size may be computed on the basis of
defines the fitness function (FF) as follows: gradient step size searching theory for a frequent optimizing
issue. Hence, size of a rotation angle can be estimated as given
( ) ( ) ( ) in the following:
( )
where denotes more amount of FF is the optimization ( )
( ) ( )
task.
By assuming the random behaviour of encoding as where implies a initial rotation angle. As provided
well as conditions of quantum state probability amplitudes, the and are described as:
system of double chains coding has been provided in the
following: ( ) ( )
,| | | |-
( ) ( ) ( )
* + ( )
( ) ( ) ( ) ( ) ( )
,| | | |- ( )
Where ,
represents the population size, implies the In case of discrete optimization problem, size of a rotation
count of quantum bits. Hence, a first line solution from angle might be computed according to the differential of first
chromosome is meant to be cosine solution, and alternate one order among 2 generations as defined by Eq. (11).
is named as sine solution.
( ) ( ) ( )
Obviously, every chromosome consists of group of
qubits. Later, it maps the provided solution which has been
{| ( )| | ( )|} ( )
shown by every chromosomes present at unit space
the solution space of frequent optimizing issues.
Defined by a recent best individual, the updating process of { ( ) ( )}
qubit chromosome has been conducted with the application of
quantum rotation gate that is depicted in Eq. (7): Where and 155 are parent and offspring
chromosomes, correspondingly.
( ) ( )
( ) [ ] ( )
( ) ( ) C. Improved SVM Classifier based on OGA
The OGA-based optimization task of penalty variable C as
Upgrading function could be described as given in the
well as a kernel attribute of SVM classification along with
following:
RBF could be depicted in Fig. 2, which is named as
( ) ( ) ( ) ( ) OGASVM.
[ ][ ] [ ] ( )
( ) ( ) ( ) ( )
The OGA-SVM optimization function is defined as
provided in the following:
where represents a rotation angle, ( ) denotes a quantum
rotation gate.
1. Specimen collection. Gather the samples of various
classes, and classify the instances as training as well
is said to be more important for convergence
as testing group.
superiority, where angle sign agrees the converging
manipulation at the time of angle size deciding the speed of
Description Dataset
Number of Features 21
Number of Class 2
TABLE II
PERFORMANCE EVALUATION OF DIFFERENT TRADITIONAL METHODS WITH PROPOSED
Table 2 and Fig. 4 show the results offered by the OGA- as given in Table 3 and Fig. 5. Upon calculating the final
SVM with the classical models. It is shown that the Vote outcome in terms of accuracy, it shows that the LDT/UDT-1
model offers least results by attaining a specificity of 85.51%, as well as LDT/UDT-2 methods accomplish a minimum and
accuracy of 85.51% and F-score of 92.18%. Besides, the NB similar accuracy of 84. Followed by, a gradual better
model has offered a slightly better outcome by offering a classification result is provided by LDT/UDT-10 that shows
sensitivity of 92.21%, specificity of 57.98%, accuracy of the accuracy of 84.30.
87.64%, F-score of 92.81% and kappa value of 48.44%
respectively. Along with that, the SVM model has offered near TABLE III
optimal results and provided a higher sensitivity of 93.46%, COMPARISON WITH RECENT METHODS FOR APPLIED DATASET
specificity of 60.13%, accuracy of 88.30%, F-score of 93.19% IN TERMS OF ACCURACY AND F-SCORE
and kappa value of 61.40% respectively. However, the
proposed model shows superior results by offering a
maximum sensitivity of 94.50%, specificity of 66.06%, Methods Accuracy F-Measure
accuracy of 90.27%, F-score of 94.30% and kappa value of
61.17% respectively. OGA-SVM 90.27 94.30