Data Mining Implementation Process
Data Mining Implementation Process
Data Mining Implementation Process
Many different sectors are taking advantage of data mining to boost their business
efficiency, including manufacturing, chemical, marketing, aerospace, etc. Therefore, the
need for a conventional data mining process improved effectively. Data mining
techniques must be reliable, repeatable by company individuals with little or no
knowledge of the data mining context. As a result, a cross-industry standard process for
data mining (CRISP-DM) was first introduced in 1990, after going through many
workshops, and contribution for more than 300 organizations.
Data mining is described as a process of finding hidden precious data by evaluating the
huge quantity of information stored in data warehouses, using multiple data mining
techniques such as Artificial Intelligence (AI), Machine learning and statistics.
14.5M
266
Prime Ministers of India | List of Prime Minister of India (1947-2020)
1. Business understanding:
It focuses on understanding the project goals and requirements form a business point of
view, then converting this information into a data mining problem afterward a preliminary
plan designed to accomplish the target.
Tasks:
o It Understands the project targets and prerequisites from a business point of view.
o Thoroughly understand what the customer wants to achieve.
o Reveal significant factors, at the starting, it can impact the result of the project.
Access situation:
o It requires a more detailed analysis of facts about all the resources, constraints,
assumptions, and others that ought to be considered.
o A business goal states the target of the business terminology. For example,
increase catalog sales to the existing customer.
o A data mining goal describes the project objectives. For example, It assumes how
many objects a customer will buy, given their demographics details (Age, Salary,
and City) and the price of the item over the past three years.
o It states the targeted plan to accomplish the business and data mining plan.
o The project plan should define the expected set of steps to be performed during
the rest of the project, including the latest technique and better selection of tools.
2. Data Understanding:
Data understanding starts with an original data collection and proceeds with operations
to get familiar with the data, to data quality issues, to find better insight in data, or to
detect interesting subsets for concealed information hypothesis.
Tasks:
Describe data:
Explore data:
3. Data Preparation:
Tasks:
o Select data
o Clean data
o Construct data
o Integrate data
o Format data
Select data:
Clean data:
o It may involve the selection of clean subsets of data, inserting appropriate defaults
or more ambitious methods, such as estimating missing information by modeling.
Construct data:
Integrate data:
o Integrate data refers to the methods whereby data is combined from various
tables, or documents to create new documents or values.
Format data:
o Formatting data refer mainly to linguistic changes produced to information that
does not alter their significance but may require a modeling tool.
4. Modeling:
In modeling, various modeling methods are selected and applied, and their parameters
are measured to optimum values. Some methods gave particular requirements on the
form of data. Therefore, stepping back to the data preparation phase is necessary.
Tasks:
o It selects the real modeling method that is to be used. For example, decision tree,
neural network.
o If various methods are applied,then it performs this task individually for each
method.
o Generate a procedure or mechanism for testing the validity and quality of the
model before constructing a model. For example, in classification, error rates are
commonly used as quality measures for data mining models. Therefore, typically
separate the data set into train and test set, build the model on the train set and
assess its quality on the separate test set.
Build model:
o To create one or more models, we need to run the modeling tool on the prepared
data set.
Assess model:
o It interprets the models according to its domain expertise, the data mining success
criteria, and the required design.
o It assesses the success of the application of modeling and discovers methods more
technically.
o It Contacts business analytics and domain specialists later to discuss the outcomes
of data mining in the business context.
5. Evaluation:
o At the last of this phase, a decision on the use of the data mining results should be
reached.
o It evaluates the model efficiently, and review the steps executed to build the model
and to ensure that the business objectives are properly achieved.
o The main objective of the evaluation is to determine some significant business
issue that has not been regarded adequately.
o At the last of this phase, a decision on the use of the data mining outcomes should
be reached.
Tasks:
o Evaluate results
o Review process
o Determine next steps
Evaluate results:
o It assesses the degree to which the model meets the organization's business
objectives.
o It tests the model on test apps in the actual implementation when time and budget
limitations permit and also assesses other data mining results produced.
o It unveils additional difficulties, suggestions, or information for future instructions.
Review process:
o The review process does a more detailed evaluation of the data mining
engagement to determine when there is a significant factor or task that has been
somehow ignored.
o It reviews quality assurance problems.
6. Deployment:
Determine:
Tasks:
o Plan deployment
o Plan monitoring and maintenance
o Produce final report
o Review project
Plan deployment:
o To deploy the data mining outcomes into the business, takes the assessment
results and concludes a strategy for deployment.
o It refers to documentation of the process for later deployment.
o It is important when the data mining results become part of the day-to-day
business and its environment.
o It helps to avoid unnecessarily long periods of misuse of data mining results.
o It needs a detailed analysis of the monitoring process.
o A final report can be drawn up by the project leader and his team.
o It may only be a summary of the project and its experience.
o It may be a final and comprehensive presentation of data mining.
Review project:
o Review projects evaluate what went right and what went wrong, what was done
wrong, and what needs to be improved.