Data Mining
Data Mining
Data Mining
General information
Data mining is one of the most useful techniques that help entrepreneurs, researchers, and
individuals to extract valuable information from huge sets of data. Data mining is also
called Knowledge Discovery in Database (KDD).
Data cleaning Data integration Data selection Data transformation Data mining
In general terms, “Mining” is the process of extraction of some valuable material from the earth e.g. coal
mining, diamond mining, etc. In the context of computer science, “Data Mining” can be referred to
as knowledge mining from data, knowledge extraction, data/pattern analysis, data archaeology, and data
dredging. It is basically the process carried out for the extraction of useful information from a bulk of data or
data warehouses. In the case of coal or diamond mining, the result of the extraction process is coal or diamond.
But in the case of Data Mining, the result of the extraction process is not data!! Instead, data mining results are
the patterns and knowledge that we gain at the end of the extraction process. In that sense, we can think of Data
Mining as a step in the process of Knowledge Discovery or Knowledge Extraction.
Example
banks typically use ‘data mining’ to find out their prospective customers who could be interested
in credit cards, personal loans, or insurance as well. Since banks have the transaction details and
detailed profiles of their customers, they analyze all this data and try to find out patterns that help
them predict that certain customers could be interested in personal loans, etc.
Data Mining is a process used by organizations to extract specific data from huge databases to
solve business problems. It primarily turns raw data into useful information.
Advantages of Data Mining
•The Data Mining technique enables organizations to obtain knowledge-based data.
•Data mining enables organizations to make lucrative modifications in operation and production.
•Compared with other statistical data applications, data mining is a cost-efficient.
•Data Mining helps the decision-making process of an organization.
•It Facilitates the automated discovery of hidden patterns as well as the prediction of trends and behaviors.
•It can be induced in the new system as well as the existing platforms.
•It is a quick process that makes it easy for new users to analyze enormous amounts of data in a short time.
Disadvantages of Data Mining
•There is a probability that the organizations may sell useful data of customers to other organizations for
money. As per the report, American Express has sold credit card purchases of their customers to other
organizations.
•Many data mining analytics software is difficult to operate and needs advance training to work on.
•Different data mining instruments operate in distinct ways due to the different algorithms used in their
design. Therefore, the selection of the right data mining tools is a very challenging task.
•The data mining techniques are not precise, so that it may lead to severe consequences in certain
conditions.
Data Mining Applications
Data Mining is primarily used by organizations with intense consumer demands- Retail,
Communication, Financial, marketing company, determine price, consumer preferences, product
positioning, and impact on sales, customer satisfaction, and corporate profits. Data mining
enables a retailer to use point-of-sale records of customer purchases to develop products and
promotions that help the organization to attract the customer.
Data Mining Applications
Data Mining in Healthcare:
It is a framework, such as Rstudio or Tableau that allows you to perform different types of data
mining analysis.
We can perform various algorithms such as clustering or classification on your data set and
visualize the results itself. It is a framework that provides us better insights for our data and the
phenomenon that data represent. Such a framework is called a data mining tool.
Rapid Miner
Rapid Miner is one of the most popular predictive analysis systems created by the company with
the same name as the Rapid Miner. It is written in JAVA programming language. It offers an
integrated environment for text mining, deep learning, machine learning, and predictive analysis.
The instrument can be used for a wide range of applications, including company applications,
commercial applications, research, education, training, application development, machine
learning.
Rapid Miner provides the server on-site as well as in public or private cloud infrastructure. It has
a client/server model as its base. A rapid miner comes with template-based frameworks that
enable fast delivery with few errors(which are commonly expected in the manual coding writing
process)