Data Mining Applications and Feature Scope Survey
Abstract: This article surveys a range of strategies, methodologies, and research areas that are useful and relevant
to data mining technologies. Many multinational and other large corporations operate in several parts of the world,
and each business location may generate significant amounts of data. Corporate decision-makers need access to all of
these data sources in order to make strategic decisions. A data warehouse adds substantial value to the firm by
increasing the efficiency of management decision-making. The significance of such strategic information systems is
readily recognised in an uncertain and highly competitive corporate climate, although in today's business world
efficiency or speed is not the sole route to competitiveness. These data volumes range from terabytes to petabytes and
have profoundly affected research and engineering. To evaluate, manage, and make decisions with such large volumes of
data, we need data mining tools, which will change numerous fields. This work surveys a broad set of data mining
applications and outlines a more focused scope for data mining that will be useful in future research.
Keywords: Data mining tasks, web mining, data mining life cycle, data mining visualization, data mining
applications.
1. Introduction
Because data is available in a variety of formats, it must be handled so that the appropriate action can be taken.
The data should not only be analysed but also used to make sound decisions, and those decisions should be tracked.
The data should be retrieved from the database as and when the user requires it in order to make the best decision
possible. This approach is referred to as data mining, knowledge hub, or simply KDD (Knowledge Discovery in
Databases). The discovery of useful knowledge from massive collections of data, the subject of "data mining," drew a
lot of attention in the field of information technology because of the perception that "we are data abundant but
information poor."
There is a massive amount of data, but we are hardly able to transform it into meaningful information and knowledge
for corporate decision-making. Producing information requires collecting a large amount of data, which may come in
different formats such as audio/video, numbers, text, figures, and hypertext. To make full use of the data, we need a
tool for automatic data summarization, extraction of the essence of the stored information, and pattern detection in
raw data.
With the massive amounts of data saved in files, databases, and other repositories, it is becoming increasingly vital
to build effective tools for data analysis and interpretation, as well as the extraction of useful information that may
aid decision-making.
Data mining is the answer to all of the above. Data mining is the extraction of hidden predictive information from
enormous datasets; it is a powerful tool with great potential to help firms concentrate on the most important
information in their data warehouses [1,2,3,4]. Data mining software forecasts future patterns and behaviours,
allowing businesses to make proactive, knowledge-based decisions [2]. The automated, prospective analyses offered by
data mining go beyond the analysis of past events offered by conventional decision-support systems. Data mining
techniques can answer questions that were previously too time-consuming to resolve. They search databases for hidden
patterns and find predictive information that specialists may overlook because it falls outside their usual scope.
We present a new approach to defining the KDD process. Section 6 provides a brief overview of some of the most
commonly used data mining techniques. The heart of the article is Section 7, in which we examine applications and
recommend future directions for various data mining applications.
Publisher: Noida Institute of Engineering & Technology,
19, Knowledge Park-II, Institutional Area, Greater Noida (UP), India.
NIET Journal of Engineering & Technology (NIETJET)
Volume 6, Issue Winter 2017 ISSN: 2229-5828 (Print)
The basic goal of data visualisation is to convey the overall idea of the data mining model. In data mining we are
usually retrieving data from repositories in which it lies hidden, and this is the most difficult part for a user to
follow. Visualising the data mining model therefore helps provide the highest levels of understanding and trust.
Clustering refers to analysing data items without consulting a known class label. It is also called unsupervised
learning or segmentation. It is the process of dividing or segmenting data into groups or clusters. Domain
specialists evaluate the behaviour of the data to determine the clusters. The term segmentation has a more precise
meaning: it refers to the partitioning of a database into disjoint groups of similar tuples. Summarization is the
process of presenting summarised information derived from the data. Association rules capture the relationships
between the various attributes. The mining of association rules is a two-step procedure: identifying all frequent
item sets, and then generating strong association rules from them.
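The two-step procedure can be sketched in a few lines of Python. This is a minimal illustration and not the method of any particular tool: the function names, the support and confidence thresholds, and the sample shopping carts are all assumptions introduced here, and the candidate generation is a simplified Apriori-style join without the usual pruning step.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Step 1: map every itemset with support >= min_support to its count."""
    n = len(transactions)
    # Start from single-item candidates.
    candidates = {frozenset([item]) for t in transactions for item in t}
    frequent = {}
    size = 1
    while candidates:
        # Count how many transactions contain each candidate itemset.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        level = {c: k for c, k in counts.items() if k / n >= min_support}
        frequent.update(level)
        size += 1
        # Join frequent itemsets to form candidates that are one item larger.
        candidates = {a | b for a in level for b in level if len(a | b) == size}
    return frequent


def strong_rules(frequent, min_confidence):
    """Step 2: return (lhs, rhs, confidence) for rules meeting min_confidence."""
    rules = []
    for itemset, count in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in map(frozenset, combinations(itemset, r)):
                # confidence(lhs -> rhs) = count(itemset) / count(lhs)
                confidence = count / frequent[lhs]
                if confidence >= min_confidence:
                    rules.append((lhs, itemset - lhs, confidence))
    return rules


# Hypothetical shopping carts, invented purely for illustration.
carts = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]

frequent = frequent_itemsets(carts, min_support=0.4)
rules = strong_rules(frequent, min_confidence=0.7)
for lhs, rhs, confidence in rules:
    print(sorted(lhs), "->", sorted(rhs), f"confidence={confidence:.2f}")
```

Storing raw counts instead of fractions keeps the confidence calculation a simple ratio of two integers; every subset of a frequent item set is itself frequent, so the lookup `frequent[lhs]` always succeeds.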
A typical case is the set of items that customers place together in their shopping carts. Discovering such
relationships can improve business strategy. Merchants use data mining techniques to determine customers' buying
patterns. The strategy is thus used to increase business revenue while also helping customers purchase related
items.
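The clustering task described earlier in this section, dividing data into groups without class labels, can also be illustrated briefly. The text does not name a specific algorithm; the sketch below uses k-means purely as one common choice. The function names, the sample points, and the deterministic initialisation (the first k points as starting centroids) are assumptions made for this illustration.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(coord) / len(points) for coord in zip(*points))


def kmeans(points, k, max_iterations=100):
    """Partition points into k clusters; return (centroids, clusters)."""
    # Assumption: use the first k points as initial centroids so that the
    # result is reproducible; practical implementations often pick randomly.
    centroids = list(points[:k])
    clusters = [[] for _ in range(k)]
    for _ in range(max_iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: squared_distance(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        updated = [
            mean_point(c) if c else centroids[i] for i, c in enumerate(clusters)
        ]
        if updated == centroids:
            break  # Centroids stopped moving, so the assignment is stable.
        centroids = updated
    return centroids, clusters


# Hypothetical two-dimensional records, invented for illustration.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
centroids, clusters = kmeans(points, k=2)
for centroid, members in zip(centroids, clusters):
    print(centroid, members)
```

No class labels are supplied anywhere in the sketch: the groups emerge only from the distances between records, which is what makes the method unsupervised. A domain specialist would still need to interpret what each resulting cluster means.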
[1] Introduction to Data Mining and Knowledge Discovery, Third Edition, ISBN: 1-892095-02-5, Two Crows
Corporation, 10500 Falls Road, Potomac, MD 20854 (U.S.A.), 1999.
[2] Larose, D. T., “Discovering Knowledge in Data: An Introduction to Data Mining”, ISBN: 0-471-66657-2, John
Wiley & Sons, Inc., 2005.
[3] Dunham, M. H., Sridhar, S., “Data Mining: Introductory and Advanced Topics”, Pearson Education, New Delhi,
ISBN: 81-7758-785-4, 1st Edition, 2006.
[4] Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., and Wirth, R., “CRISP-DM 1.0:
Step-by-step data mining guide”, NCR Systems Engineering Copenhagen (USA and Denmark),
DaimlerChrysler AG (Germany), SPSS Inc. (USA) and OHRA Verzekeringen en Bank Groep B.V. (The
Netherlands), 2000.
[5] Fayyad, U., Piatetsky-Shapiro, G., and Smyth, P., “From Data Mining to Knowledge Discovery in Databases”,
AI Magazine, American Association for Artificial Intelligence, 1996.
[6] Tan, Pang-Ning, Steinbach, M., Kumar, Vipin, “Introduction to Data Mining”, Pearson Education, New Delhi,
ISBN: 978-81-317-1472-0, 3rd Edition, 2009.
[7] Bernstein, A. and Provost, F., “An Intelligent Assistant for the Knowledge Discovery Process”, Working
Paper of the Center for Digital Economy Research, New York University, also presented at the IJCAI 2001
Workshop on Wrappers for Performance Enhancement in Knowledge Discovery in Databases.
[8] Baazaoui, Z. H., Faiz, S., and Ben Ghezala, H., “A Framework for Data Mining Based Multi-Agent: An
Application to Spatial Data”, Proceedings of World Academy of Science, Engineering and Technology,
volume 5, ISSN: 1307-6884, April 2005.
[9] Rantzau, R. and Schwarz, H., “A Multi-Tier Architecture for High-Performance Data Mining”, A Technical
Project Report of ESPRIT project, The consortium of CRITIKAL project, Attar Software Ltd. (UK), Gehe AG
(Germany), Lloyds TSB Group (UK), Parallel Applications Centre, University of Southampton (UK), BWI,
University of Stuttgart (Germany), IPVR, University of Stuttgart (Germany).
[10] Botia, J. A., Garijo, M., Velasco, J. R., and Skarmeta, A. F., “A Generic Data Mining System: Basic Design
and Implementation Guidelines”, A Technical Project Report of CYCYT project of Spanish Government, 1998.
Web Site:
[11] Campos, M. M., Stengard, P. J., Boriana, L. M., “Data-Centric Automated Data
[12] Amit Choudhary, S. P. Singh, V. K. Pandey, “A Low Power and High Gain CMOS Tunable OTA with Cascade
Current Mirrors”, Volume 2, Issue 1, 2013, pp. 075-078, ISSN: 2229-5828.
[13] Anju Gauniya Pandey, Sanjita Das, S. P. Basu, Palak Srivastava, “Design and Evaluation of Nanoemulsion
for Delivery of Diclofenac Sodium”, Volume 2, Issue 1, 2013, pp. 079-082, ISSN: 2229-5828.
[14] Raj Kumar Goel, Rinku Sharma Dixit, Manu Pratap Singh, “Implementation of Pattern Storage Neural
Network as Associative Memory for Storage and Recalling of Finger Prints”, Volume 2, Issue 1, 2013,
pp. 083-090, ISSN: 2229-5828.
[15] Amit Kumar Yadav, Satyendra Sharma, “Design and Simulation of Multiplier for High-speed
Application”, Volume 2, Issue 2, 2014, pp. 001-007, ISSN: 2229-5828.
[16] Deepak Kumar, Anjana Rani Gupta, Somesh Kumar, “Dynamic Simulation of Multiple Effect Evaporators in
Paper Industry Using MATLAB”, Volume 2, Issue 2, 2014, pp. 008-014, ISSN: 2229-5828.
[17] Devendra Pratap, Satyendra Sharma, “Planning and Modelling of Indoor WLAN Through Field Measurement
at 2.437 GHz Frequency”, Volume 2, Issue 2, 2014, pp. 015-019, ISSN: 2229-5828.