Data mining Algorithm
0 Followers
Recent papers in Data mining Algorithm
This paper compares the accuracy of Decision Tree and Bayesian Network algorithms for predicting the academic performance of undergraduate and postgraduate students at two very different academic institutes: Can Tho University (CTU), a... more
This paper describes a text mining tool that performs two tasks, namely document clustering and text summarization. These tasks have, of course, their corresponding counterpart in "conventional" data mining. However, the... more
This paper presents a Multi-Agent Market simulator designed for developing new agent market strategies based on a complete understanding of buyer and seller behaviors, preference models and pricing algorithms, considering user risk... more
Educational Data Mining (EDM) is the process of converting raw data from educational systems to useful information that can be used by educational software developers, students, teachers, parents, and other educational researchers. EDM is... more
Educational Data Mining (EDM) is the process of converting raw data from educational systems to useful information that can be used by educational software developers, students, teachers, parents, and other educational researchers. EDM is... more
Data mining is an interdisciplinary exercise. Statistics, database technology, machine learning, pattern recognition, artificial intelligence, and visualization, all play a role. And just as it is difficult to define sharp boundaries... more
Rare association rules are those that only appear infrequently even though they are highly associated with very specific data. In consequence, these rules can be very appropriate for using with educational datasets since they are usually... more
Several data mining algorithms use iterative optimization methods for learning predictive models. It is not easy to determine upfront which optimization method will perform best or converge fast for such tasks. In this paper, we analyze... more
In this paper, we show how using data mining algorithms can help discovering pedagogically relevant knowledge contained in databases obtained from Web-based educational systems. These findings can be used both to help teachers with... more
Data privacy is a major threat to the widespread deployment of data grids in domains such as health care and finance. We propose a novel technique for obtaining knowledge-by way of a data mining model-from a data grid, while ensuring that... more
As a result of today's competitive business environment, companies have been trying to improve the utilization of funds effectively in their budgets for information technology investments. These companies retrieve more information with... more
... If one describes an animal as having wings and feathers, it is easy to classify it as a bird, and if ones asks you to describe a bird, you will likely get the major ... Talbert, DA, Honeycutt, M., Smith, A., Talbert, SR: An adaptive,... more
In a data warehousing process, the data preparation phase is crucial. Mastering this phase allows multidimensional analysis or the use of data mining algorithms, as well as substantial gains in terms of time and performance when... more
Mining of association rules is to find associations among data items that appear together in some transactions or business activities. As of today, algorithms for association rule mining, as well as for other data mining tasks, are mostly... more
This thesis is devoted to the design of data mining algorithms for evolving data streams and for the extraction of closed frequent trees. First, we deal with each of these tasks separately, and then we deal with them together, developing... more
The vast majority of data mining algorithms require the setting of many input parameters. The dangers of working with parameter-laden algorithms are twofold. First, incorrect settings may cause an algorithm to fail in finding the true... more
The aim of the Code4Thought project was to deliver a tool supported methodology that would facilitate the evaluation of a software product's quality according to ISO/IEC-9126 software engineering quality standard. It was a joint... more
Sampling of large datasets for data mining is important for at least two reasons. The processing of large amounts of data results in increased computational complexity. The cost of this additional complexity may not be justifiable. On the... more
The productivity of a compiler development team depends on its ability not only to the design effective solutions to known code generation problems, but also to uncover potential code improvement opportunities. This paper describes a data... more
Many high level representations of time series have been proposed for data mining, including Fourier transforms, wavelets, eigenwaves, piecewise polynomial models, etc. Many researchers have also considered symbolic representations of... more
Web structure mining has been a well-researched area during recent years. Based on the observation that data on the web may change at any time in any way, some incremental data mining algorithms have been proposed to update the mining... more
This chapter describes a principled approach to meta-learning that has three distinctive features. First, whereas most previous work on meta-learning focused exclusively on the learning task, our approach applies meta-learning to the full... more
In this paper we report ongoing research on the Open Academic Analytics Initiative (OAAI), a project aimed at increasing college student retention by performing early detection of academic risk using data mining methods. The paper... more
Mining published articles in biology and medicine is a favored means of identifying potential biomarkers in comparison to conventional reviewing process. This is made possible by the development of public literature databases and data... more
Natural systems exhibit random, chaotic, and multiply periodic behaviors that are driven by gravity, weather, and man-made disturbances. Modeling them on a large scale is challenging because behaviors vary discontinuously both spatially... more
Generalized association rule extraction is a powerful tool to discover a high level view of the interesting patterns hidden in the analyzed data. However, since the patterns are extracted at any level of abstraction, the mined rule set... more
This paper describes a text mining tool that performs two tasks, namely document clustering and text summarization. These tasks have, of course, their corresponding counterpart in "conventional" data mining. However, the... more
We describe a grid-based approach for enterprisescale data mining that leverages database technology for I/O parallelism, and on-demand compute servers for compute parallelism in the statistical computations. By enterprise-scale, we mean... more
Data mining algorithms, especially those used for unsupervised learning, generate a large quantity of rules. In particular this applies to the Apriori family of algorithms for the determination of association rules. It is hence impossible... more
In this research we applied classification models for prediction of students’ performance, and cluster models for grouping students based on their cognitive styles in e-learning environment. Classification models described in this paper... more
Data mining on clinical data is a challenging area in the field of medical research, aiming at predicting and discovering patterns of disease occurrence and prognosis based on detected symptoms and reported health conditions. Data mining... more
Recent studies have demonstrated the prospects of data mining algorithms for addressing the task of seriation in pa-leontological data (ie the age-based ordering of the sites of excavation). A prominent approach is spectral ordering that... more
In this paper, we used data mining techniques for the automatic discovering of useful temporal abstraction in reinforcement learning. This idea was motivated by the ability of data mining algorithms in automatic discovering of structures... more
The study of interictal transient events may substantially complement the analysis of seizures in the presurgical evaluation of intractable epilepsy. A comprehensive methodology of quantifying reproducibility of activation patterns in... more
An innovative knowledge-based methodology for terrorist detection by using Web traffic content as the audit information is presented. The proposed methodology learns the typical behavior ('profile') of terrorists by applying a... more
AbstractMost Data mining Algorithms and Tools when applied to Industrial problems such as CRM (Customer Relationship Management) are useful in pointing out Customers who are likely attritors and customers who are Loyal , but they require... more
The problem of finding a suitable dataset to test different data mining algorithms and techniques and specifically association rule mining for Market Basket Analysis is a big challenge. A lot of dataset generators have been implemented in... more
We consider the problem of resource allocation in mining multiple data streams. Due to the large volume and the high speed of streaming data, mining algorithms must cope with the effects of system overload. How to realize maximum mining... more
Data Mining techniques are commonly used to extract patterns, like association rules and decision trees, from huge volumes of data. The comparison of patterns is a fundamental issue, which can be exploited, among others, to synthetically... more
This paper introduces Cougar^2, an innovative open source Java framework and toolset that assists researchers in designing, developing, and using machine learning and data mining algorithms. The primary mission for Cougar^2 is to provide... more
In the remote sensing field, a frequently recurring question is: Which computational intelligence or data mining algorithms are most suitable for the retrieval of essential information given that most natural systems exhibit very high... more
Generalized association rule extraction is a powerful tool to discover a high level view of the interesting patterns hidden in the analyzed data. However, since the patterns are extracted at any level of abstraction, the mined rule set... more
Classification algorithms of data mining have been successfully applied in the recent years to predict cancer based on the gene expression data. Micro-array is a powerful diagnostic tool that can generate handful information of gene... more
To answer user queries, a data integration system employs a set of semantic mappings between the mediated schema and the schemas of data sources. In dynamic environments sources often undergo changes that invalidate the mappings. Hence,... more