Feature Selection
10,373 Followers
Recent papers in Feature Selection
Face expression recognition technology is one of the most recently developed fields in machine learning and has profoundly helped its users through forensic, security, and biometric applications. Many researchers and program developers... more
In our previous study we have shown that identification of bacteria species with the use of Fresnel diffraction patterns is possible with high accuracy and at low cost. Fresnel diffraction patterns were recorded with the optical system... more
Electronic Commerce is one of the most significant fields in internet appli- cations. With the focus moving from B2C-commerce to B2B-commerce, in- tegrating internet and legacy systems within one company and getting technologies used in... more
Reuters Corpus Volume I (RCV1) is an archive of over 800,000 manually categorized newswire stories recently made available by Reuters, Ltd. for research purposes. Use of this data for research on text categorization requires a detailed... more
Each complex network (or class of networks) presents specific topological features which characterize its connectivity and highly influence the dynamics of processes executed on the network. The analysis, discrimination, and synthesis of... more
Correct prediction of the translation initiation site (TIS) is an important issue in genomic research. We show that feature generation together with correlation based feature selection can be used with a variety of machine learning... more
We propose a supervised approach to detect falls in home environment using an optimised descriptor adapted to real-time tasks. We introduce a realistic dataset of 191 videos, a new metric allowing to evaluate fall detection performance in... more
Increase in the amount of information on the Web has caused the need for accurate automated classifiers for Web pages to maintain Web directories and to increase search engines’ performance. As every (HTML/XML) tag and every term on each... more
Co-clustering has been defined as a way to organize simultaneously subsets of instances and subsets of features in order to improve the clustering of both of them. In previous work [1], we proposed an efficient co-similarity measure... more
Over the past few years, there has been a considerable spread of microarray technology in many biological patterns, particularly in those pertaining to cancer diseases like leukemia, prostate, colon cancer, etc. The primary bottleneck... more
The application of soft computing in decision support system in disease prediction is one of the emerging interdisciplinary research areas in the field of computer science. Machine learning algorithms plays an important role in risk... more
Feature selection is the task of choosing a small set out of a given set of features that capture the relevant properties of the data. In the context of supervised classification problems the relevance is determined by the given labels on... more
Abstract In this paper, we introduce a new approach to learn dissimilarity for interactive search in content based image retrieval. In literature, dissimilarity is often learned via the feature space by feature selection, feature... more
Our previous works suggest that fractal texture feature is useful to detect pediatric brain tumor in multimodal MRI. In this study, we systematically investigate efficacy of using several different image features such as intensity,... more
Telemarketers of online job advertising firms face significant challenges understanding the advertising demands of small-sized enterprises. The effective use of data mining approach can offer e-recruitment companies an improved... more
An investigation into the feature extraction and selection of infant cry with asphyxia is presented in this paper. The feature of the cry signal was extracted using mel frequency cepstrum coefficient (MFCC) analysis and the significant... more
Women who have recovered from breast cancer (BC) always fear its recurrence. The fact that they have endured the painstaking treatment makes recurrence their greatest fear. However, with current advancements in technology, early... more
Dimensionality refers to number of terms in a web page. While classifying web pages high dimensionality of web pages causes problem. The main objective of reducing dimensionality of web pages is improving the performance of classifier.... more
We introduced a novel method employing a hierarchical domain ontology structure to extract features representing documents in our previous publication (Wang 2002). All raw words in the training documents are mapped to concepts in a... more
Pattern recognition generally requires that objects be described in terms of a set of measurable features. The selection and quality of the features representing each pattern have a considerable bearing on the success of subsequent... more
An intense research around classifier fusion in recent years revealed that combining performance strongly depends on careful selection of classifiers to be combined. Classifier performance depends, in turn, on careful selection of... more
This paper studies the total variation regularization model with an L 1 fidelity term (TV-L 1 ) for decomposing an image into features of different scales. We first show that the images produced by this model can be formed from the... more
We compared four automated methods for hippocampal segmentation using different machine learning algorithms (1) hierarchical AdaBoost, (2) Support Vector Machines (SVM) with manual feature selection, (3) hierarchical SVM with automated... more
As the Internet grows at a phenomenal rate email systems has become a widely used electronic form of communication. Everyday, a large number of people exchange messages in this fast and inexpensive way. With the excitement on electronic... more
Genetic algorithms have been created as an optimization strategy to be used especially when complex response surfaces do not allow the use of better-known methods (simplex, experimental design techniques, etc.). This paper shows that... more
Available in: http://www.redalyc.org/src/inicio/ArtPdfRed.jsp?iCve=43019328016 ... Redalyc Scientific Information System Network of Scientific Journals from Latin America, the Caribbean, Spain and ... A harmony search algorithm for... more
Extractive speech summarization, which purports to select an indicative set of sentences from a spoken document so as to succinctly represent the most important aspects of the document, has garnered much research over the years. In this... more
Robustness of feature selection techniques is a topic of recent interest, especially in high dimensional domains with small sample sizes, where selected feature subsets are subsequently analysed by domain experts to gain more insight into... more
Many scientific problems can be formulated as classification tasks. Data that harbor relevant information are usually described by a large number of features. Frequently, many of these features are irrelevant for the class prediction. The... more
In this study, we propose a novel method for medical problem, it is the integration of particle swarm optimization (PSO) and decision tree (C4.5) named PSO + C4.5 algorithm. To evaluate the effectiveness of PSO + C4.5 algorithm, it is... more
Leucocytes in the blood of mammals form a powerful protective system against a wide range of dangerous pathogens. There are several types of immune cells that has specific role in the whole immune system. The number and type of immune... more
An automated approach to degradation analysis is proposed that uses a rotating machine's acoustic signal to determine Remaining Useful Life (RUL). High resolution spectral features are extracted from the acoustic data collected over the... more
Recently a large number of studies has been published in the area of Fractal Analysis. In this paper we review briefly the IFS (Iterated Function System) theory, and we show how this theoretical tool leads to new applications in Pattern... more
Identifying an appropriate set of predictors for the outcome of interest is a major challenge in clinical prediction research. The aim of this study is to show the application of some variable selection methods, usually used in... more
LASSO (Least Absolute Shrinkage and Selection Operator) is a useful tool to achieve the shrinkage and variable selection simultaneously. Since LASSO uses the L 1 penalty, the optimization should rely on the quadratic program (QP) or... more
We present MEDUSA, an integrative method for learning motif models of transcription factor binding sites by incorporating promoter sequence and gene expression data. We use a modern large-margin machine learning approach, based on... more
Contemporary biological technologies produce extremely high-dimensional data sets from which to design classifiers, with 20,000 or more potential features being common place. In addition, sample sizes tend to be small. In such settings,... more
This paper considers a class of hybrid (heterogeneous) ensembles purely composed of symbolic elements. In learning diagnostic rules from gene expressions they demonstrate a significant improvement of accuracy with a small number of... more
Recent work on extracting features of gaps in handwritten text allows a classification into inter-word and intraword classes using suitable classification techniques. In this paper, we apply 5 different supervised classification... more
ABSTRACT Relief algorithms are successful attribute estimators. They are able to detect conditional dependencies between attributes and provide a unified view on the attribute estimation. In this paper, we propose a variant of ReliefF... more
With the enabling Information Technology, disseminating every kind of information through the Internet became an enforcing condition for information providers. Now, it is more than being fashion and fun. Serving information on the... more
Nonnegative matrix factorization (NMF), a popular part-based representation technique, does not capture the intrinsic local geometric structure of the data space. Graph regularized NMF (GNMF) was recently proposed to avoid this limitation... more
This paper targets the automatic detection of personality traits in a meeting environment by means of audio and visual features; information about the relational context is captured by means of acoustic features designed to that purpose.... more