Author's: Mi Zhou
Pages: [51] - [78]
Received Date: January 21, 2016
Submitted by:
DOI: http://dx.doi.org/10.18642/jmsaa_7100121627
Fisher score and genetic algorithm are widely used for feature
selection. However, some redundant features will be selected by Fisher
score, and the convergence properties may be worse if the initial
population of genetic algorithm is generated by a random manner. To
improve the performance of feature selection by Fisher score and
genetic algorithm, we propose a hybrid feature selection method, which
merging the advantages of Fisher score and genetic algorithm together.
It aims at utilizing the features\' Fisher score to generate the
initial population of genetic algorithm. To begin with, the Fisher
scores of all the features will be mapped into a specific interval by
a linear function, and then the rescaled Fisher scores will be
utilized to generate the initial population of genetic algorithm.
Finally, the initial population will be used in the subsequent
procedure of genetic algorithm to perform feature selection with
elitist strategy for reference. In this paper, we choose four data
sets of Sonar, WDBC, Arrhythmia, and Hepatitis to test the performance
of our algorithm. Feature subsets of the four data sets will be
selected by our algorithm, and then the dimensionality of data sets
will be reduced according to the selected feature subsets,
respectively. 1-NN classifier is used to classify the dimensionality
reduced data sets, and respectively, achieving the classification
accuracy of with ten-fold cross validation method. The
experiment results show that, compared to the performance of Fisher
score, genetic algorithm and Fisher score genetic algorithm, our
algorithm is fit for eliminate redundant features, and can select
discriminative features. Above all, our method is effective in feature
selection.
feature selection, Fisher score, genetic algorithm, elitist strategy.