Volume no :7, Issue no: 1, September and December (2017)

NAÏVE BAYES CLASSIFICATION USING KERNEL INDEPENDENT COMPONENT ANALYSIS

Author's: Abdullah Bashir Musa and Fuad Abedalrazeq Mussallum
Pages: [39] - [55]
Received Date: July 17, 2017; Revised October 5, 2017
Submitted by:
DOI: http://dx.doi.org/10.18642/ijamml_7100121858

Abstract

Naïve Bayes is one of the most well-known efficient learning algorithms. It has been used extensively in a number of different areas of data mining and machine learning. It is known to have outstanding classification performance, which competitive with modern methods such as support vector machines (SVM). Naïve Bayes classifier is used for building on independent features. Although the assumption of independence on which naïve Bayes is based is rarely satisfied in practice, these features can be modified to make the independent assumption hold, which result in improving naïve Bayes classifier. A number of current techniques have been used for that modification. Typically these methods find a set of uncorrelated/independent components of the features. Kernel independent component analysis (KICA) is a popular example of such methods. This paper proposes an application of kernel independent component analysis (KICA) with naïve Bayes for classification. Kernel independent component analysis (KICA) is a kernel version of independent component analysis (ICA). Applying KICA yields new reduced independent features space on which the naïve Bayes classifier can be built. For assessing the classification performance of this technique, this method is compared with independent component analysis (ICA) and kernel principle component analysis (KPCA) as a kernel approach. A number of performance measures have been used in the comparison; accuracy, sensitivity, specificity, and precision, F-score, the area under receiver operating characteristic curve (AUC) and the receiver operating characteristic (ROC) analysis.

Keywords

naïve Bayes, classification, kernel independent component analysis (KICA), independent component analysis (ICA), kernel principle component analysis (KPCA).