[1] Harry Zhang, The optimality of naive Bayes, AA 1(2) (2004), 3.
[2] Zengchang Qin, Naive Bayes classification given probability
estimation trees, Machine Learning and Applications 2006, ICMLA’06,
5th International Conference on. IEEE, 2006.
[3] Andrew McCallum and Kamal Nigam, A comparison of event models for
naive Bayes text classification, AAAI-98 Workshop on Learning for Text
Categorization 752 (1998).
[4] Pedro Domingos and Michael Pazzani, On the optimality of the
simple Bayesian classifier under zero-one loss, Machine Learning
29(2-3) (1997), 103-130.
[5] Marco Bressan and Jordi Vitria, On the selection and
classification of independent features, Pattern Analysis and Machine
Intelligence, IEEE Transactions on 25(10) (2003), 1312-1317.
[6] David D. Lewis, Naïve (Bayes) at forty: The independence
assumption in information retrieval, Machine Learning: ECML-98,
Springer Berlin Heidelberg (1998), 4-15.
[7] Nir Friedman, Dan Geiger and Moises Goldszmidt, Bayesian network
classifier, Machine Learning 29 (1997), 131-163.
[8] Mehran Sahami, Learning Limited Dependence Bayesian Classifier,
KDD 96 (1996).
[9] Pat Langley, Wayne Iba and Kevin Thompson, An analysis of Bayesian
classifier, AAAI 90 (1992).
[10] Fan Liwei, Independent component analysis for naive Bayes
classification, Diss. 2010.
[11] A. Hyvärinen, J. Karhunen and E. Oja, Independent Component
Analysis, Wiley, New York, 2001.
[12] A. Hyvarinen and E. Oja, Independent component analysis:
Algorithms and applications, Neural Net. 13 (2000), 411-430.
[13] A. Hyvarinen and E. Oja, A fast fixed-point algorithm for
independent component analysis, Neural Computation 9(7) (1997),
483-1492.
[14] Richard O. Duda and Peter E. Hart, Pattern Classification and
Scene Analysis, Vol. 3, Wiley, New York, 1973.
[15] David D. Wis, Evaluating and optimizing autonomous text
classification systems, Proceedings of the 18th annual international
ACM SIGIR conference on Research and development in information
retrieval, ACM 1995.
[16] Hao Shen, Stefanie Jegelka and Arthur Gretton, Fast kernel ICA
using anapproximate newton method, International Conference on
Artificial Intelligence and Statistics, 2007.
[17] J. Demsar, Statistical comparisons of classifier over multiple
data sets, J. Mach. Learn. Res. 7 (2006), 1-3.
[18] Laurens van der Maaten, Statistical Pattern Recognition Toolbox
for Matlab (stprtool) version 2.11, version 0.7.2b (2010).
[19] Hugo Gavert, Jarmo Hurri, Jaakko Sarela and Aapo Hyvarinen, Fast
ICA for Matlab 7.x and 6.x, Version 2.5 (2005).
[20] A. P. Bradley, The use of the area under the ROC curve in the
evaluation of machine learning algorithms, Pattern Recognition 30
(1997), 1145-1159.
[21] Xindong Wu et al., Top 10 algorithms in data mining, Knowledge
and Information Systems 14(1) (2008), 1-37.
[22] Xindong Wu, and Vipin Kumar, eds, The Top Ten Algorithms in Data
Mining, CRC Press, 2009.
[23] Lee H. Dicker, and Sihai D. Zhao, High-dimensional classification
via nonparametric empirical Bayes and maximum likelihood inference,
Biometrika (2016), asv067.
[24] David J. Hand, and Keming Yu, Idiot’s Bayes-not so stupid after
all?. International Statistical Review 69(3) (2001), 385-398.
[25] Abdallah Bashir Musa, A comparison of ℓ1-regularizion, PCA,
KPCA and ICA for dimensionality reduction in logistic regression,
International Journal of Machine Learning and Cybernetics 5(6) (2014),
861-873.
[26] Abdallah Bashir Musa, Logistic regression classification for
uncertain data, Research Journal of Mathematical and Statistical
Sciences-ISSN 2320 (2014), 6047.
[27] Abdallah Bashir Musa, Gene expression data classification with
kernel independent component analysis, Research Journal of
Mathematical and Statistical Sciences ISSN 2320: 6047.
[28] Xin Jin et al., Kernel independent component analysis for gene
expression data clustering, Independent Component Analysis and Blind
Signal Separation, Springer, Berlin Heidelberg (2006), 454-461.
[29] R. Francis Bach and Michael I. Jordan, Kernel independent
component analysis, The Journal of Machine Learning Research 3 (2003),
1-48.
[30] Abdallah Bashir Musa, Comparative study on classification
performance between support vector machine and logistic regression,
International Journal of Machine Learning and Cybernetics 4(1) (2013),
13-24.
[31] J. Q. Gao, L. Y. Fan, L. Li et al., A practical application of
kernel based fuzzy discriminant analysis, International Journal of
Applied Mathematics and Computer Science 23(4) (2013), 887-903.
[32] J. Gao and L. Fan, Kernel-based weighted discriminant analysis
with QR decomposition and its application face recognition, WSEAS
Transactions on Mathematics 10(10) (2011), 358-367.