[1] F. Sebastiani, Machine learning in automated text categorization,
ACM Computing Surveys 34(1) (2002), 1-47.
[2] Jin-Shu Su, Bo-Feng Zhang and Xin Xu, Advances in machine learning
based text categorization, Journal of Software 17(9) (2006),
1848-1859.
[3] Yewang Chen, Hua-Zhen Wang, Haibo Li, Binengm Zhong, Jin Gou and
Duansheng Chen, A topic extraction method for Chinese web text based
on BaiduBaike and text classification, Journal of Chinese Computer
Systems 33(12) (2012), 2605-2610.
[4] Thomas Hofmann, Probabilistic latent semantic indexing,
Proceedings of the Twenty-Second Annual, International SIGIR
Conference on Research and Development in Information Retrieval
(SIGIR-99), 1999.
[5] D. Blei, A. Ng and M. Jordan, Latent Dirichlet allocation, Journal
of Machine Learning Research 3 (2003), 993-1022.
[6] Ge Xu and Hou-Feng Wang, The development of topic models in
natural language processing, Chinese Journal of Computers 34(8)
(2011), 1423-1436.
[7] C. Huang and H. Zhao, Which is essential for Chinese word
segmentation character versus word, In Proceedings of the 20th Pacific
Asia Conference on Language, Information and Computation (PACLIC20),
(2006), 1-12.
[8] C. Huang and H. Zhao, Chinese word segmentation: A decade review,
Journal of Chinese Information Processing 21(3) (2007), 8-18.
[9] Hua-Ping Zhang, Hong-Kui Yu, De-Yi Xiong and Qun Liu, HHMM-based
Chinese Lexical Analyzer ICTCLAS, Second SIGHAN workshop affiliated
with 41th ACL; Sapporo Japan, (2003), 184-187.
[10] Yun-Qing Xia, Kam-Fai Wong and Pu. Zhang, Toward anomalous and
dynamic nature of the Chinese network chat language, Journal of
Chinese Information Processing 21(3) (2007), 83-91.
[11] Shengli Song, Shaolong Wang and Ping Chen, Chinese text semantic
representation for text classification, Journal of Xidian University
40(2) (2013), 89-97.
[12] Yu-Yan Jiang, Ping Li and Qing Wang, An improved labeled latent
Dirichlet allocation model for multi-label classification, Journal of
Nanjing University: Nat. Sci. Ed. 49(4) (2013), 425-432.
[13] Wen-Bo Li, Le Sun and Da-Kun Zhang, Text classification based on
labeled-LDA model, Chinese Journal of Computers 31(4) (2008),
621-627.
[14] Shaohua Teng, Study on Chinese Short-Text Classification, Master
Degree Thesis of Tsinghua University, 2009.
[15] Ronglu Li [OL]: http://download.csdn.net/detail/superyangtze/271055
9.
[16] Wikipedia [OL]: http://en.wikipedia.org/wiki/Suffix
tree.
[17] Alex Rodriguez and Alessandro Laio, Clustering by fast search and
find of density peaks, Science 344 (2014), 1492-1496.
[18] Fudan NLP [OL]: http://www.datatang.com/data/44082.
[19] SogouC [OL]: http://www.sogou.com/labs/dl/c.html.