References

A NEW METHOD FOR CLASSIFYING CHINESE TEXT BASED ON SEMANTIC TOPICS AND DENSITY PEAKS


[1] F. Sebastiani, Machine learning in automated text categorization, ACM Computing Surveys 34(1) (2002), 1-47.

[2] Jin-Shu Su, Bo-Feng Zhang and Xin Xu, Advances in machine learning based text categorization, Journal of Software 17(9) (2006), 1848-1859.

[3] Yewang Chen, Hua-Zhen Wang, Haibo Li, Binengm Zhong, Jin Gou and Duansheng Chen, A topic extraction method for Chinese web text based on BaiduBaike and text classification, Journal of Chinese Computer Systems 33(12) (2012), 2605-2610.

[4] Thomas Hofmann, Probabilistic latent semantic indexing, Proceedings of the Twenty-Second Annual, International SIGIR Conference on Research and Development in Information Retrieval (SIGIR-99), 1999.

[5] D. Blei, A. Ng and M. Jordan, Latent Dirichlet allocation, Journal of Machine Learning Research 3 (2003), 993-1022.

[6] Ge Xu and Hou-Feng Wang, The development of topic models in natural language processing, Chinese Journal of Computers 34(8) (2011), 1423-1436.

[7] C. Huang and H. Zhao, Which is essential for Chinese word segmentation character versus word, In Proceedings of the 20th Pacific Asia Conference on Language, Information and Computation (PACLIC20), (2006), 1-12.

[8] C. Huang and H. Zhao, Chinese word segmentation: A decade review, Journal of Chinese Information Processing 21(3) (2007), 8-18.

[9] Hua-Ping Zhang, Hong-Kui Yu, De-Yi Xiong and Qun Liu, HHMM-based Chinese Lexical Analyzer ICTCLAS, Second SIGHAN workshop affiliated with 41th ACL; Sapporo Japan, (2003), 184-187.

[10] Yun-Qing Xia, Kam-Fai Wong and Pu. Zhang, Toward anomalous and dynamic nature of the Chinese network chat language, Journal of Chinese Information Processing 21(3) (2007), 83-91.

[11] Shengli Song, Shaolong Wang and Ping Chen, Chinese text semantic representation for text classification, Journal of Xidian University 40(2) (2013), 89-97.

[12] Yu-Yan Jiang, Ping Li and Qing Wang, An improved labeled latent Dirichlet allocation model for multi-label classification, Journal of Nanjing University: Nat. Sci. Ed. 49(4) (2013), 425-432.

[13] Wen-Bo Li, Le Sun and Da-Kun Zhang, Text classification based on labeled-LDA model, Chinese Journal of Computers 31(4) (2008), 621-627.

[14] Shaohua Teng, Study on Chinese Short-Text Classification, Master Degree Thesis of Tsinghua University, 2009.

[15] Ronglu Li [OL]: http://download.csdn.net/detail/superyangtze/271055 9.

[16] Wikipedia [OL]: http://en.wikipedia.org/wiki/Suffix tree.

[17] Alex Rodriguez and Alessandro Laio, Clustering by fast search and find of density peaks, Science 344 (2014), 1492-1496.

[18] Fudan NLP [OL]: http://www.datatang.com/data/44082.

[19] SogouC [OL]: http://www.sogou.com/labs/dl/c.html.