Study on Kazak text categorization based on SVM

Journal of Computer Applications ›› 2010, Vol. 30 ›› Issue (06): 1676-1678.

• Software process technology & Chinese information processing • Previous Articles Next Articles

Study on Kazak text categorization based on SVM

Received:2009-12-11 Revised:2010-04-02 Online:2010-06-01 Published:2010-06-01

基于SVM的哈萨克语文本分类

王花¹,古丽拉·阿东别克²,吴守用³

1. 新疆大学信息科学与工程学院
2.
3. 新疆大学信息科学与工程学院

通讯作者: 王花
基金资助:
国家自然科学基金资助项目

Abstract

Abstract: This paper introduced the basic theory of the Support Vector Machine (SVM) and k-Nearest Neighbor (kNN) algorithm and two different features selection methods in Kazak natural language. An empirical study of using the SVM, kNN, Bayes algorithm to categorize the Kazak text was conducted. The experimental results show that compared with kNN, Bayes, SVM has better categorization of the Kazak text. Due to the characteristics of Kazak's morpheme and configuration, the precision and recall will be lowered if the word is cut with affix.

Key words: Kazak text categorization, SVM, featrur selection, KNN

摘要： 介绍了支持向量机(SVM)和k-最近邻法(kNN)分类算法的思想和两种哈萨克语特征提取方法。对SVM、kNN和Bayes算法在哈萨克语文本分类的实验进行了比较。实验结果表明:在处理哈萨克语文本分类问题上,SVM较kNN和Bayes有较好的分类效果。由于哈萨克文单词的语素和构形的特点,若对哈萨克语词缀进行切分,则会降低文本分类的准确率和查全率。

关键词: 哈萨克语文本分类, SVM, 特征选择, KNN

王花古丽拉·阿东别克吴守用. 基于SVM的哈萨克语文本分类[J]. 计算机应用, 2010, 30(06): 1676-1678.

[1]	Min SUN, Qian CHENG, Xining DING. CBAM-CGRU-SVM based malware detection method for Android [J]. Journal of Computer Applications, 2024, 44(5): 1539-1545.
[2]	Chenghao YANG, Jie HU, Hongjun WANG, Bo PENG. Incomplete multi-view clustering algorithm based on attention mechanism [J]. Journal of Computer Applications, 2024, 44(12): 3784-3789.
[3]	Enbao QIAO, Xiangyang GAO, Jun CHENG. Self-recovery adaptive Monte Carlo localization algorithm based on support vector machine [J]. Journal of Computer Applications, 2024, 44(10): 3246-3251.
[4]	Xueyu HUANG, Huaiyu HE, Huimin LIN, Jinshui CHEN. Classification and recognition method of copper alloy metallograph based on feature aggregation [J]. Journal of Computer Applications, 2023, 43(8): 2593-2601.
[5]	Haiyong ZHANG, Xianjin FANG, Enwan ZHANG, Baoyu LI, Chao PENG, Jianxiang MU. Fingerprint positioning method based on measurement report signal clustering [J]. Journal of Computer Applications, 2023, 43(12): 3947-3954.
[6]	Xiangxi WEN, Yating PENG, Kexin BI, Yuming HENG, Minggong WU. Situation prediction of flight conflict network based on online fuzzy least squares support vector machine with optimal training set [J]. Journal of Computer Applications, 2023, 43(11): 3632-3640.
[7]	Zhonghua ZHANG, Fuyuan ZHAO, Junfeng GUO, Gaochang ZHAO. Integrated prediction model of Cauchy adaptive backtracking search and least square support vector machine [J]. Journal of Computer Applications, 2022, 42(6): 1829-1836.
[8]	Lei YANG, Hongdong ZHAO, Kuaikuai YU. End-to-end speech emotion recognition based on multi-head attention [J]. Journal of Computer Applications, 2022, 42(6): 1869-1875.
[9]	Zhen QU, Kunting LI, Zhixi FENG. Remote sensing image scene classification based on effective channel attention [J]. Journal of Computer Applications, 2022, 42(5): 1431-1439.
[10]	Guifang QIAO, Shouming HOU, Yanyan LIU. Facial expression recognition algorithm based on combination of improved convolutional neural network and support vector machine [J]. Journal of Computer Applications, 2022, 42(4): 1253-1259.
[11]	Yunzhi QIU, Tinghua WANG, Xiaolu DAI. Doubly feature-weighted fuzzy support vector machine [J]. Journal of Computer Applications, 2022, 42(3): 683-687.
[12]	Xiangzhou QI, Hongjie XING. Centered kernel alignment based multiple kernel one-class support vector machine [J]. Journal of Computer Applications, 2022, 42(2): 349-356.
[13]	Wang TAN, Yi LI. Synthesis of loop bound functions for loop programs [J]. Journal of Computer Applications, 2022, 42(2): 565-573.
[14]	Qian GE, Guangbin ZHANG, Xiaofeng ZHANG. Automatic feature selection algorithm based on interaction of ReliefF with maximum information coefficient and SVM [J]. Journal of Computer Applications, 2022, 42(10): 3046-3053.
[15]	Hongfei JIA, Xi LIU, Yu WANG, Hongbing XIAO, Suxia XING. Application of 3DPCANet in image classification of functional magnetic resonance imaging for Alzheimer’s disease [J]. Journal of Computer Applications, 2022, 42(1): 310-315.

Study on Kazak text categorization based on SVM

基于SVM的哈萨克语文本分类

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics