Linear kernel support vector machine based on dual random projection

Journal of Computer Applications, 2017, Vol. 37, Issue (6): 1680-1685. DOI: 10.11772/j.issn.1001-9081.2017.06.1680

XI Xi, ZHANG Fengqin, LI Xiaoqing, GUAN Hua, CHEN Guirong, WANG Mengfei

Information and Navigation College, Air Force Engineering University, Xi'an Shaanxi 710077, China

Received: 2016-11-10 Revised: 2016-12-29 Online: 2017-06-14 Published: 2017-06-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China (71503260) and the Natural Science Foundation of Shaanxi Province (2014JM8345).

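Corresponding author: XI Xi (席茜)
About the authors: XI Xi (席茜), born in 1993, female, from Xinjiang County, Shanxi, M.S. candidate, CCF member; her research interests include data mining and machine learning. ZHANG Fengqin (张凤琴), born in 1964, female, from Ruicheng, Shanxi, associate professor, M.S., CCF member; her research interests include data mining, complex networks and distributed databases. LI Xiaoqing (李小青), born in 1982, female, from Jingyang, Shaanxi, lecturer, Ph.D.; her research interests include intelligent data processing. GUAN Hua (管桦), born in 1963, male, from Xiaogan, Hubei, professor, M.S.; his research interests include command automation. CHEN Guirong (陈桂茸), born in 1970, female, from Heyang, Shaanxi, lecturer, M.S.; her research interests include complex networks. WANG Mengfei (王梦非), born in 1992, male, from Jinan, Shandong, M.S. candidate; his research interests include complex networks and machine learning.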

Abstract

Abstract: To address the loss of classification accuracy that large-scale Support Vector Machines (SVM) suffer after random-projection-based feature dimensionality reduction, a linear kernel SVM based on dual random projection (drp-LSVM) was proposed for large-scale classification problems by introducing dual recovery theory. Firstly, the relevant geometric properties of drp-LSVM were analyzed and proved. It was shown that, while retaining geometric advantages similar to those of the linear kernel SVM based on random projection (rp-LSVM), the separating hyperplane of drp-LSVM is closer to that of the original classifier trained on the complete data. Then, to solve drp-LSVM quickly, the traditional Sequential Minimal Optimization (SMO) algorithm was improved, and a drp-LSVM classifier based on the improved SMO algorithm was designed. Finally, the experimental results show that drp-LSVM inherits the advantages of rp-LSVM, reduces classification error, improves training accuracy, and brings all performance indexes closer to those of the classifier trained on the original data; the classifier based on the improved SMO algorithm reduces memory consumption while achieving high training accuracy.

Key words: machine learning, Support Vector Machine (SVM), random projection, Sequential Minimal Optimization (SMO) algorithm, dimensionality reduction
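
The dual recovery idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the solver below is a plain dual coordinate descent for a bias-free L1-loss linear SVM, swapped in for the paper's improved SMO algorithm, and the function names, the Gaussian projection matrix and the parameters `m`, `C` and `epochs` are all assumptions made for illustration. The sketch solves the dual problem on the randomly projected data, then builds two weight vectors from the same dual variables: one mapped back through the projection (the rp-style classifier) and one recovered directly from the original features (the drp-style classifier).

```python
import numpy as np


def dual_coordinate_descent(X, y, C=1.0, epochs=50, seed=0):
    """Solve the dual of a bias-free L1-loss linear SVM; return the dual variables."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    alpha = np.zeros(n)
    w = np.zeros(d)  # kept equal to sum_i alpha_i * y_i * x_i
    sq_norms = np.einsum("ij,ij->i", X, X)
    for _ in range(epochs):
        for i in rng.permutation(n):
            if sq_norms[i] == 0.0:
                continue  # an all-zero sample cannot change w
            grad = y[i] * (X[i] @ w) - 1.0
            # One-variable Newton step, clipped to the box [0, C].
            new_alpha = min(max(alpha[i] - grad / sq_norms[i], 0.0), C)
            w += (new_alpha - alpha[i]) * y[i] * X[i]
            alpha[i] = new_alpha
    return alpha


def drp_linear_svm(X, y, m, C=1.0, seed=0):
    """Train on projected data, then recover weights in the original space."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    # Hypothetical choice: Gaussian random projection scaled by 1/sqrt(m).
    R = rng.standard_normal((d, m)) / np.sqrt(m)
    X_low = X @ R
    alpha = dual_coordinate_descent(X_low, y, C=C, seed=seed)
    # rp-style classifier: low-dimensional solution mapped back through R.
    w_rp = R @ ((alpha * y) @ X_low)
    # drp-style classifier: dual variables combined with the original features.
    w_drp = (alpha * y) @ X
    return w_rp, w_drp, alpha, R
```

Calling `drp_linear_svm(X, y, m=10)` on an `n × d` feature matrix `X` with labels `y` in `{-1, +1}` returns both weight vectors, so `np.sign(X @ w_rp)` and `np.sign(X @ w_drp)` can be compared against a classifier trained on the full data. The two vectors differ only by the factor `R @ R.T`, which is where the projection error of the rp-style classifier enters.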

CLC Number: TP181

XI Xi, ZHANG Fengqin, LI Xiaoqing, GUAN Hua, CHEN Guirong, WANG Mengfei. Linear kernel support vector machine based on dual random projection[J]. Journal of Computer Applications, 2017, 37(6): 1680-1685.
