Journal of Computer Applications ›› 2022, Vol. 42 ›› Issue (3): 676-682. DOI: 10.11772/j.issn.1001-9081.2021040869

• 2021 CCF Conference on Artificial Intelligence (CCFAI 2021) •


Online kernel regression based on random sketching method

Qinghua LIU, Shizhong LIAO

  1. College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
  • Received:2021-05-25 Revised:2021-05-27 Accepted:2021-05-28 Online:2021-11-09 Published:2022-03-10
  • Contact: Shizhong LIAO
  • About author: LIU Qinghua, born in 1997, M. S. candidate. Her research interests include online learning and sketching methods.
  • Supported by:
    National Natural Science Foundation of China(62076181)


Abstract:

In online kernel regression learning, the inverse of the kernel matrix must be recomputed whenever a new sample arrives, so the computational complexity is at least quadratic in the number of rounds. The idea of applying a sketching method to hypothesis updating was introduced, and a more efficient online kernel regression algorithm via sketching was proposed. Firstly, with the loss function set to the squared loss, a new gradient descent algorithm, called FTL-Online Kernel Regression (F-OKR), was proposed by approximating the kernel with the Nyström method and adopting the idea of Follow-The-Leader (FTL). Then, a sketching method was applied to accelerate F-OKR, reducing its computational complexity to linear in the number of rounds and the sketch size, and quadratic in the data dimension. Finally, an efficient Sketched Online Kernel Regression (SOKR) algorithm was designed. Compared with F-OKR, SOKR suffers almost no loss in accuracy while reducing the runtime by about 16.7% on suitable datasets. Sub-linear regret bounds of both algorithms were proved theoretically, and experimental results on standard regression datasets also verify that the proposed algorithms outperform the Nyström Online Gradient Descent (NOGD) algorithm, with the average loss reduced by about 64%.

Key words: online learning, sketching method, regret analysis, regression, kernel method

CLC number: