计算机应用 ›› 2017, Vol. 37 ›› Issue (11): 3039-3047.DOI: 10.11772/j.issn.1001-9081.2017.11.3039

• 第十六届中国机器学习会议(CCML 2017) •    下一篇

大数据下的典型机器学习平台综述

焦嘉烽, 李云   

  1. 南京邮电大学 计算机学院, 软件学院, 网络空间安全学院, 南京 210003
  • 收稿日期:2017-05-16 修回日期:2017-07-21 出版日期:2017-11-10 发布日期:2017-11-11
  • 通讯作者: 李云
  • 作者简介:焦嘉烽(1991-),男,江苏江阴人,硕士研究生,主要研究方向:大规模机器学习;李云(1974-),男,安徽望江人,教授,博士生导师,博士,主要研究方向:机器学习、特征工程。
  • 基金资助:
    国家自然科学基金资助项目(61603197);江苏省自然科学基金资助项目(BK20140885)。

Review of typical machine learning platforms for big data

JIAO Jiafeng, LI Yun   

  1. School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing Jiangsu 210003, China
  • Received:2017-05-16 Revised:2017-07-21 Online:2017-11-10 Published:2017-11-11
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61603197), the Natural Science Foundation of Jiangsu Province (BK20140885).

摘要: 由于大数据海量、复杂多样、变化快,传统的机器学习平台已不再适用,因此,设计一个高效的、通用的大数据机器学习平台成为目前的研究热点。通过介绍和分析机器学习算法的特点以及大规模机器学习的数据和模型并行化,引出常见的并行计算模型。简单介绍了整体同步并行模型(BSP)、SSP并行计算模型以及BSP、SSP模型与AP模型的区别,主要介绍了基于这些并行模型的典型的机器学习平台和这些平台的优缺点,并指出各个平台最适合处理何种大数据问题。最后从采用的抽象数据结构、并行计算模型、容错机制等方面对典型的机器学习平台进行了总结,并提出一些建议和展望。

关键词: 大数据, 机器学习平台, 并行计算模型, 参数服务器

Abstract: Due to the volume, complex and fast-changing characteristics of big data, traditional machine learning platforms are not applicable. Therefore, designing an efficient and general machine learning platform for big data has become an important research issue. By introducing and analyzing the characteristics of machine learning algorithms and the data and model parallelization for large-scale machine learning, some common parallel computing models were presented. Bulk Synchronous Parallel (BSP), Stale Synchronous Parallel (SSP) computing models and the differences between BSP, SSP, and Asynchronous Parallel model (AP) were introduced. Then the typical machine learning platforms based on these parallel models and the advantages and disadvantages of these platforms were mainly introduced, and what kind of big data each typical machine learning platform was best suited for was pointed out. Finally, the typical machine learning platforms were summarized from the aspects of abstract data structure, parallel computing model and fault tolerance mechanism. Some suggestions and prospects were put forward.

Key words: big data, machine learning platform, parallel computing model, parameter server

中图分类号: