计算机应用 ›› 2018, Vol. 38 ›› Issue (5): 1404-1409.DOI: 10.11772/j.issn.1001-9081.2017102861

• 先进计算 • 上一篇    下一篇

多种存储环境下压缩数据库的缓存优化

张佳辰, 刘晓光, 王刚   

  1. 南开大学 计算机与控制工程学院, 天津 300350
  • 收稿日期:2017-12-06 修回日期:2017-12-07 出版日期:2018-05-10 发布日期:2018-05-24
  • 通讯作者: 刘晓光
  • 作者简介:张佳辰(1994-),男,河北乐亭人,硕士研究生,主要研究方向:存储虚拟化;刘晓光(1974-),男,河北安国人,教授,博士,CCF高级会员,主要研究方向:分布式系统、搜索引擎;王刚(1974-),男,北京人,教授,博士,CCF会员,主要研究方向:云存储、搜索引擎。

Cache optimization for compressed databases in various storage environments

ZHANG Jiachen, LIU Xiaoguang, WANG Gang   

  1. College of Computer and Control Engineering, Nankai University, Tianjin 300350, China
  • Received:2017-12-06 Revised:2017-12-07 Online:2018-05-10 Published:2018-05-24
  • Contact: 刘晓光

摘要: 近年来,各行业数据量增速提升,对承担数据存储任务的数据库系统进行性能优化的需求也越来越强烈。利用关系型数据库I/O密集型、服务器CPU相对空闲的特点,在数据库中引入数据压缩技术,节省了数据存储空间和I/O传输带宽。但当今主流数据库系统的压缩技术都是针对传统的存储和运行环境设计,并未考虑固态硬盘(SSD)等新型存储设备和云数据库等虚拟化运行环境对系统性能的影响,因此,以数据库压缩系统在不同存储环境的缓存优化作为切入点,对系统整体性能的影响进行分析,给出了数据库压缩系统性能的分析模型,并以MySQL为例进行具体分析,给出了对应的缓存优化措施。在内核虚拟机(KVM)和MySQL数据库测试平台上的性能评估结果表明,所提出的优化方法使得系统性能最高有超过40%的提升,在某些配置下获得了优于物理机的性能。

关键词: 数据库, 数据压缩, 虚拟化, 磁盘I/O, 固态硬盘

Abstract: In recent years, the amount of data in various industries grows rapidly, which results in the increasing of optimization demands in database storage system. Relational databases are I/O-intensive, take use of relatively free CPU time, data compression technology could save data storage space and I/O bandwidth. However, the compression features of current database systems were designed for traditional storage and computing environments, without considering the impact of virtualized environments or the use of Solid State Drive (SSD) on system performance. To optimize the cache performance of database compression system, a database compression system performance model was proposed, and the impact on the I/O performance of various system environments was analyzed. Take the open source database MySQL as an example, the corresponding cache optimization methods were given based on analysis. Evaluation results on Kernel-based Virtual Machine (KVM) and MySQL database show that the optimized version has an increase of more than 40% in performance under some configurations, even close to superior physical machine performance.

Key words: database, data compression, virtualization, disk I/O, Solid State Drive (SSD)

中图分类号: