Journal of Computer Applications ›› 2022, Vol. 42 ›› Issue (4): 1244-1252.DOI: 10.11772/j.issn.1001-9081.2021111917

• The 36 CCF National Conference of Computer Applications (CCF NCCA 2020) • Previous Articles    

Fast failure recovery method based on local redundant hybrid code

Jingyu LIU, Qiuxia NIU, Xiaoyan LI, Qiaoshuo SHI, Youxi WU()   

  1. College of Artificial Intelligence,Hebei University of Technology,Tianjin 300401,China
  • Received:2021-11-12 Revised:2021-12-24 Accepted:2021-12-24 Online:2021-12-31 Published:2022-04-10
  • Contact: Youxi WU
  • About author:LIU Jingyu, born in 1976, Ph. D., associate professor. His research interests include network storage, information security.
    NIU Qiuxia, born in 1992, M. S. candidate. Her research interests include network storage.
    LI Xiaoyan, born in 1996, M. S. candidate. Her research interests include network storage.
    SHI Qiaoshuo, born in 1974, Ph. D., associate professor. Her research interests include machine learning.

基于局部冗余混合编码的故障快速恢复方法

刘靖宇, 牛秋霞, 李萧言, 史巧硕, 武优西()   

  1. 河北工业大学 人工智能与数据科学学院,天津 300401
  • 通讯作者: 武优西
  • 作者简介:刘靖宇(1976—),男,天津人,副教授,博士,CCF会员,主要研究方向:网络存储、信息安全
    牛秋霞(1992—),女,河北邯郸人,硕士研究生,CCF会员,主要研究方向:网络存储
    李萧言(1996—),女,山西阳泉人,硕士研究生,CCF会员,主要研究方向:网络存储
    史巧硕(1974—),女,天津人,副教授,博士,CCF会员,主要研究方向:机器学习

Abstract:

The parity blocks of the Maximum-Distance-Separable (MDS) code are all global parity blocks. The length of the reconstruction chain increases with the expansion of the storage system, and the reconstruction performance gradually decreases. Aiming at the above problems, a new type of Non-Maximum-Distance-Separable (Non-MDS) code called local redundant hybrid code Code-LM(sc) was proposed. Firstly, two types of local parity blocks called horizontal parity block in the strip-set and horizontal-diagonal parity block were added in any strip-sets to reduce the length of the reconstruction chain, and the parity layout of the local redundant hybrid code was designed. Then, four reconstruction formulations of the lost data blocks were designed according to the generation rules of the parity blocks and the common block existed in the reconstruction chains of different data blocks. Finally, double-disk failures were divided into three situations depending on the distances of the strip-sets where the failed disks located and the corresponding reconstruction methods were designed. Theoretical analysis and experimental results show that with the same storage scale, compared with RDP (Row-Diagonal Parity), the reconstruction time of CodeM(sc) for single-disk failure and double-disk failure can be reduced by 84% and 77% respectively; compared with V2-Code, the reconstruction time of Code-LM(sc) for single-disk failure and double-disk failure can be reduced by 67% and 73% respectively. Therefore, local redundant hybrid code can support fast recovery from failed disks and improve reliability of storage system.

Key words: RAID6 (Redundant-Array-of-Independent-Disks-6), Non-Maximum-Distance-Separable (Non-MDS) code, local redundant hybrid code, reconstruction, failure recovery

摘要:

最大距离可分(MDS)码中校验块均为全局校验块,重构链长度随着存储系统规模扩大而增长,重构性能逐渐降低。针对上述问题提出一种新型的非最大距离可分(Non-MDS)码:局部冗余混合编码Code-LM(sc)。首先,为缩小重构链长度,任意条带单元组内只有局部校验块,分别为组内水平校验块和水平对角校验块,并设计了局部冗余混合编码的校验布局;然后,根据不同校验块的生成规则,设计了失效数据块的4种重构方式,不同失效块的重构链具有公共块;最后,根据两个故障磁盘所在条带单元组距离不同,将双盘故障分为3种情况,并设计了对应的重构算法。理论分析和实验结果表明,存储规模相同时,与RDP相比,Code-LM(sc)的单盘重构时间和双盘重构时间可减少84%和77%;与V2-Code相比,Code-LM(sc)的单盘重构时间和双盘重构时间可减少67%和73%。因此局部冗余混合编码可支持故障磁盘快速恢复,提高存储系统可靠性。

关键词: RAID6, 非最大距离可分码, 局部冗余混合编码, 重构, 故障恢复

CLC Number: