[1] VISHWANATH K V, NAGAPPAN N. Characterizing cloud computing hardware reliability[C]//Proceedings of the 1st ACM Symposium on Cloud Computing. New York:ACM, 2010:193-204. [2] XIN Q, MILLER E L, SCHWARZ T, et al. Reliability mechanisms for very large storage systems[C]//Proceedings of the 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies. Piscataway:IEEE,2003:146-156. [3] ZHANG M, HAN S, LEE P P C. A simulation analysis of reliability in erasure-coded data centers[C]//Proceedings of the IEEE 36th Symposium on Reliable Distributed Systems. Piscataway:IEEE,2017:144-153. [4] LI J,JI X,JIA Y,et al. Hard drive failure prediction using classification and regression trees[C]//Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks. Piscataway:IEEE,2014:383-394. [5] MA A,DOUGLIS F,LU G,et al. RAIDShield:characterizing, monitoring,and proactively protecting against disk failures[C]//Proceedings of the 13th USENIX Conference on File and Storage Technologies. Berkeley:USENIX Association,2015:241-256. [6] WU S,JIANG H,MAO B. Proactive data migration for improved storage availability in large-scale data centers[J]. IEEE Transactions on Computers,2015,64(9):2637-2651. [7] XU C,WANG G,LIU X,et al. Health status assessment and failure prediction for hard drives with recurrent neural networks[J]. IEEE Transactions on Computers,2016,65(11):3502-3508. [8] LI J,STONES R J,WANG G,et al. Being accurate is not enough:new metrics for disk failure prediction[C]//Proceedings of the IEEE 35th Symposium on Reliable Distributed Systems. Piscataway:IEEE,2016:71-80. [9] LI J,STONES R J,WANG G,et al. Hard drive failure prediction using decision trees[J]. Reliability Engineering and System Safety, 2017,164:55-65. [10] QIN A,HU D,LIU J,et al. Fatman:cost-saving and reliable archival storage based on volunteer resources[J]. Proceedings of the VLDB Endowment,2014,7(13):1748-1753. [11] JI X,MA Y,MA R,et al. A proactive fault tolerance scheme for large scale storage systems[C]//Proceedings of the 2015 International Conference on Algorithms and Architectures for Parallel Processing, LNCS 9530. Cham:Springer, 2015:337-350. [12] ONGARO D,RUMBLE S M,STUTSMAN R,et al. Fast crash recovery in RAMCloud[C]//Proceedings of the 23rd ACM Symposium on Operating Systems Principles. New York:ACM, 2011:29-41. [13] SHVACHKO K, KUANG H, RADIA S, et al. The Hadoop distributed file system[C]//Proceedings of the IEEE 26th Symposium on Mass Storage Systems and Technologies. Piscataway:IEEE,2010:1-10. [14] 李静, 刘冬实. 主动容错云存储系统的可靠性评价模型[J]. 计算机应用,2018,38(9):2631-2636,2649.(LI J,LIU D S. Reliability evaluation model for cloud storage systems with proactive fault tolerance[J]. Journal of Computer Applications, 2018,38(9):2631-2636,2649.) [15] 章宏灿, 薛巍. 集群RAID5存储系统可靠性分析[J]. 计算机研究与发展,2010,47(4):727-735.(ZHANG H C,XUE W. Reliability analysis of cluster RAID5 storage system[J]. Journal of Computer Research and Development,2010,47(4):727-735.) [16] SCHROEDER B,GINSON G A. Disk failures in the real world:what does an MTTF of 1,000,000 hours mean to you?[C]//Proceedings of the 5th USENIX Conference on File and Storage Technologies. Berkeley:USENIX Association,2007:1-16. [17] LU Y, MILLER A A, HOFFMANN R, et al. Towards the automated verification of Weibull distributions for system failure rates[C]//Proceedings of the 21st International Workshop on Formal Methods for Industrial Critical Systems/16th International Workshop on Automated Verification of Critical Systems,LNCS 9933. Cham:Springer,2016:81-96. [18] ELERATH J G,SCHINDLER J. Beyond MTTDL:a closed-form RAID 6 reliability equation[J]. ACM Transactions on Storage, 2014,10(2):No. 7. [19] VENKATESAN V,ILIADIS I. A general reliability model for data storage systems[C]//Proceedings of the 9th International Conference on Quantitative Evaluation of Systems Quantitative Evaluation of Systems. Piscataway:IEEE,2012:209-219. [20] EPSTEIN A,KOLODNER E K,SOTNIKOV D. Network aware reliability analysis for distributed storage systems[C]//Proceedings of the IEEE 35th Symposium on Reliable Distributed Systems. Piscataway:IEEE,2016:249-258. [21] WANG J, WU H, WANG R. A new reliability model in replication-based big data storage systems[J]. Journal of Parallel and Distributed Computing,2017,108:14-27. [22] HALL R J. Tools for predicting the reliability of large-scale storage systems[J]. ACM Transactions on Storage,2016,12(4):No. 24 [23] ECKART B,CHEN X,HE X,et al. Failure prediction models for proactive fault tolerance within storage systems[C]//Proceedings of the 2008 IEEE International Symposium on Modeling,Analysis and Simulation of Computers and Telecommunication Systems. Piscataway:IEEE,2008:1-8. [24] LI J,LI M,WANG G,et al. Global reliability evaluation for cloud storage systems with proactive fault tolerance[C]//Proceedings of the 2015 International Conference on Algorithms and Architectures for Parallel Processing,LNCS 9531. Cham:Springer,2015:189-203. [25] LI J,LI P,STONES R J,et al. Reliability equations for cloud storage systems with proactive fault tolerance[J]. IEEE Transactions on Dependable and Secure Computing,2020,17(4):782-794. |