[1] SWARTZ N. Gartner warns firms of "dirty data"[J]. Information Management Journal, 2007, 41(3): 6-7.
[2] ECKERSON W W. Data quality and the bottom line: achieving business success through a commitment to high quality data[EB/OL]. [2016-03-10]. http://download.101com.com/pub/tdwi/Files/DQReport.pdf.
[3] GRAHAM C. Forecast: data quality tools, worldwide, 2006-2011[EB/OL]. [2016-03-10]. https://www.gartner.com/doc/507207/forecast-data-quality-tools-worldwide.
[4] QIN Y X, DUAN L, YUE K. Approach for cleaning uncertain data based on information entropy theory[J]. Journal of Computer Applications, 2013, 33(9): 2490-2492.
[5] RAHM E, DO H H. Data cleaning: problems and current approaches[J]. IEEE Data Engineering Bulletin, 2000, 23(4): 3-13.
[6] YANG M H, GU Z M. Data preprocessing method based on characteristic of interests for WUM[J]. Journal of Computer Applications, 2006, 26(10): 2393-2388.
[7] GALHARDAS H, FLORESCU D, SHASHA D, et al. Declarative data cleaning: language, model, and algorithms[C]//VLDB 2001: Proceedings of the 27th International Conference on Very Large Data Bases. San Francisco: Morgan Kaufmann Publishers, 2001: 371-380.
[8] VOLKOVS M, CHIANG F, SZLICHTA J, et al. Continuous data cleaning[C]//Proceedings of the 2014 IEEE 30th International Conference on Data Engineering. Piscataway, NJ: IEEE, 2014: 244-255.
[9] OLIVEIRA P, RODRIGUES F, HENRIQUES P, et al. A taxonomy of data quality problems[EB/OL]. [2016-03-10]. https://www.researchgate.net/profile/Helena_Galhardas/publication/250693546_A_Taxonomy_of_Data_Quality_Problems/links/02e7e534798484567c000000.pdf.
[10] EBAID A, ELMAGARMID A, ILYAS I F, et al. NADEEF: a generalized data cleaning system[J]. Proceedings of the VLDB Endowment, 2013, 6(12): 1218-1221.
[11] DALLACHIESA M, EBAID A, ELDAWY A, et al. NADEEF: a commodity data cleaning system[C]//Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. New York: ACM, 2013: 541-552.
[12] LI J K, WANG Y Z, LI Z. AzszpClean: a rule-based solution to data cleaning[J]. Journal of Shandong University (Natural Science), 2007, 42(9): 71-74.
[13] BOHANNON P, FAN W, FLASTER M, et al. A cost-based model and effective heuristic for repairing constraints by value modification[C]//Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data. New York: ACM, 2005: 143-154.
[14] CHOMICKI J, MARCINKOWSKI J. Minimal-change integrity maintenance using tuple deletions[J]. Information and Computation, 2005, 197(1): 90-121.
[15] WIJSEN J. Database repairing using updates[J]. ACM Transactions on Database Systems, 2005, 30(3): 722-768.
[16] FAN W, GEERTS F, JIA X, et al. Conditional functional dependencies for capturing data inconsistencies[J]. ACM Transactions on Database Systems, 2008, 33(2): 6.
[17] BRAVO L, FAN W, MA S. Extending dependencies with conditions[EB/OL]. [2016-03-10]. http://www.vldb.org/conf/2007/papers/research/p243-bravo.pdf.
[18] GOLAB L, KARLOFF H, KORN F, et al. On generating near-optimal tableaux for conditional functional dependencies[J]. Proceedings of the VLDB Endowment, 2008, 1(1): 376-390.
[19] CHU X, ILYAS I F, PAPOTTI P. Holistic data cleaning: put violations into context[C]//Proceedings of the 2013 IEEE 29th International Conference on Data Engineering. Piscataway, NJ: IEEE, 2013: 458-469.
[20] FAN W, MA S, TANG N, et al. Interaction between record matching and data repairing[J]. Journal of Data and Information Quality, 2014, 4(4): Article No. 16.
[21] YAKOUT M, ELMAGARMID A K, NEVILLE J, et al. Guided data repair[J]. Proceedings of the VLDB Endowment, 2011, 4(5): 279-289.
[22] VERBORGH R, DE WILDE M. Using OpenRefine[M]. Birmingham: Packt Publishing, 2013: 53.
[23] PROCTOR M, NEALE M, LIN P, et al. Drools documentation[EB/OL]. [2016-03-10]. http://www.jboss.org/drools/documentation.html.
[24] DING J, CHEN X L, WU P. Deep packet inspection algorithm based on regular expressions[J]. Journal of Computer Applications, 2007, 27(9): 2184-2186.
[25] ZHOU A Y, JIN C Q, WANG G R, et al. A survey on the management of uncertain data[J]. Chinese Journal of Computers, 2009, 32(1): 1-16.