[1] ELMAGARMID A K, IPEIROTIS P G, VERYKIOS V S. Duplicate record detection:a survey[J]. IEEE Transactions on Knowledge and Data Engineering, 2007, 19(1):1-16. [2] KÖPCKE H, RAHM E. Frameworks for entity matching:a comparison[J]. Data & Knowledge Engineering, 2010, 69(2):197-210. [3] HERNÁNDEZ M A, STOLFO S J. The merge/purge problem for large databases[C]//Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data. New York:ACM, 1995:127-138. [4] 王宏志,樊文飞.复杂数据上的实体识别技术研究[J].计算机学报,2011,34(10):1843-1852.(WANG H Z, FAN W F. Object identification on complex data:a survey[J]. Chinese Journal of Computers, 2011, 34(10):1843-1852.) [5] 孙琛琛,申德荣,寇月,等.面向关联数据的联合式实体识别方法[J].计算机学报,2015,38(9):1739-1754.(SUN C C, SHEN D R, KOU Y, et al. A related data oriented joint entity resolution approach[J]. Chinese Journal of Computers, 2015, 38(9):1739-1754.) [6] 寇月,申德荣,刘恒,等.异构网络中关联实体识别模型及增量式验证算法研究[J].计算机学报,2013,36(10):2096-2108.(KOU Y, SHEN D R, LIU H, et al. Research on related entity identification model and incremental verification algorithm for heterogeneous networks[J]. Chinese Journal of Computers, 2013, 36(10):2096-2108.) [7] ANANTHAKRISHNA R, CHAUDHURI S, GANTI V. Eliminating fuzzy duplicates in data warehouses[C]//Proceedings of the 28th International Conference on Very Large Data Bases. San Francisco, CA:Morgan Kaufmann, 2002:586-597. [8] BHATTACHARYA I, GETOOR L. Collective entity resolution in relational data[J]. ACM Transactions on Knowledge Discovery from Data, 2007, 1(1):Article No. 5. [9] ALTWAIJRY H, KALASHNIKOV D V, MEHROTRA S. Query-driven approach to entity resolution[J]. Proceedings of the VLDB Endowment, 2013, 6(14):1846-1857. [10] ALTWAIJRY H, MEHROTRA S, KALASHNIKOV D V. QuERy:a framework for integrating entity resolution with query processing[J]. Proceedings of the VLDB Endowment, 2015, 9(3):120-131. [11] BHATTACHARYA I, GETOOR L. Query-time entity resolution[J]. Journal of Artificial Intelligence Research, 2007, 30(1):621-657. [12] IOANNOU E, NEJDL W, NIEDERÉE C, et al. On-the-fly entity-aware query processing in the presence of linkage[J]. Proceedings of the VLDB Endowment, 2010, 3(1/2):429-438. [13] SISMANIS Y, WANG L, FUXMAN A, et al. Resolution-aware query answering for business intelligence[C]//Proceedings of the 2009 IEEE 25th International Conference on. Washington, DC:IEEE Computer Society, 2009:976-987. [14] ALTOWIM Y, KALASHNIKOV D V, MEHROTRA S. Progressive approach to relational entity resolution[J]. Proceedings of the VLDB Endowment, 2014, 7(11):999-1010. [15] WHANG S E, MARMAROS D, GARCIA-MOLINA H. Pay-as-you-go entity resolution[J]. IEEE Transactions on Knowledge and Data Engineering, 2013, 25(5):1111-1124. [16] GRUENHEID A, DONG X L, SRIVASTAVA D. Incremental record linkage[J]. Proceedings of the VLDB Endowment, 2014, 7(9):697-708. [17] WHANG S E, GARCIA-MOLINA H. Incremental entity resolution on rules and data[J]. The VLDB Journal, 2014, 23(1):77-102. [18] CORMODE G, GAROFALAKIS M, HAAS P J, et al. Synopses for massive data:samples, histograms, wavelets, sketches[J]. Foundations and Trends in Databases, 2012, 4(1/2/3):1-294. [19] GAROFALAKIS N, GIBBONS P B. Approximate query processing:taming the terabytes[C]//VLDB 2001:Proceedings of 27th International Conference on Very Large Data Bases. San Francisco, CA:Morgan Kaufmann, 2001:169-212. [20] ACHARYA S, GIBBONS P B, POOSALA V, et al. The Aqua approximate query answering system[C]//Proceedings of the 1999 ACM SIGMOD International Conference on Management of Data. New York:ACM, 1999:574-576. [21] AGARWAL S, MOZAFARI B, PANDA A, et al. BlinkDB:queries with bounded errors and bounded response times on very large data[C]//Proceedings of the 8th ACM European Conference on Computer Systems. New York:ACM, 2013:29-42. [22] BABCOCK B, CHAUDHURI S, DAS G. Dynamic sample selection for approximate query processing[C]//Proceedings of the 2003 ACM SIGMOD International Conference on Management of data. New York:ACM, 2003:539-550. [23] CHAUDHURI S, DAS G, NARASAYYA V. Optimized stratified sampling for approximate query processing[J]. ACM Transactions on Database Systems, 2007, 32(2):9. [24] CONDIE T, CONWAY N, ALVARO P, et al. Online aggregation and continuous query support in MapReduce[C]//Proceedings of the 2010 ACM SIGMOD International Conference on Management of data. New York:ACM, 2010:1115-1118. [25] HELLERSTEIN J M, HASS P J, WANG H J. Online aggregation [C]// Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data. New York: ACM, 1997: 171-182. [26] PANSARE N, BORKAR V R, JERMAINE C, et al. Online aggregation for large MapReduce jobs [J]. Proceedings of the VLDB Endowment, 2011, 4(11): 1135-1145. [27] WU S, JIANG S X, OOI B C, et al. Distributed online aggregations [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 443-454. [28] WANG J N, KRISHNAN S, FRANKLIN M J, et al. A sample-and-clean framework for fast and accurate query processing on dirty data [C]// Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data. New York: ACM, 2014: 469-480. |