[1] BERGMAN M K. The deep Web: surfacing hidden value [J]. Journal of Electronic Publishing, 2001,7(1):113-153. [2] HE B, PATEL M, ZHANG Z, et al. Accessing the deep Web: a survey [J]. Communications of ACM, 2007,50(5):94-101. [3] MADHAVAN J, JEFFERY S, COHEN S, et al. Web-scale data integration: you can only afford to pay as you go [EB/OL]. [2015-01-04]. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.66.9358&rep=rep1&type=pdf. [4] CAFARELLA M J, HALEVY A, MADHAVAN J. Structured data on the Web [J]. Communications of ACM, 2011,54(2):72-79. [5] MADHAVAN J, KO D, KOT L, et al. Google's deep Web crawl [J]. Proceedings of the Very Large Data Base Endowment, 2008,1(2):1241-1252. [6] ARGUELLO J, CALLAN J, DIAZ F. Classification-based resource selection [C]//Proceedings of the 18th ACM Conference on Information and Knowledge Management. New York: ACM, 2009:1277-1286. [7] SHAN J, MAN L. Simple may be best -a simple and effective method for federated Web search via search engine impact factor estimation [EB/OL]. [2015-01-06]. http://trec.nist.gov/pubs/trec23/papers/pro-ECNU_federated.pdf. [8] CALLAN J, CONNELL M. Query-based sampling of text databases [J]. ACM Transactions on Information Systems, 2011,19(2):97-130. [9] HIEMSTRA D, DEMEESTER T, TRIESCHNIGG D. TREC federated Web search track [EB/OL]. [2015-01-03]. https://sites.google.com/site/trecfedweb/. [10] CALLAN J P, LU Z, CROFT W B. Searching distributed collections with inference networks [C]//Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM, 1995:21-28. [11] SI L, JIN R, CALLAN J, et al. A language modeling framework for resource selection and results merging [C]//Proceedings of the 11th International Conference on Information and Knowledge Management. New York: ACM, 2002:391-397. [12] SEO J, CROFT W B. Blog site search using resource selection [C]//Proceedings of the 17th ACM Conference on Information and Knowledge Management. New York: ACM, 2008:1053-1062. [13] SI L, CALLAN J. Relevant document distribution estimation method for resource selection [C]//Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM, 2003:298-305. [14] SHOKOUHI M. Central-rank-based collection selection in uncooperative distributed information retrieval [C]//Proceedings of the 29th European Conference on Information Retrieval. Berlin: Springer, 2007:160-172. [15] IPEIROTIS P G, GRAVANO L. Classification-aware hidden-Web text database selection [EB/OL]. [2015-01-08]. http://128.59.11.212/~gravano/Papers/2008/tois08.pdf. [16] BELLOGIN A, GEBREMESKEL G G, HE J, et al. CWI and TU delft at TREC 2013: contextual suggestion, federated Web search, KBA, and Web tracks [EB/OL]. [2015-01-08]. http://ir.ii.uam.es/~alejandro/2013/trec.pdf. [17] XU J, CROFT W B. Cluster-based language models for distributed retrieval [C]//Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM, 1999:254-261. [18] BAILLIE M, CARMEN M, CRESTANI F. A multiple-collection latent topic model for federated search [J]. Information Retrieval, 2011,14(4):390-412. [19] DEMEESTER T, NGUYEN D, TRIESCHNIGG D, et al. What snippets say about pages in federated Web search [C]//Proceedings of the 8th Asia Information Retrieval Societies Conference. Berlin: Springer, 2012:250-261. [20] DEMEESTER T, NGUYEN D, TRIESCHNIGG D, et al. Snippet-based relevance predictions for federated Web search [C]//Proceedings of the 35th European Conference on Advances in Information Retrieval. Berlin: Springer, 2013:697-700. [21] CALLAN J. Distributed IR testbed definitions [EB/OL]. [2015-01-08]. http://boston.lti.cs.cmu.edu/callan/Data/#DIR. [22] NGUYEN D, DEMEESTER T, TRIESCHNIGG D, et al. Federated search in the wild: the combined power of over a hundred search engines [C]//Proceedings of the 21st ACM Conference on Information and Knowledge Management. New York: ACM, 2012:1874-1878. [23] DEMEESTER T, TRIESCHNIGG D, NGUYEN D, et al. Overview of the TREC 2013 federated Web search track [EB/OL]. [2015-01-02]. https://biblio.ugent.be/input/download?func=downloadFile&recordOId=4402037&fileOId=4402038. [24] DEMEESTER T, TRIESCHNIGG D, NGUYEN D, et al. Overview of the TREC 2014 Federated Web Search Track [EB/OL]. [2015-01-02]. http://www.dcs.gla.ac.uk/~zhouke/papers/trec2014fedweb-draft.pdf. [25] DEMEESTER T, ALY R, HIEMSTRA D, et al. Exploiting user disagreement for Web search evaluation: an experimental approach [C]//Proceedings of the 7th ACM International Conference on Web Search and Data Mining. New York: ACM, 2014:33-42. [26] KEKÄLÄINEN J, JÄRVELIN K. Using graded relevance assessments in IR evaluation [J]. Journal of the American Society for Information Science and Technology, 2002,53(13):1120-1129. [27] MCCALLUM A K. MALLET: a machine learning for language toolkit [EB/OL]. [2015-01-02]. http://mallet.cs.umass.edu. [28] LIU Z, ZHANG Y, CHANG E Y, et al. PLDA+: parallel latent Dirichlet allocation with data placement and pipeline processing [J]. ACM Transactions on Intelligent Systems and Technology, 2011,2(3):Article No. 26. [29] SHOKOUHI M, SI L. Federated search [J]. Foundations and Trends in Information Retrieval, 2011,5(1):1-102. [30] GABRILOVICH E, MARKOVITCH S. Computing semantic relatedness using Wikipedia-based explicit semantic analysis [C]//Proceedings of the 20th International Joint Conference on Artificial Intelligence. San Francisco: Morgan Kaufmann Publishers, 2007:1606-1611. |