1 于满泉,陈铁睿,许洪波.基于分块的网页信息解析器的研究与设计[J].计算机应用,2005,25(4):974. YUM Q, CHENT R, XUH B. Research and design of HTML parser based on page segmentation [J]. Journal of Computer Applications, 2005, 25(4): 974.
2 程岚岚.基于正则表达式的大规模网页术语对抽取研究[J].情报杂志,2008,27(11):62-64,68. CHENGL L. The study of large-scale Web term-pairs extraction based on regular expressions[J]. Journal of Intelligence, 2008, 27(11): 62-64, 68.
3 胡军伟,秦奕青,张伟.正则表达式在Web信息抽取中的应用[J].北京信息科技大学学报(自然科学版),2011,26(0):86-89. HUJ W, QINY Q, ZHANGW. Regular expression and its applications to Web information extraction[J]. Journal of Beijing Information Science & Technology University, 2011, 26(6): 86-89.
4 靳小川,刘万军,赵雷.基于正则表达式的企业主页信息抽取[J].计算机系统应用,2010,19(8):70-73. JINX C, LIUW J, ZHAOL. Enterprise homepage information extraction based on regular expression [J]. Computer Systems & Applications, 2010, 19(8): 70-73.
5 朱文琰,郑肖雄.基于正则表达式构建学习的网页信息抽取方法[J].计算机应用与软件, 2017, 34(2):14. ZHUW Y, ZHENGX X. A webpage information extraction method based on regex construction learning [J]. Computer Applications and Software, 2017, 34(2): 14.
6 COWIEJ, LEHNERTW. Information extraction [J]. Communications of the Association for Computing Machinery, 1996, 39(1): 80-91.
7 陈钊,张冬梅.Web信息抽取技术综述[J].计算机应用研究,2010,27(12):4401. CHENZ, ZHANGD M. Survey of Web information extraction technologies [J]. Application Research of Computers, 2010, 27(12): 4401.
8 YEONGSUK, SEUNGWOOL. SVM-based Web content mining with leaf classification unit from DOM-tree [C]// Proceedings of the 9th International Conference on Knowledge and Smart Technology. Piscataway, NJ: IEEE, 2017: 359-364.
9 陈琼,苏文健.基于网页结构树的Web信息抽取方法[J].计算机工程,2005,31(20):54-55,140. CHENQ, SUW J. Web information extraction based on Web structure tree [J]. Computer Engineering, 2005, 31(20): 54-55, 140.
10 李效东,顾毓清.基于DOM的Web信息提取[J].计算机学报,2002,25(5):526-533. LIX D, GUY Q. DOM-based information extraction for the Web sources [J]. Chinese Journal of Computers, 2002, 25(5): 526-533.
11 王敬普,林亚平,周顺先,等.基于包装器模型的文本信息抽取[J].计算机应用,2006,26(3):655-658. WANGJ P, LINY P, ZHOUS X, et al. Text information extraction based on wrapper model [J]. Journal of Computer Applications, 2006, 26(3): 655-658.
12 王辉,郁波,洪宇,等.基于知识图谱的Web信息抽取系统[J].计算机工程,2017,43(6):118 WANGH, YUB, HONGY, et al. Web information extraction system based on knowledge graph[J]. Computer Engineering, 2017, 43(6): 118.)
13 ZHOUP, EL-GOHARYN. Ontology-based automated information extraction from building energy conservation codes [J]. Automation in Construction, 2017,74: 103-117.)
14 BASTIANM R, PURWARIANTIA. Information extraction in statistics indicator tables using rule generalizations and ontology [C]// Proceedings of the 2016 International Conference on Information Technology Systems and Innovation. Piscataway: IEEE, 2016: 1-6.
15 王放,顾宁,吴国文.基于本体的Web表格信息抽取[J].小型微型计算机系统,2003,24(12):2142-2146. WANGF, GUN, WUG W. Extracting information from ontology-based Web table[J]. Journal of Chinese Computer Systems, 2003, 24(12): 2142-2146.
16 李贯峰,张鹏.一个基于农业本体的Web知识抽取模型[J].江苏农业科学,2018,46(4):201-205. LIG F, ZHANGP. A Web knowledge extraction model based on agricultural ontology[J]. Jiangsu Agricultural Sciences, 2018, 46(4): 201-205.
17 VIANIN, LARIZZAC, TIBOLLOV, et al. Information extraction from italian medical reports: an ontology-driven approach [J]. International Journal of Medical Informatics, 2018, 111(3): 140-148.
18 徐维.本体应用中术语本体和信息本体解析——以生物医学信息学领域为例[J].图书馆杂志,2015,34(6):11-16. XUW. Analysis of terminology ontology and informatics ontology: an example in biomedical informatics [J]. Library Journal, 2015, 34(6): 11-16.
19 LIC X, SUY R, WANGR J, et al. Structured AJAX data extraction based on agricultural ontology [J]. Journal of Integrative Agriculture, 2012, 11(5): 784-791.
20 段宇锋,黄思思.中文植物物种多样性描述文本的信息抽取研究[J].现代图书情报技术,2016,32(1):87-96. DUANY F, HUANGS S. Information extraction from Chinese plant species diversity description text [J]. New Technology of Library and Information Service, 2016, 32(1): 87-96.
21 LIUL, ÖZSUM. Encyclopedia of Database Systems [M]. 2nd ed. New York: Springer, 2017: 1613-1619.
22 国家发改委.招标公告和公示信息发布管理办法[EB/OL].[2019-07-01]. http://www.gov.cn/gongbao/content/2018/content_5264881.htm. National Development and Reform Commission.Measures for the administration of bidding announcement and publication of published information[EB/OL]. [2019-07-01]. http://www.gov.cn/gongbao/content/2018/content_5264881.htm.)
23 冯志伟.自然语言处理简明教程[M].上海:上海外语教育出版社,2012:75-82. FENGZ W. A Concise Course of Natural Language Processing [M]. Shanghai: Shanghai Foreign Language Education Press, 2012: 75-82.
24 顾韵华,高原,高宝,等.基于模板和领域本体的Deep Web信息抽取研究[J].计算机工程与设计,2014,35(1):327-332. GUY H, GAOY, GAOB, et al. Research on Deep Web information extraction based on template and domain ontology [J]. Computer Engineering and Design, 2014, 35(1): 327-332. |