[1]Adobe Systems Incorporated.PDF reference:sixth edition[EB/OL].[2010-10-23].http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/pdf_reference_1-7.pdf.[2]杨道良.面向对象的中文PDF阅读器的设计与实现[J].计算机应用,1999,19(6):1-4.[3]李强,刘时进.PDF阅读器的设计与实现[J].计算机工程与设计,2010,31(7):1635-1638.[4]李贵林,李建中,杨艳.Plug-in实现对PDF文件的信息提取[J].计算机应用,2003,23(2):110-112.[5]李珍,田学东.PDF文件信息的抽取与分析[J].计算机应用,2003,23(12):145-147.[6]张秀秀,张立峰.PDF文件文本内容提取研究[J].科技情报开发与经济,2008,18(36):118-120.[7]WILLIAM S L,DAVID F B.Document analysis of PDF files:methods,results and implications[J].Electronic Publishing Origination Dissemination and Design,1995,8(2/3):207-220.[8]YUAN FANG,LIU BO,YU GE.A study on information extraction from PDF files[C]// ICMLC 2005:Proceedings of the 4th International Conference Advances in Machine Learning and Cybernetics,LNCS 3930.Berlin:Springer-Verlag,2005:258-267.[9]CHAO HUI,FAN JIAN.Layout and content extraction for PDF documents[C]// DAS 2004:Proceedings of Document Analysis Systems,LNCS 3108.Berlin:Springer-Verlag,2004:213-224.[10]TAMIR H,ROBERT B.Intelligent text extraction from PDF documents[C]// CIMCA/IAWTIC 2005:Proceedings of the 2005 International Conference on Computational Intelligence for Modelling,Control and Automation,and International Conference on Intelligent Agents,Web Technologies and Internet Commerce.Washington,DC:IEEE Computer Society,2005:2-6.[11]宋艳娟,张文德.基于XML的PDF文档信息抽取系统的研究[J].现代图书情报技术,2005,21(9):10-13.[12]陈俊林,张文德.基于XSLT的PDF论文元数据的优化抽取[J].现代图书情报技术,2007,23(2):18-23.[13]宋艳娟,李金铭,陈振标.基于XSLT的PDF信息抽取技术的研究[J].计算机与数字工程,2008,36(5):156-159.[14]GONZALO N,MATHIEU R.Flexible pattern matching in strings:practical on-line search algorithms for texts and biological sequences[M].Cambridge:Cambridge University Press,2002:49-54. |