基于网页文本结构的网页去重
魏丽霞 郑家恒
Detection and elimination of similar Web pages based on text structure
Li-Xia WEI jia-heng zheng
计算机应用 . 2007, (11): 2854 -2856 .