基于网页正文结构和特征串的相似网页去重算法
熊忠阳, 牙漫, 张玉芳
Detection and elimination of similar Web pages based on text structure and string of feature code
XIONG Zhongyang, YA Man, ZHANG Yufang
计算机应用 (Journal of Computer Applications). 2013, (02): 554-557. DOI: 10.3724/SP.J.1087.2013.00554