Journal of Computer Applications ›› 2012, Vol. 32 ›› Issue (12): 3565-3568.DOI: 10.3724/SP.J.1087.2012.03565
• Typical applications • Previous Articles
ZHU Wu-hui,WANG Mei-qing
Received:
Revised:
Online:
Published:
Contact:
竺吴辉,王美清
通讯作者:
作者简介:
基金资助:
Abstract: In a time flooded with massive spam messages, clearing them waste a huge amount of effort and time. The mining sent feature of spam messages is the key to solving this problem. On the basis of analyzing current text-message filtering mechanisms, an effective interaction period is proposed by combining the discrete interaction units of the message sender into a consecutive interaction unit according to the essence of median filter. Utilizing the ratio of input to output and Effective Interaction Period (EIP), a general filtering algorism of spam message is built. Experimenting on 20 millions real messages, the recall ratio of the proposed algorithm is 99.51% and the precision ratio is 49.90%. The experimental results indicate that the novel algorism greatly enhances the efficiency and velocity of detection, which can be applied to spam messages real-time intercepted technology.
Key words: spam message, interaction unit, EIP, the ratio of input to output, precision ratio, recall ratio
摘要: 在一个垃圾短信泛滥的时代,清除垃圾短信将耗费大量的时间和精力,挖掘垃圾短信的发送特征是解决这一问题的关键。在分析现有的短信过滤机制(算法)的基础上,根据中值滤波的思想,将短信发送者离散的交互单元合并成一个连续的交互单元,进而提出有效交互周期的概念,以入出比、有效交互周期等特征建立垃圾短信的综合过滤算法。通过对2000万条真实短信记录进行实验,统计得到过滤算法针对垃圾短信的查全率达到99.51%,查准率为49.90%。实验结果表明,算法提高了垃圾短信检测的效率和速度,可适用于垃圾短信实时拦截技术。
关键词: 垃圾短信, 交互单元, 有效交互周期, 入出比, 查准率, 查全率
ZHU Wu-hui WANG Mei-qing. Spam phone number filtering method based on SMS submission pattern[J]. Journal of Computer Applications, 2012, 32(12): 3565-3568.
竺吴辉 王美清. 基于短信发送模式的垃圾号码过滤算法[J]. 计算机应用, 2012, 32(12): 3565-3568.
0 / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.joca.cn/EN/10.3724/SP.J.1087.2012.03565
https://www.joca.cn/EN/Y2012/V32/I12/3565