- 87
- 0
- 约1.22万字
- 约 11页
- 2017-04-02 发布于湖北
- 举报
垃圾邮件识别和过滤的方法谈兆年北京理工大学计算机学,北京 100081(1120131743@)Methods for Identifying and Filtering Junk Mail or SpamTan Zhaonian(ClassSchool of Computer Science, Beijing Institute of Technology, Beijing 100081)AbstractIdentifying and Filtering Spam is an important research subject in computer network. In this thesis, I have studied the history of spam filtering technology, which mainly includes the first generation of rule-based filtering technology, the second generation of content-based filtering technology and the third generation of behavior-based filtering technology. 1. Rule-based filtering includes IP address based filtering, mail header based filtering. 2. Content-based filtering includes Bayesian filtering, Memory-based method, decision tree, Boosting method, Support Vector Machine (SVM), etc. 3. Behavior-based filtering includes Email data stream based filtering, mail header based filtering, sender reputation based filtering, mail fingerprint based filtering, behavioral characteristics weighted based filtering, etc. The spammers’ common spurious methods are summarized. Through the reference to large amount of anti-spam documents and data from home and broad, an analysis is made on existing anti-spam techniques and in particular the content-based spam filtering methods.Keywords spam filtering; rule; content; text categorization; Na?ve Bayes; behavior摘要垃圾邮件识别和过滤是计算机网络领域的一个重要研究课题。垃圾邮件识别和过滤目前已经发展出了三代技术,第一代过滤技术是基于规则的,例如:基于IP地址、基于邮件头的过滤技术。第二代过滤技术是基于内容的,例如:贝叶斯分类算法、Memory-Based方法、决策树、Boosting方法、支持向量机等方法。第三代过滤技术是基于行为的,例如:基于邮件数据流、基于邮件头信息、基于发送方信誉、基于邮件指纹、基于行为特征加权的决策树等过滤方法。本文归纳总结了当前垃圾邮件发送者经常采用的欺骗手段和方法,并参阅国内外大量反垃圾邮件文献和数据,对已有的垃圾邮件技术作出分析和总结,尤其是对基于内容的垃圾邮件过滤方法进行了研究。关键词垃圾邮件过滤;规则;内容;文本分类;简单贝叶斯;行为随着互联网的发展,垃圾邮件常常让人头痛不已,最新报告称美国为垃圾邮件第一大国,中国排名第三(图1)[1]。垃圾邮件问题如今已经成为一个社会热点,近些年来,研究人员们提出了很多垃圾邮件识别和过滤的方法。这些方法的发展经历了三代,第一代过滤技术是基于规则的,例如:基于IP地址、基于邮件头的过滤技术。第二代过滤技术是基于内容的,例如:贝叶斯分类算法、Memory-Based方法、决策树、Boosting方法、支持向量机等方法。第三代过滤技术是基于行为的,例如:基于邮件数据流、基于邮件头信息、基于发送方信誉、基于邮件指纹、基于行为特征加权的决策树等过滤方法。
您可能关注的文档
最近下载
- 工业机器人离线编程与仿真 教案 项目1--4 仿真软件的安装与工作站的构建---仿真软件的应用.docx VIP
- 人文艺术欣赏ppt课件(优质ppt).pptx VIP
- 伤口评估与护理记录PPT课件.pptx
- 骨科手术高龄患者并发症及相关护理措施-来源:现代养生(下半月版)(第2018001期)-河北省医疗气功医院.pdf VIP
- 小学一二年级全册体育教案.pdf VIP
- 大数据环境下网络安全问题探讨.doc VIP
- 论司法确认程序审查规则检视与优化.docx VIP
- 2022年中国社会科学院法律史考博真题、考博参考书,考博资料,难度分析.pdf VIP
- 大数据背景下网络信息安全问题的研究.doc VIP
- 《绘本赏析与阅读指导》学前教育专业全套教学课件.pptx
原创力文档

文档评论(0)