CORC  > 北京大学  > 信息科学技术学院
A Multi-Resolution-Concentration Based Feature Construction Approach for Spam Filtering
Mi, Guyue ; Zhang, Pengtao ; Tan, Ying
2013
关键词SUPPORT VECTOR MACHINES
英文摘要This paper proposes a multi-resolutionconcentration (MRC) based feature construction approach for spam filtering by progressively partitioning an email into local areas on smaller and smaller resolutions. The MRC approach depicts a dynamic process of gradual refinement in locating the pathogens by calculating concentrations of detectors on local areas, and is considered to be able to extract the position-correlated and process-correlated information from emails. Furthermore, A weighted MRC (WMRC) approach is presented by considering the different activity levels of detectors in calculation of concentrations. A generic structure of the MRC model, which mainly contains detector sets construction and multi-resolution concentrations calculation, is designed. The implementations of MRC and WMRC approaches are described in detail. Experiments are conducted on five benchmark corpora using cross-validation to evaluate the proposed MRC model. Comprehensive experimental results suggest that the MRC and WMRC approaches perform far better than the prevalent bag-of-words approach in both performance and efficiency. Compared with the concentration based feature construction approach and local-concentration based feature extraction approach, MRC and WMRC achieve higher accuracy and F-1 measure, which demonstrates the effectiveness of the MRC model. In addition, it is shown that both the MRC and WMRC approaches cooperate well with variety of classification methods, which endows the MRC model with flexible capability in the real world.; Computer Science, Artificial Intelligence; Computer Science, Hardware & Architecture; Engineering, Electrical & Electronic; EI; CPCI-S(ISTP); 0
语种英语
DOI标识10.1109/IJCNN.2013.6706876
内容类型其他
源URL[http://ir.pku.edu.cn/handle/20.500.11897/292667]  
专题信息科学技术学院
推荐引用方式
GB/T 7714
Mi, Guyue,Zhang, Pengtao,Tan, Ying. A Multi-Resolution-Concentration Based Feature Construction Approach for Spam Filtering. 2013-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace