CORC  > 北京大学  > 信息科学技术学院
Active Semi-supervised Framework with Data Editing
Zhang, Xue ; Xiao, Wangxin
刊名computer science and information systems
2012
关键词sparsely labeled text classification active learning semi-supervised learning data editing
DOI10.2298/CSIS120202045Z
英文摘要In order to address the insufficient training data problem, many active semi-supervised algorithms have been proposed. The self-labeled training data in semi-supervised learning may contain much noise due to the insufficient training data. Such noise may snowball themselves in the following learning process and thus hurt the generalization ability of the final hypothesis. Extremely few labeled training data in sparsely labeled text classification aggravate such situation. If such noise could be identified and removed by some strategy, the performance of the active semi-supervised algorithms should be improved. However, such useful techniques of identifying and removing noise have been seldom explored in existing active semi-supervised algorithms. In this paper, we propose an active semi-supervised framework with data editing (we call it ASSDE) to improve sparsely labeled text classification. A data editing technique is used to identify and remove noise introduced by semi-supervised labeling. We carry out the data editing technique by fully utilizing the advantage of active learning, which is novel according to our knowledge. The fusion of active learning with data editing makes ASSDE more robust to the sparsity and the distribution bias of the training data. It further simplifies the design of semi-supervised learning which makes ASSDE more efficient. Extensive experimental study on several real-world text data sets shows the encouraging results of the proposed framework for sparsely labeled text classification, compared with several state-of-the-art methods.; Computer Science, Information Systems; Computer Science, Software Engineering; SCI(E); 0; ARTICLE; 4,SI; 1513-1532; 9
语种英语
内容类型期刊论文
源URL[http://ir.pku.edu.cn/handle/20.500.11897/291797]  
专题信息科学技术学院
推荐引用方式
GB/T 7714
Zhang, Xue,Xiao, Wangxin. Active Semi-supervised Framework with Data Editing[J]. computer science and information systems,2012.
APA Zhang, Xue,&Xiao, Wangxin.(2012).Active Semi-supervised Framework with Data Editing.computer science and information systems.
MLA Zhang, Xue,et al."Active Semi-supervised Framework with Data Editing".computer science and information systems (2012).
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace