CORC  > 北京大学  > 信息科学技术学院
TML: a general high-performance text mining language
Li, Jiajing ; Li, Xiaoming ; Meng, Tao
刊名jisuanji yanjiu yu fazhancomputer research and development
2015
DOI10.7544/issn1000-1239.2015.20131546
英文摘要This paper proposes a general-purpose programming language named TML for text mining. TML is the abbreviation of 'text mining language', and it aims at turning complicated text mining tasks into easy jobs. The implementation of TML includes a compiler, a runtime virtual machine (interpreter), and an IDE. TML has supplied most usual text mining techniques, which are implemented as grammars and reserved words. Users can use TML to program, and the code will be compiled into bytecodes, which will be next interpreted in the virual runtime machine. TML has the following characteristics: 1) It supplies a formal way to model the searching area, object definition and mining methods of text mining jobs, so users can program with it to make a declarative text mining easily; 2) The TML runtime machine implements usual text mining techniques, and organizes them into an efficient text analysis pipeline; 3) The TML compiler fully explores the possibility of concurrently executing its byte codes, and the execution has good performance on very large collections of documents and user-written rules. TML has been used in several large-scale online data analysis applications, including commodity purchase intention analysis, fine-grained reputation analysis of brands and products, and legal document analysis. ?, 2015, Jisuanji Yanjiu yu Fazhan/Computer Research and Development. All right reserved.; EI; 0; 3; 553-560; 52
语种英语
内容类型期刊论文
源URL[http://ir.pku.edu.cn/handle/20.500.11897/329394]  
专题信息科学技术学院
推荐引用方式
GB/T 7714
Li, Jiajing,Li, Xiaoming,Meng, Tao. TML: a general high-performance text mining language[J]. jisuanji yanjiu yu fazhancomputer research and development,2015.
APA Li, Jiajing,Li, Xiaoming,&Meng, Tao.(2015).TML: a general high-performance text mining language.jisuanji yanjiu yu fazhancomputer research and development.
MLA Li, Jiajing,et al."TML: a general high-performance text mining language".jisuanji yanjiu yu fazhancomputer research and development (2015).
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace