TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition | |
Liu Z(刘智)1,2,3; Luo, Changyong4; Zheng ZY(郑泽宇)1,2; Li, Yan5; Fu DZ(付殿峥)1,2; Yu, Xinzhu6; Zhao, Jiawei7 | |
Journal | JOURNAL OF HEALTHCARE ENGINEERING |
2021 | |
Volume | 2021; Pages: 1-10 |
ISSN | 2040-2295 |
Ownership ranking | 1 |
Abstract (English) | Intelligent traditional Chinese medicine (TCM) has become a popular research field by means of prospering of deep learning technology. Important achievements have been made in such representative tasks as automatic diagnosis of TCM syndromes and diseases and generation of TCM herbal prescriptions. However, one unavoidable issue that still hinders its progress is the lack of labeled samples, i.e., the TCM medical records. As an efficient tool, the named entity recognition (NER) models trained on various TCM resources can effectively alleviate this problem and continuously increase the labeled TCM samples. In this work, on the basis of in-depth analysis, we argue that the performance of the TCM named entity recognition model can be better by using the character-level representation and tagging and propose a novel word-character integrated self-attention module. With the help of TCM doctors and experts, we define 5 classes of TCM named entities and construct a comprehensive NER dataset containing the standard content of the publications and the clinical medical records. The experimental results on this dataset demonstrate the effectiveness of the proposed module. |
Funding project | Program of Liaoning Provincial Natural Science Foundation [2019-KF-03-03]; National Natural Science Foundation of China [62003335] |
WOS keywords | MEDICINE |
WOS research area | Health Care Sciences & Services |
Language | English |
WOS accession number | WOS:000687452300001 |
Funding agency | Program of Liaoning Provincial Natural Science Foundation [2019-KF-03-03]; National Natural Science Foundation of China (NSFC) [62003335] |
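Content type | Journal article |
Source URL | [http://ir.sia.cn/handle/173321/29508] |
Collection | Shenyang Institute of Automation, Digital Factory Research Laboratory (沈阳自动化研究所_数字工厂研究室) |
Corresponding author | Li, Yan; Fu DZ(付殿峥) |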
Author affiliations | 1. University of Chinese Academy of Sciences, Beijing 100049, China; 2. Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110016, China; 3. Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China; 4. Department of Infectious Diseases, Dongfang Hospital of Beijing University of Chinese Medicine, Beijing 100078, China; 5. Education Section, Dongzhimen Hospital of Beijing University of Chinese Medicine, Beijing 101121, China; 6. School of Information Science and Engineering, Shenyang University of Technology, Shenyang, China; 7. College of Electrical Engineering, Zhejiang University, Hangzhou 310027, China |
Recommended citation (GB/T 7714) | Liu Z,Luo, Changyong,Zheng ZY,et al. TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition[J]. JOURNAL OF HEALTHCARE ENGINEERING,2021,2021:1-10. |
APA | Liu Z.,Luo, Changyong.,Zheng ZY.,Li, Yan.,Fu DZ.,...&Zhao, Jiawei.(2021).TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition.JOURNAL OF HEALTHCARE ENGINEERING,2021,1-10. |
MLA | Liu Z,et al."TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition".JOURNAL OF HEALTHCARE ENGINEERING 2021(2021):1-10. |