TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition
Liu Z(刘智)1,2,3; Luo, Changyong4; Zheng ZY(郑泽宇)1,2; Li, Yan5; Fu DZ(付殿峥)1,2; Yu, Xinzhu6; Zhao, Jiawei7
刊名JOURNAL OF HEALTHCARE ENGINEERING
2021
卷号2021页码:1-10
ISSN号2040-2295
产权排序1
英文摘要

Intelligent traditional Chinese medicine (TCM) has become a popular research field by means of prospering of deep learning technology. Important achievements have been made in such representative tasks as automatic diagnosis of TCM syndromes and diseases and generation of TCM herbal prescriptions. However, one unavoidable issue that still hinders its progress is the lack of labeled samples, i.e., the TCM medical records. As an efficient tool, the named entity recognition (NER) models trained on various TCM resources can effectively alleviate this problem and continuously increase the labeled TCM samples. In this work, on the basis of in-depth analysis, we argue that the performance of the TCM named entity recognition model can be better by using the character-level representation and tagging and propose a novel word-character integrated self-attention module. With the help of TCM doctors and experts, we define 5 classes of TCM named entities and construct a comprehensive NER dataset containing the standard content of the publications and the clinical medical records. The experimental results on this dataset demonstrate the effectiveness of the proposed module.

资助项目Program of Liaoning Provincial Natural Science Foundation[2019-KF-03-03] ; National Natural Science Foundation of China[62003335]
WOS关键词MEDICINE
WOS研究方向Health Care Sciences & Services
语种英语
WOS记录号WOS:000687452300001
资助机构Program of Liaoning Provincial Natural Science Foundation [2019-KF-03-03] ; National Natural Science Foundation of ChinaNational Natural Science Foundation of China (NSFC) [62003335]
内容类型期刊论文
源URL[http://ir.sia.cn/handle/173321/29508]  
专题沈阳自动化研究所_数字工厂研究室
通讯作者Li, Yan; Fu DZ(付殿峥)
作者单位1.University of Chinese Academy of Sciences, Beijing 100049, China
2.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110016, China
3.Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China
4.Department of Infectious Diseases, Dongfang Hospital of Beijing University of Chinese Medicine, Beijing 100078, China
5.Education Section, Dongzhimen Hospital of Beijing University of Chinese Medicine, Beijing 101121, China
6.School of Information Science and Engineering, Shenyang University of Technology, Shenyang, China
7.College of Electrical Engineering, Zhejiang University, Hangzhou 310027, China
推荐引用方式
GB/T 7714
Liu Z,Luo, Changyong,Zheng ZY,et al. TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition[J]. JOURNAL OF HEALTHCARE ENGINEERING,2021,2021:1-10.
APA Liu Z.,Luo, Changyong.,Zheng ZY.,Li, Yan.,Fu DZ.,...&Zhao, Jiawei.(2021).TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition.JOURNAL OF HEALTHCARE ENGINEERING,2021,1-10.
MLA Liu Z,et al."TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition".JOURNAL OF HEALTHCARE ENGINEERING 2021(2021):1-10.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace