A Class-specific Copy Network for Handling the Rare Word Problem in Neural Machine Translation
Feng Wang; Wei Chen; Zhen yang; Xiaowei Zhang; Shuang Xu; Bo Xu
2017
会议日期14-19 May 2017
会议地点Anchorage, AK, USA
关键词Neural Machine Translation (Nmt) Copy Network Rare Word
页码2658-2664
英文摘要Neural machine translation (NMT) has shown promising results and rapidly gained adoption in many large-scale settings. With the NMT model being widely used in empirical productions, its long-standing weakness in handling the rare and out of vocabulary words has been amplified a lot. In order to release the model from the stress of “understanding” the rare words, copy mechanism has been proposed to deal with the rare and unseen words for the neural network models using attention. However the negative side of the copy mechanism is that the model is only able to decide whether to copy or not. It is unable to detect which class should the rare word be copied to, such as person, location, and organization. This paper deeply investigates this limitation of the NMT model. As a result, we propose a new NMT model by novelly incorporating a class-specific copy network. With the network, the proposed NMT model is able to decide which class the words in the target belong to and which class in the source should be copied to. Experimental results on Chinese-English translation tasks show that the proposed model outperforms the traditional NMT model with a large margin especially for sentences containing the rare words.
会议录IJCNN
语种英语
内容类型会议论文
源URL[http://ir.ia.ac.cn/handle/173211/41055]  
专题数字内容技术与服务研究中心_听觉模型与认知计算
通讯作者Wei Chen
推荐引用方式
GB/T 7714
Feng Wang,Wei Chen,Zhen yang,et al. A Class-specific Copy Network for Handling the Rare Word Problem in Neural Machine Translation[C]. 见:. Anchorage, AK, USA. 14-19 May 2017.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace