Cross-Modal Cross-Domain Moment Alignment Network for Person Search
Jing Y(荆雅)2,3; Wang W(王威)2,3; Wang L(王亮)1,2,3; Tan TN(谭铁牛)1,2,3
2020-06
会议日期2020-6
会议地点virtual
英文摘要

Text-based person search has drawn increasing attention due to its wide applications in video surveillance. However, most of the existing models depend heavily on paired image-text data, which is very expensive to acquire. Moreover, they always face huge performance drop when directly exploiting them to new domains. To overcome this problem, we make the first attempt to adapt the model to new target domains in the absence of pairwise labels, which combines the challenges from both cross-modal (text-based) person search and cross-domain person search. Specially, we propose a moment alignment network (MAN) to solve the cross-modal cross-domain person search task in this paper. The idea is to learn three effective moment alignments including domain alignment (DA), cross-modal alignment (CA) and exemplar alignment (EA), which together can learn domain-invariant and semantic aligned cross-modal representations to improve model generalization. Extensive experiments are conducted on CUHK Person Description dataset (CUHK-PEDES) and Richly Annotated Pedestrian dataset (RAP). Experimental results show that our proposed model achieves the state-of-the-art performances on five transfer tasks.
 

内容类型会议论文
源URL[http://ir.ia.ac.cn/handle/173211/44446]  
专题自动化研究所_智能感知与计算研究中心
通讯作者Wang W(王威)
作者单位1.Center for Excellence in Brain Science and Intelligence Technology (CEBSIT), Institute of Automation, Chinese Academy of Sciences (CASIA)
2.University of Chinese Academy of Sciences (UCAS)
3.Center for Research on Intelligent Perception and Computing (CRIPAC), National Laboratory of Pattern Recognition (NLPR)
推荐引用方式
GB/T 7714
Jing Y,Wang W,Wang L,et al. Cross-Modal Cross-Domain Moment Alignment Network for Person Search[C]. 见:. virtual. 2020-6.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace