TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization
Xiao-Yu Zhang3
刊名Machine Intelligence Research
2022
卷号19页码:227-246
关键词Knowledge transfer weakly supervised learning self-attention mechanism representation learning action localization
ISSN号2731-538X
DOI10.1007/s11633-022-1333-4
英文摘要Action recognition and localization in untrimmed videos is important for many applications and have attracted a lot of attention. Since full supervision with frame-level annotation places an overwhelming burden on manual labeling effort, learning with weak video-level supervision becomes a potential solution. In this paper, we propose a novel weakly supervised framework to recognize actions and locate the corresponding frames in untrimmed videos simultaneously. Considering that there are abundant trimmed videos publicly available and well-segmented with semantic descriptions, the instructive knowledge learned on trimmed videos can be fully leveraged to analyze untrimmed videos. We present an effective knowledge transfer strategy based on inter-class semantic relevance. We also take advantage of the self-attention mechanism to obtain a compact video representation, such that the influence of background frames can be effectively eliminated. A learning architecture is designed with twin networks for trimmed and untrimmed videos, to facilitate transferable self-attentive representation learning. Extensive experiments are conducted on three untrimmed benchmark datasets (i.e., THUMOS14, ActivityNet1.3, and MEXaction2), and the experimental results clearly corroborate the efficacy of our method. It is especially encouraging to see that the proposed weakly supervised method even achieves comparable results to some fully supervised methods.
内容类型期刊论文
源URL[http://ir.ia.ac.cn/handle/173211/48809]  
专题自动化研究所_学术期刊_International Journal of Automation and Computing
作者单位1.School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
2.School of Computer Science, Beijing Institute of Technology, Beijing 100081, China
3.Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
推荐引用方式
GB/T 7714
Xiao-Yu Zhang. TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization[J]. Machine Intelligence Research,2022,19:227-246.
APA Xiao-Yu Zhang.(2022).TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization.Machine Intelligence Research,19,227-246.
MLA Xiao-Yu Zhang."TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization".Machine Intelligence Research 19(2022):227-246.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace