Extracting Effective Image Attributes with Refined Universal Detection
Yu, Qiang1,3; Xiao, Xinyu1,3; Zhang, Chunxia2; Song, Lifei3; Pan, Chunhong1
刊名SENSORS
2021
卷号21期号:1页码:16
关键词attribute extraction Refined Universal Detection word tree image captioning
DOI10.3390/s21010095
英文摘要

Recently, image attributes containing high-level semantic information have been widely used in computer vision tasks, including visual recognition and image captioning. Existing attribute extraction methods map visual concepts to the probabilities of frequently-used words by directly using Convolutional Neural Networks (CNNs). Typically, two main problems exist in those methods. First, words of different parts of speech (POSs) are handled in the same way, but non-nominal words can hardly be mapped to visual regions through CNNs only. Second, synonymous nominal words are treated as independent and different words, in which similarities are ignored. In this paper, a novel Refined Universal Detection (RUDet) method is proposed to solve these two problems. Specifically, a Refinement (RF) module is designed to extract refined attributes of non-nominal words based on the attributes of nominal words and visual features. In addition, a Word Tree (WT) module is constructed to integrate synonymous nouns, which ensures that similar words hold similar and more accurate probabilities. Moreover, a Feature Enhancement (FE) module is adopted to enhance the ability to mine different visual concepts in different scales. Experiments conducted on the large-scale Microsoft (MS) COCO dataset illustrate the effectiveness of our proposed method.

资助项目National Key Research and Development Program of China[2020AAA0104903] ; National Natural Science Foundation of China[62072039] ; National Natural Science Foundation of China[91646207] ; National Natural Science Foundation of China[62071466]
WOS研究方向Chemistry ; Engineering ; Instruments & Instrumentation
语种英语
出版者MDPI
WOS记录号WOS:000606121300001
资助机构National Key Research and Development Program of China ; National Natural Science Foundation of China
内容类型期刊论文
源URL[http://ir.ia.ac.cn/handle/173211/42578]  
专题自动化研究所_模式识别国家重点实验室_遥感图像处理团队
通讯作者Xiao, Xinyu
作者单位1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
2.Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
3.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
推荐引用方式
GB/T 7714
Yu, Qiang,Xiao, Xinyu,Zhang, Chunxia,et al. Extracting Effective Image Attributes with Refined Universal Detection[J]. SENSORS,2021,21(1):16.
APA Yu, Qiang,Xiao, Xinyu,Zhang, Chunxia,Song, Lifei,&Pan, Chunhong.(2021).Extracting Effective Image Attributes with Refined Universal Detection.SENSORS,21(1),16.
MLA Yu, Qiang,et al."Extracting Effective Image Attributes with Refined Universal Detection".SENSORS 21.1(2021):16.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace