Learning to assess visual aesthetics of food images
Sheng, Kekai2,3; Dong, Weiming3; Huang, Haibin5; Chai, Menglei1; Zhang, Yong4; Ma, Chongyang5; Hu, Bao-Gang3
刊名COMPUTATIONAL VISUAL MEDIA
2021-03-01
卷号7期号:1页码:139-152
关键词image aesthetic assessment food image analysis dataset regularization
ISSN号2096-0433
DOI10.1007/s41095-020-0193-5
通讯作者Dong, Weiming(weiming.dong@ia.ac.cn)
英文摘要Distinguishing aesthetically pleasing food photos from others is an important visual analysis task for social media and ranking systems related to food. Nevertheless, aesthetic assessment of food images remains a challenging and relatively unexplored task, largely due to the lack of related food image datasets and practical knowledge. Thus, we present the Gourmet Photography Dataset (GPD), the first large-scale dataset for aesthetic assessment of food photos. It contains 24,000 images with corresponding binary aesthetic labels, covering a large variety of foods and scenes. We also provide a non-stationary regularization method to combat over-fitting and enhance the ability of tuned models to generalize. Quantitative results from extensive experiments, including a generalization ability test, verify that neural networks trained on the GPD achieve comparable performance to human experts on the task of aesthetic assessment. We reveal several valuable findings to support further research and applications related to visual aesthetic analysis of food images. To encourage further research, we have made the GPD publicly available at https://github.com/Openning07/GPA.
资助项目National Natural Science Foundation of China[61832016] ; National Natural Science Foundation of China[61672520] ; CASIA-Tencent Youtu joint research project
WOS关键词NETWORKS
WOS研究方向Computer Science
语种英语
出版者SPRINGERNATURE
WOS记录号WOS:000648692800009
资助机构National Natural Science Foundation of China ; CASIA-Tencent Youtu joint research project
内容类型期刊论文
源URL[http://ir.ia.ac.cn/handle/173211/44633]  
专题自动化研究所_模式识别国家重点实验室_多媒体计算与图形学团队
通讯作者Dong, Weiming
作者单位1.Snap Inc, Santa Monica, CA 90405 USA
2.Tencent, Youtu Lab, Shanghai 200233, Peoples R China
3.Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China
4.Tencent Inc, AI Lab, Shenzhen 518000, Peoples R China
5.Kuaishou Technol, Beijing 100085, Peoples R China
推荐引用方式
GB/T 7714
Sheng, Kekai,Dong, Weiming,Huang, Haibin,et al. Learning to assess visual aesthetics of food images[J]. COMPUTATIONAL VISUAL MEDIA,2021,7(1):139-152.
APA Sheng, Kekai.,Dong, Weiming.,Huang, Haibin.,Chai, Menglei.,Zhang, Yong.,...&Hu, Bao-Gang.(2021).Learning to assess visual aesthetics of food images.COMPUTATIONAL VISUAL MEDIA,7(1),139-152.
MLA Sheng, Kekai,et al."Learning to assess visual aesthetics of food images".COMPUTATIONAL VISUAL MEDIA 7.1(2021):139-152.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace