Image Description Method Based on Generative Adversarial Networks

微信服务号

微信订阅号

Home > Archive>Volume 29, Issue S2, 2018 >30-43

Image Description Method Based on Generative Adversarial Networks
DOI:
                        
Author:
                        
Affiliation:
Clc Number:
Fund Project:Basal Research Fund of Academy of Broadcasting Science, National Radio and Television Administration (130016018000123)

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

In recent years, deep learning has gained more and more attention in image description. The existing deep learning methods using CNNs to extract features and RNNs to fold into one sentence. Nevertheless, when dealing with complex images, the feature extraction is inaccurate. And the fixed mode of sentence generation model leads to inconsistent sentences. To solve this problem, this study proposes a method combine channel-wise attention model and GANs, named CACNN-GAN. The channel-wise attention mechanism is added to each conv-layer to extract features, compare with the COCO dataset, and select the top features to generate sentence. Using GANs to generate the sentences, which is generated by the game process between the generator and the discriminator. After that, we can get a sentence generator contains the varied syntax, smooth sentence, and rich vocabulary. Experiments on real datasets illustrates that CACNN-GAN can effectively describe images, and get higher accuracy compared with the state-of-art.

Reference

Cited by

Get Citation

薛子育,郭沛宇,祝晓斌,张乃光.一种基于生成式对抗网络的图像描述方法.软件学报,2018,29(S2):30-43

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:April 16,2018
Revised:
Adopted:
Online: August 07,2019
Published:

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

Article Metrics

History