Author:
Singh Yajush Pratap,Ezaz Ahmed Sayed Abu Lais,Singh Prabhishek,Kumar Neeraj,Diwakar Manoj
Abstract
Abstract
In modern science there is a rapid development of artificial intelligence, image processing has gradually fascinated and inspired the attention of many researchers in the field of artificial intelligence and has become an interesting and demanding task. The main idea of Image caption is to automatically generate natural language descriptions according to the information observed in an image, this is an important portion of scene understanding, which combines all the knowledge and information available of computer vision and natural language processing. The use of image caption is broad and noteworthy, for example, the understanding of human-computer collaboration. This paper reviews the related methods and focuses on the attention mechanism, which plays a vital role in computer vision and is broadly used in image caption generation tasks. Furthermore, the advantages and the shortcomings of these methods are discussed, providing the commonly used datasets and evaluation criteria in this field. Finally, this paper proposes some open challenges in the image caption task.
Subject
General Physics and Astronomy
Reference24 articles.
1. Convolutional image captioning;Aneja
2. Boosting image captioning with attributes;Yao
3. Areas of attention for image captioning;Pedersoli
4. Paying attention to descriptions generated by image captioning models;Tavakoli
5. SemStyle: learning to generate stylised image captions using unaligned text;Mathews
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献