Image Captioning using Artificial Intelligence-Reference-Cited by-同舟云学术

Image Captioning using Artificial Intelligence

Published:2021-04-01 Issue:1 Volume:1854 Page:012048
ISSN:1742-6588
Container-title:Journal of Physics: Conference Series
language:
Short-container-title:J. Phys.: Conf. Ser.

Author:

Singh Yajush Pratap,Ezaz Ahmed Sayed Abu Lais,Singh Prabhishek,Kumar Neeraj,Diwakar Manoj

Abstract

Abstract In modern science there is a rapid development of artificial intelligence, image processing has gradually fascinated and inspired the attention of many researchers in the field of artificial intelligence and has become an interesting and demanding task. The main idea of Image caption is to automatically generate natural language descriptions according to the information observed in an image, this is an important portion of scene understanding, which combines all the knowledge and information available of computer vision and natural language processing. The use of image caption is broad and noteworthy, for example, the understanding of human-computer collaboration. This paper reviews the related methods and focuses on the attention mechanism, which plays a vital role in computer vision and is broadly used in image caption generation tasks. Furthermore, the advantages and the shortcomings of these methods are discussed, providing the commonly used datasets and evaluation criteria in this field. Finally, this paper proposes some open challenges in the image caption task.

Publisher

IOP Publishing

Subject

General Physics and Astronomy

Link

https://iopscience.iop.org/article/10.1088/1742-6596/1854/1/012048/pdf

Reference24 articles.

1. Convolutional image captioning;Aneja

2. Boosting image captioning with attributes;Yao

3. Areas of attention for image captioning;Pedersoli

4. Paying attention to descriptions generated by image captioning models;Tavakoli

5. SemStyle: learning to generate stylised image captions using unaligned text;Mathews

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Intelligent void identification of particle packing system of caved ore and rock;Engineering Applications of Artificial Intelligence;2024-11

2. Next Generation Digital Health Monitoring System Using Artificial Neural Network;2024 5th International Conference on Recent Trends in Computer Science and Technology (ICRTCST);2024-04-09

3. Piclingo: Multilingual Image Caption Generator;Information Systems Engineering and Management;2024

4. A real-time image captioning framework using computer vision to help the visually impaired;Multimedia Tools and Applications;2023-12-22

5. Evaluation of 5% Known And 95% Unknown Matter in Inner and Outer Universe Using Quantum Computing, Artificial Intelligence and Vedic Scripture;2023 International Conference on Energy, Materials and Communication Engineering (ICEMCE);2023-12-14