Author:
Zhong Xian,Nie Guozhang,Huang Wenxin,Liu Wenxuan,Ma Bo,Lin Chia-Wen
Subject
Electrical and Electronic Engineering,Computer Vision and Pattern Recognition,Media Technology,Signal Processing
Reference46 articles.
1. Deep visual-semantic alignments for generating image descriptions;Karpathy;IEEE Trans. Pattern Anal. Mach. Intell.,2017
2. T. Yao, Y. Pan, Y. Li, T. Mei, Exploring visual relationship for image captioning, in: Proc. Springer Eur. Conf. Comput. Vis. (ECCV), 2018, pp. 711–727.
3. L. Li, S. Tang, L. Deng, Y. Zhang, Q. Tian, Image caption with global-local attention, in: Proc. Conf. Artif. Intell. (AAAI), 2017, pp. 4133–4139.
4. Babytalk: Understanding and generating simple image descriptions;Kulkarni;IEEE Trans. Pattern Anal. Mach. Intell.,2013
5. H. Hu, J. Gu, Z. Zhang, J. Dai, Y. Wei, Relation networks for object detection, in: Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), 2018, pp. 3588–3597.
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. GVA: guided visual attention approach for automatic image caption generation;Multimedia Systems;2024-01-29
2. Automatic Generation of Pantograph Image Caption Based on Deep Learning;Proceedings of the 6th International Conference on Electrical Engineering and Information Technologies for Rail Transportation (EITRT) 2023;2024
3. Transformer-based local-global guidance for image captioning;Expert Systems with Applications;2023-08
4. A Context Semantic Auxiliary Network for Image Captioning;Information;2023-07-20
5. Background Disturbance Mitigation for Video Captioning Via Entity-Action Relocation;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04