Author:
Dai Bo,Ye Deming,Lin Dahua
Publisher
Springer International Publishing
Reference42 articles.
1. Vinyals, O., Toshev, A., Bengio, S., Erhan, D.: Show and tell: a neural image caption generator. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3156–3164 (2015)
2. Xu, K., et al.: Show, attend and tell: Neural image caption generation with visual attention. In: ICML, vol. 14, pp. 77–81 (2015)
3. Rennie, S.J., Marcheret, E., Mroueh, Y., Ross, J., Goel, V.: Self-critical sequence training for image captioning. arXiv preprint arXiv:1612.00563 (2016)
4. Lu, J., Xiong, C., Parikh, D., Socher, R.: Knowing when to look: Adaptive attention via a visual sentinel for image captioning. arXiv preprint arXiv:1612.01887 (2016)
5. Cho, K., et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014)
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential Deepfake Detection;IEEE Transactions on Information Forensics and Security;2024
2. Research on Image Captioning Based on Vision-language Pre-trained Models;2023 9th International Conference on Big Data and Information Analytics (BigDIA);2023-12-15
3. A Context Semantic Auxiliary Network for Image Captioning;Information;2023-07-20
4. From Show to Tell: A Survey on Deep Learning-Based Image Captioning;IEEE Transactions on Pattern Analysis and Machine Intelligence;2023-01-01
5. Effectuating Communication for the Deaf and Hard-of-Hearing: An Ethnographic Review;2022 International Conference on Electrical and Computing Technologies and Applications (ICECTA);2022-11-23