1. Zhong, X., Li, J., Huang, W., Xie, L.: Deep multi-label hashing for image retrieval. In: IEEE International Conference on Tools with Artificial Intelligence (ICTAI) (2019)
2. Xu, K., Ba, J., Kiros, R., Courville, A., Salakhutdinov, R., Zemel, R., Bengio, Y.: Show, attend and tell: neural image caption generation with visual attention. arXiv preprint
arXiv:1502.03044
(2015)
3. Vinyals, O., Toshev, A., Bengio, S., Erhan, D.: Show and tell: a neural image caption generator. In: CVPR, pp. 3156–3164 (2015)
4. Lu, J., Xiong, C., Parikh, D., Socher, R.: Knowing when to look: adaptive attention via a visual sentinel for image captioning. arXiv preprint
arXiv:1612.01887
(2016)
5. Chen, X., Fang, H., Lin, T.-Y., Vedantam, R., Gupta, S., Dollar, P., Zitnick, C.L.: Microsoft coco captions: data collection and evaluation server. arXiv preprint
arXiv:1504.00325
(2015)