Author:
Cui Quan,Zhou Boyan,Guo Yu,Yin Weidong,Wu Hao,Yoshie Osamu,Chen Yubo
Publisher
Springer Nature Switzerland
Reference52 articles.
1. Antol, S., et al.: VQA: visual question answering. In: ICCV (2015)
2. Changpinyo, S., Sharma, P., Ding, N., Soricut, R.: Conceptual 12M: pushing web-scale image-text pre-training to recognize long-tail visual concepts. In: CVPR (2021)
3. Chen, H., Ding, G., Liu, X., Lin, Z., Liu, J., Han, J.: IMRAM: iterative matching with recurrent attention memory for cross-modal image-text retrieval. In: CVPR (2020)
4. Chen, J., Hu, H., Wu, H., Jiang, Y., Wang, C.: Learning the best pooling strategy for visual semantic embedding. In: CVPR (2021)
5. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. In: ICML (2020)
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. FAIR: Flow Type-Aware Pre-Training of Compiler Intermediate Representations;Proceedings of the 46th IEEE/ACM International Conference on Software Engineering;2024-02-06
2. Semantically Enhanced Scene Captions with Physical and Weather Condition Changes;2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW);2023-10-02
3. Dynamic Texts From UAV Perspective Natural Images;2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW);2023-10-02
4. Multi-Modal Representation Learning with Text-Driven Soft Masks;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06
5. Enhancing Automatic Placenta Analysis Through Distributional Feature Recomposition in Vision-Language Contrastive Learning;Lecture Notes in Computer Science;2023