Author:
Lu Siyu,Liu Zheng,Liu Tianlin,Zhou Wangchunshu
Subject
Electrical and Electronic Engineering,Artificial Intelligence,Control and Systems Engineering
Reference78 articles.
1. Vqa-med: Overview of the medical visual question answering task at imageclef 2019;Abacha;CLEF (Work. Not.),2019
2. Fusion of detected objects in text for visual question answering;Alberti,2019
3. Alsentzer, Emily, Murphy, John, Boag, William, Weng, Wei-Hung, Jindi, Di, Naumann, Tristan, McDermott, Matthew, 2019. Publicly available clinical BERT embeddings. In: Proceedings of the 2nd Clinical Natural Language Processing Workshop.
4. Cadene, Remi, Ben-Younes, Hedi, Cord, Matthieu, Thome, Nicolas, 2019. Murel: Multimodal relational reasoning for visual question answering. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 1989–1998.
5. Chen, Hui, Ding, Guiguang, Liu, Xudong, Lin, Zijia, Liu, Ji, Han, Jungong, 2020. Imram: Iterative matching with recurrent attention memory for cross-modal image-text retrieval. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 12655–12663.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献