Vision transformer architecture and applications in digital health: a tutorial and survey

Published:2023-07-10 Issue:1 Volume:6 Page:
ISSN:2524-4442
Container-title:Visual Computing for Industry, Biomedicine, and Art
language:en
Short-container-title:Vis. Comput. Ind. Biomed. Art

Author:

Al-hammuri Khalid, Gebali Fayez, Kanan Awos, Chelvan Ilamparithi Thirumarai

Abstract

The vision transformer (ViT) is a state-of-the-art architecture for image recognition tasks that plays an important role in digital health applications. Medical images account for 90% of the data in digital medicine applications. This article discusses the core foundations of the ViT architecture and its digital health applications. These applications include image segmentation, classification, detection, prediction, reconstruction, synthesis, and telehealth such as report generation and security. This article also presents a roadmap for implementing the ViT in digital health systems and discusses its limitations and challenges.

Publisher

Springer Science and Business Media LLC

Subject

Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition,Visual Arts and Performing Arts,Medicine (miscellaneous),Computer Science (miscellaneous),Software

Link

https://link.springer.com/content/pdf/10.1186/s42492-023-00140-9.pdf

Reference: 114 articles.

1. Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai XH, Unterthiner T et al (2021) An image is worth 16x16 words: transformers for image recognition at scale. In: Proceedings of the 9th international conference on learning representations, OpenReview.net, Vienna, 3-7 May 2021

2. Zhang QM, Xu YF, Zhang J, Tao DC (2023) ViTAEv2: vision transformer advanced by exploring inductive bias for image recognition and beyond. Int J Comput Vis 131(5):1141-1162. https://doi.org/10.1007/s11263-022-01739-w

3. Han K, Wang YH, Chen HT, Chen XH, Guo JY, Liu ZH et al (2023) A survey on vision transformer. IEEE Trans Pattern Anal Mach Intell 45(1):87-110. https://doi.org/10.1109/TPAMI.2022.3152247

4. Wang RS, Lei T, Cui RX, Zhang BT, Meng HY, Nandi AK (2022) Medical image segmentation using deep learning: a survey. IET Image Process 16(5):1243-1267. https://doi.org/10.1049/ipr2.12419

5. Bai WJ, Suzuki H, Qin C, Tarroni G, Oktay O, Matthews PM et al (2018) Recurrent neural networks for aortic image sequence segmentation with sparse annotations. In: Frangi AF, Schnabel JA, Davatzikos C, Alberola-López C, Fichtinger G (eds) Medical image computing and computer assisted intervention. 21st international conference, Granada, September 2018. Lecture notes in computer science (Image processing, computer vision, pattern recognition, and graphics), vol 11073. Springer, Cham, pp 586-594. https://doi.org/10.1007/978-3-030-00937-3_67

Cited by 4 articles.

1. Multi-task approach based on combined CNN-transformer for efficient segmentation and classification of breast tumors in ultrasound images;Visual Computing for Industry, Biomedicine, and Art;2024-01-26

2. Transformer-Based Automated Segmentation of the Median Nerve in Ultrasound Videos of Wrist-to-Elbow Region;IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control;2024-01

3. How Artificial Intelligence Is Shaping Medical Imaging Technology: A Survey of Innovations and Applications;Bioengineering;2023-12-18

4. Zero Trust Context-Aware Access Control Framework for IoT Devices in Healthcare Cloud AI Ecosystem;2023-09-15