FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients

Author:

Fan Yiming, Wang Hewei, Zhu Xiaoyu, Cao Xiangming, Yi Chuanjian, Chen Yao, Jia Jie, Lu Xiaofeng

Abstract

Early rehabilitation at an appropriate intensity contributes to the physical recovery of stroke survivors. In clinical practice, physicians judge whether the training intensity is suitable for rehabilitation from patients’ narratives, training scores, and evaluation scales, which places heavy pressure on medical resources. In this study, a lightweight facial expression recognition algorithm is proposed to assess stroke patients’ training motivation automatically. First, the properties of convolution are introduced into the Vision Transformer’s structure, allowing the model to extract both local and global features of facial expressions. Second, the pyramid-shaped feature output mode of Convolutional Neural Networks is introduced to reduce the model’s parameters and computational cost significantly. Moreover, a classifier better suited to the facial expressions of stroke patients is designed to improve performance further. We verified the proposed algorithm on the Real-world Affective Faces Database (RAF-DB), the Face Expression Recognition Plus Dataset (FER+), and a private dataset of stroke patients. Experiments show that the backbone network of the proposed algorithm achieves better performance than Pyramid Vision Transformer (PvT) and Convolutional Vision Transformer (CvT) with fewer parameters and fewer floating-point operations (FLOPs). In addition, the algorithm reaches an accuracy of 89.44% on the RAF-DB dataset, which is higher than that reported in other recent studies. In particular, it obtains an accuracy of 99.81% on the private dataset with only 4.10M parameters.
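As a rough illustration of why a pyramid-shaped feature output can lower the computational cost of a Transformer, the sketch below counts the tokens left after each downsampling stage and the number of token pairs that self-attention would compare. The input size (224), the per-stage strides (4, 2, 2), and the function names are assumptions chosen for illustration; they are not taken from the paper and do not describe the actual FER-PCVT architecture.

```python
def pyramid_token_counts(image_size, strides):
    """Return the number of tokens (patches) remaining after each stage.

    Each stage shrinks the side of the feature map by its stride,
    so the token count is side * side.
    """
    counts = []
    side = image_size
    for stride in strides:
        side //= stride
        counts.append(side * side)
    return counts


def attention_pair_count(num_tokens):
    """Self-attention compares every token with every token: N * N pairs."""
    return num_tokens * num_tokens


# Assumed values for illustration only (not from the paper).
IMAGE_SIZE = 224
STRIDES = (4, 2, 2)

tokens = pyramid_token_counts(IMAGE_SIZE, STRIDES)
pairs = [attention_pair_count(n) for n in tokens]

for stage, (n, p) in enumerate(zip(tokens, pairs), start=1):
    print(f"stage {stage}: {n} tokens, {p} attention pairs")
```

Under these assumed strides, halving the feature-map side between stages cuts the token count by a factor of 4 and the number of attention pairs by a factor of 16, which is the general mechanism by which pyramid designs save computation in later stages.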

Funder

Wuxi Municipal Health Commission Translational Medicine Research Project in 2021

Scientific and Technological Innovation Plan of Shanghai STC

China National Nature Science Young Foundation

Shanghai Sailing Program

Key Subjects Construction Program of the Health System in Jing’an District

National Key R&D Program of China

Publisher

MDPI AG

Subject

General Neuroscience


Cited by 1 article.
