Automated assessment of psychiatric disorders using speech: A systematic review-Reference-Cited by-同舟云学术

Automated assessment of psychiatric disorders using speech: A systematic review

Published:2020-01-31 Issue:1 Volume:5 Page:96-116
ISSN:2378-8038
Container-title:Laryngoscope Investigative Otolaryngology
language:en
Short-container-title:Laryngoscope Investig Oto

Author:

Low Daniel M.¹²^ORCID,Bentley Kate H.³⁴,Ghosh Satrajit S.¹⁴⁵^ORCID

Affiliation:

1. Program in Speech and Hearing Bioscience and Technology, Harvard Medical School Boston Massachusetts

2. Department of Brain and Cognitive Sciences MIT Cambridge Massachusetts

3. Department of Psychiatry Massachusetts General Hospital/Harvard Medical School Boston Massachusetts

4. McGovern Institute for Brain Research, MIT Cambridge Massachusetts

5. Department of Otolaryngology, Head and Neck Surgery Harvard Medical School Boston Massachusetts

Abstract

AbstractObjectiveThere are many barriers to accessing mental health assessments including cost and stigma. Even when individuals receive professional care, assessments are intermittent and may be limited partly due to the episodic nature of psychiatric symptoms. Therefore, machine‐learning technology using speech samples obtained in the clinic or remotely could one day be a biomarker to improve diagnosis and treatment. To date, reviews have only focused on using acoustic features from speech to detect depression and schizophrenia. Here, we present the first systematic review of studies using speech for automated assessments across a broader range of psychiatric disorders.MethodsWe followed the Preferred Reporting Items for Systematic Reviews and Meta‐Analysis (PRISMA) guidelines. We included studies from the last 10 years using speech to identify the presence or severity of disorders within the Diagnostic and Statistical Manual of Mental Disorders (DSM‐5). For each study, we describe sample size, clinical evaluation method, speech‐eliciting tasks, machine learning methodology, performance, and other relevant findings.Results1395 studies were screened of which 127 studies met the inclusion criteria. The majority of studies were on depression, schizophrenia, and bipolar disorder, and the remaining on post‐traumatic stress disorder, anxiety disorders, and eating disorders. 63% of studies built machine learning predictive models, and the remaining 37% performed null‐hypothesis testing only. We provide an online database with our search results and synthesize how acoustic features appear in each disorder.ConclusionSpeech processing technology could aid mental health assessments, but there are many obstacles to overcome, especially the need for comprehensive transdiagnostic and longitudinal studies. Given the diverse types of data sets, feature extraction, computational methodologies, and evaluation criteria, we provide guidelines for both acquiring data and building machine learning models with a focus on testing hypotheses, open science, reproducibility, and generalizability.Level of Evidence3a

Funder

Gift to the McGovern Institute for Brain Research at MIT

MIT-Philips Research Award for Clinicians

National Institute of Health

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/lio2.354

Reference165 articles.

1. Lifetime Prevalence of Mental Disorders in U.S. Adolescents: Results from the National Comorbidity Survey Replication–Adolescent Supplement (NCS-A)

2. The economic costs of mental disorders

3. The effect of heart rate variability biofeedback training on stress and anxiety: a meta-analysis

Cited by 373 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Privacy-preserving feature extractor using adversarial pruning for TBI assessment from speech;Computer Speech & Language;2026-01

2. Transdiagnostic findings across major depressive disorder, bipolar disorder and schizophrenia: A qualitative review;Journal of Affective Disorders;2025-10

3. Eye-tracking metrics during image viewing as possible biomarkers of cognitive alterations: A systematic review and meta-analysis in people with bipolar disorder;Journal of Affective Disorders;2025-09

4. Associations between coherence and temporal parameters of narrative speech production in borderline personality disorder;Journal of Psychiatric Research;2025-08

5. Artificial intelligence in forensic mental health: A review of applications and implications;Journal of Forensic and Legal Medicine;2025-07