Addressing Inaccurate Nosology in Mental Health: A Multi Label Data Cleansing Approach
      for Detecting Label Noise from Structural Magnetic Resonance Imaging Data in Mood
      and Psychosis Disorders via Deep Learning

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Mental health diagnostic approaches are seeking to identify biological markers to work alongside of advanced machine learning approaches. It is difficult to identify a biological marker of disease when the traditional diagnostic labels themselves are not necessarily valid. To begin to address this, we worked with brain imaging data collected from individuals with mood and psychosis disorders from over 1400 individuals comprising healthy controls, psychosis patients and their unaffected first-degree relatives and we assumed there may be noise in the diagnostic labelling process. We detected label noise by classifying the data multiple times using a support vector machine classifier and then we retained those individuals in which all classifiers unanimously mislabeled those subjects. Next we assigned a new diagnostic label to these individuals, based on the biological data, using an iterative data cleansing approach. Simulation results showed our method was highly accurate in identifying label noise. We evaluated our method via a deep learning model which shows performance improvement of model on the cleansed dataset. Both diagnostic and Biotype categories showed a large percentage of noisy labels with the largest amount of relabeling occurring between the healthy control and bipolar and schizophrenia disorder individuals as well as in the unaffected close relatives. Extraction of imaging features highlighted regional brain changes associated with each group. In sum, this approach represents an initial step towards developing approaches that need not assume existing mental health diagnostic categories are always valid, but rather allows us to leverage this information while also acknowledging that there are mis-assignments.

Related collections

Author and article information

Journal

Journal ID (nlm-journal-id): 101671285

Journal ID (pubmed-jr-id): 44575

Journal ID (nlm-ta): Biol Psychiatry Cogn Neurosci Neuroimaging

Journal ID (iso-abbrev): Biol Psychiatry Cogn Neurosci Neuroimaging

Title: Biological psychiatry. Cognitive neuroscience and neuroimaging

ISSN (Print): 2451-9022

ISSN (Electronic): 2451-9030

Publication date Nihms-submitted: 17 December 2020

Publication date (Electronic): 25 May 2020

Publication date (Print): August 2020

Publication date PMC-release: 25 December 2020

Volume: 5

Issue: 8

Pages: 819-832

Affiliations

[a) ]Department of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA

[b) ]Tri-institutional Center of Translational Research in Neuroimaging & Data Science (TReNDS), Georgia State University, Georgia Institute of Technology, Emory, 55 Park Pl NE, 18th Floor, Atlanta, GA, USA

[c) ]Department of Psychiatry, Yale University, New Haven, CT, USA

[d) ]Department of Computer Science, Georgia State University, Atlanta, GA, USA

[e) ]Department of Psychology, Georgia State University, Atlanta, GA, USA

[f) ]Olin Neuropsychiatry Research Center, Hartford Hospital, Hartford, CT, USA

[g) ]Department of Neuroscience, Yale University, New Haven, CT, USA

Article

Accession ID: PMC7760893 Pmcid ID: PMC7760893 Pmc-uid ID: 7760893 Manuscript ID: nihpa1655334

DOI: 10.1016/j.bpsc.2020.05.008

PMC ID: 7760893

PubMed ID: 32771180

SO-VID: 997ced24-08c3-4ebb-800d-f7211b9cb7bb

History

Comments

Comment on this article

scite_

Cited by 6

See all cited by

Addressing Inaccurate Nosology in Mental Health: A Multi Label Data Cleansing Approach for Detecting Label Noise from Structural Magnetic Resonance Imaging Data in Mood and Psychosis Disorders via Deep Learning

Read this article at