| Literature DB >> 34806996 |
Zainab Jan1, Noor Ai-Ansari2, Osama Mousa2, Alaa Abd-Alrazaq2, Arfan Ahmed2,3, Tanvir Alam2, Mowafa Househ2.
Abstract
BACKGROUND: Bipolar disorder (BD) is the 10th most common cause of frailty in young individuals and has triggered morbidity and mortality worldwide. Patients with BD have a life expectancy 9 to 17 years lower than that of normal people. BD is a predominant mental disorder, but it can be misdiagnosed as depressive disorder, which leads to difficulties in treating affected patients. Approximately 60% of patients with BD are treated for depression. However, machine learning provides advanced skills and techniques for better diagnosis of BD.Entities:
Keywords: bipolar disorder; clinical data; diagnosis; machine learning; mental health; scoping review; support vector machine
Mesh:
Year: 2021 PMID: 34806996 PMCID: PMC8663682 DOI: 10.2196/29749
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1PRISMA (Preferred Reporting Items for Systematics Reviews and Meta-Analyses) flow diagram.
General characteristics of the included studies (N=33).
| Characteristic | Studies, n (%) | |
|
| ||
|
| Research articles | 30 (91) |
|
| Conference proceedings | 3 (9) |
|
| ||
|
| Published | 33 (100) |
|
| ||
|
| China | 8 (24) |
|
| United States | 7 (21) |
|
| United Kingdom | 3 (9) |
|
| Canada | 2 (6) |
|
| Germany | 2 (6) |
|
| Brazil | 1 (3) |
|
| Japan | 1 (3) |
|
| Australia | 1 (3) |
|
| Italy | 1 (3) |
|
| Turkey | 1 (3) |
|
| Korea | 2 (6) |
|
| Norway | 1 (3) |
|
| Netherlands | 1 (3) |
|
| India | 1 (3) |
|
| Egypt | 1 (3) |
|
| ||
|
| 2021 | 6 (18) |
|
| 2020 | 5 (15) |
|
| 2019 | 7 (21) |
|
| 2018 | 7 (21) |
|
| 2017 | 3 (9) |
|
| 2016 | 5 (15) |
|
| ||
|
| Model development | 24 (73) |
|
| Evaluation | 5 (15) |
|
| Data analysis | 3 (9) |
|
| Model adaptation | 2 (6) |
|
| ||
|
| Bipolar disorder type 1 | 27 (82) |
|
| Bipolar disorder type 2 | 27 (82) |
|
| Psychotic bipolar | 3 (9) |
|
| Chronic bipolar | 2 (6) |
|
| First episode bipolar | 1 (3) |
|
| ||
|
| Machine learning | 33 (100) |
|
| Deep learning | 3 (9) |
|
| ||
|
| Diagnosis and detection | 33 (100) |
Figure 2Publications by year and country.
Machine learning models and algorithms, methods, and tools used in the included studies (N=33).a,b
| Model categories | Number of studies, n (%) | Study ID | |
|
| |||
|
| Support vector machine | 9 (28) | [ |
|
| Artificial neural network | 4 (12.12) | [ |
|
| Artificial neural network-particle swarm optimization | 1 (3.03) | [ |
|
| Random forest | 4 (12.12) | [ |
|
| Prediction rule ensembles | 1 (3.03) | [ |
|
| Gaussian process model | 2 (6.06) | [ |
|
| Nearest neighbor classification algorithm | 1 (3.03) | [ |
|
| Naive Bayes algorithm | 1 (3.03) | [ |
|
| Decision tree algorithm | 1 (3.03) | [ |
|
| |||
|
| Growth mixture modeling | 1 (3.03) | [ |
|
| Linear discriminant analysis | 1 (3.03) | [ |
|
| |||
|
| Baseline logistic regression | 1 (3.03) | [ |
|
| Linear regression | 3 (9.09) | [ |
|
| Elastic net method | 2 (6.06) | [ |
| Least absolute shrinkage and selection operator | 2 (6.06) | [ | |
| Fuzzy TOPSIS method | 1 (3.03) | [ | |
|
| |||
|
| K-means clustering | 1 (3.03) | [ |
|
| |||
|
| Deep neural network | 2 (6.06) | [ |
|
| Convolutional neural network | 1 (3.03) | [ |
|
| DeepBipolar | 1 (3.03) | [ |
|
| |||
|
| Natural language processing | 1 (3.03) | [ |
|
| |||
|
| Structured clinical interview for DSM-IVd | 1 (3.03) | [ |
aMachine learning models/algorithms were not reported in 2 studies, of which 1 study used a novel machine learning approach to diagnose bipolar disorder type I. The name of the model is not mentioned.
bMachine learning methods were only reported in 8 studies.
cThis is an interview-based assessment tool for diagnosis.
dDSM-IV: Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition.
Features of data used in the included studies (N=33).
| Feature | Value | |
|
|
| |
|
| <100 | 9 (28) |
|
| 100-200 | 9 (28) |
|
| 200-600 | 7 (21) |
|
| 700-1000 | 3 (9) |
|
| >2000 | 2 (6) |
|
|
| |
|
| Clinical data | 19 (58) |
|
| Nonclinical data | 12 (36) |
|
|
| |
|
| Private | 21 (64) |
|
| Public | 9 (28) |
|
|
| |
|
| Disorder samples | >90 |
|
| Healthy control | 10 |
aData set size was only reported in 30 studies.
bData types were only mentioned in 31 studies. Clinical data include blood samples, electronic medical records, neurological data, magnetic resonance imaging data, electroencephalography and microarray expression data, whereas nonclinical data include phenotype data, genotype data, genomic data, and genome wide association studies.
cPublic data include government sources, public databases, websites, and freely available databases, whereas private data include nongovernment sources, personal information, or data of specific hospitals or research organizations. Private data include databases that are not available in the public domain.
dMore than 90% of the samples used in the included studies were bipolar disorder samples (regardless of type), whereas 10% of the samples were healthy control samples.
Data set types used in the included studies (N=33).
| Data typea | Study reference | ||
|
| |||
|
| Immune-inflammatory signature | [ | |
|
| Blood samples (serum) | [ | |
|
| Neuropsychological data | [ | |
|
| Neurocognitive data | [ | |
|
| Affective Disorder Evaluation scale | [ | |
|
| Magnetic resonance imaging ( structural and functional) | [ | |
|
| Electroencephalography | [ | |
|
| PGBI-10Mb manic symptom data | [ | |
|
| Microarray expression data set | [ | |
|
| |||
|
| CANTABc cognitive scores | [ | |
|
| Large-scale genome-wide association | [ | |
|
| Phenotypic data set | [ | |
|
| Fractional anisotropy | [ | |
|
| Radial diffusivity | [ | |
|
| Axial diffusivity | [ | |
|
| Electronic medical record | [ | |
|
| Passive digital phenotypes | [ | |
|
| Bipolarity index | [ | |
|
| Daily mood ratings survey | [ | |
|
| Diffusion tensor images | [ | |
|
| Affective Disorder Evaluation scale | [ | |
|
| Activity monitoring | [ | |
|
| Genomic data | [ | |
aIn several studies, more than one data type was used.
bPGBI-10M: Parent General Behavior Inventory-10-Item Mania Scale.
cCANTAB: Cambridge Neuropsychological Test Automated Battery.
Statistical validation.
| Statistics | Study reference | |
|
|
| |
|
| ≤70 | [ |
|
| 71-78 | [ |
|
| 83-90 | [ |
|
| >91 | [ |
|
|
| |
|
| ≤60 | [ |
|
| 65-67 | [ |
|
| 75-78 | [ |
|
| 80-88 | [ |
|
| >90 | [ |
|
|
| |
|
| ≤70 | [ |
|
| 74-77 | [ |
|
| 81-89 | [ |
|
| >92 | [ |
|
|
| |
|
| ≤70 | [ |
|
| 74-78 | [ |
|
| 84- 88 | [ |
|
| >91 | [ |
aRatio of accuracy was not reported in 7 studies. In some studies, different values were mentioned, so the overall values do not sum up.
bSensitivity was not mentioned in 18 studies.
cSpecificity was not mentioned in 20 studies.
dAUC: area under the curve. It is basically used for statistical validation of any data. AUC values were not reported in 23 studies.
Model performance metrics.
| Data type | Study ID | Proposed model | Sensitivity, % | Specificity, % | Accuracy, % | AUCa |
| GWASb | [ | Random forest | 77.7 | 85.4 | 85.2 | NRc |
| Neuropsychological data | [ | SVMd | 76 | 77 | 77.0 | NR |
| ADEe and BPxf | [ | SVM | NR | NR | 96.0 | 92.1 |
| MRIg | [ | SVM | 85 | 85 | 85 | NR |
| MRI | [ | SVM | 82.3 | 92.7 | 87.6 | NR |
| MRI | [ | SVM | 87.5 | 97.1 | 92.4 | NR |
| MRI | [ | SVM | NR | NR | 76.0 | 74 |
| MRI | [ | SVM | 84.6 | 92.3 | 83.5 | NR |
| MRI | [ | Gaussian process model | 66.4 | 74.2 | 70.3 | NR |
| EEGh | [ | SVM | NR | NR | 98.0 | NR |
|
| [ | ANNi | 83.87 | NR | 89.89 | NR |
| DTIj | [ | SVM | NR | NR | 68.3 | NR |
| Activity monitoring | [ | RF,k CNN,l and ANN | 82 | 84 | 84 | NR |
| Genomic data | [ | ANN-PSOm | 83.87 | NR | 89.89 | NR |
| Immune-inflammatory signature | [ | Linear regression and elastic net methods | NR | NR | 86 | 97 |
| EMRn | [ | Linear regression and elastic net methods | 75 | 81 | 78 | 84 |
| CANTABo cognitive score | [ | Linear regression and LASSOp | NR | NR | 71.0 | NR |
| Phenotypic data set (passive digital phenotype) | [ | RF | NR | NR | 65 | 67 |
| Fractional anisotropy, radial diffusivity, and axial diffusivity | [ | Gaussian Process model | 66.67 | 84.21 | 75.0 | NR |
| PGBI-10Mq manic symptom data | [ | Growth mixture modeling | 83 | 89 | NR | NR |
aAUC: area under the curve.
bGWAS: genome-wide association.
cNR: not reported in the article.
dSVM: support vector machine.
eADE: Affective Disorder Evaluation.
fBPx: bipolarity index.
gMRI: magnetic resonance imaging.
hEEG: electroencephalography.
iANN: artificial neural network.
jDTI: diffusion tensor images.
kRF: random forest.
lCNN: convolutional neural network.
mANN-PSO: ANN-particle swarm optimization.
nEMR: electronic medical record.
oCANTAB: Cambridge Neuropsychological Test Automated Battery.
pLASSO: least absolute shrinkage and selection operator.
qPGBI-10M: Parent General Behavior Inventory-10-Item Mania Scale.