| Literature DB >> 32773870 |
Sai Krishna Tikka1, Bikesh Kumar Singh2, S Haque Nizamie3, Shobit Garg4, Sunandan Mandal5, Kavita Thakur5, Lokesh Kumar Singh1.
Abstract
BACKGROUND: Interview-based schizophrenia (SCZ) diagnostic methods are not completely valid. Moreover, SCZ-the disease entity is very heterogeneous. Supervised-Machine-Learning (sML) application of Artificial-Intelligence holds a tremendous promise in solving these issues. AIMS: To sML-based discriminating validity of resting-state electroencephalographic (EEG) quantitative features in classifying SCZ from healthy and, positive (PS) and negative symptom (NS) subgroups, using a high-density recording. SETTINGS ANDEntities:
Keywords: Feature-extraction; machine-learning; negative symptoms; positive symptoms; validity
Year: 2020 PMID: 32773870 PMCID: PMC7368447 DOI: 10.4103/psychiatry.IndianJPsychiatry_91_20
Source DB: PubMed Journal: Indian J Psychiatry ISSN: 0019-5545 Impact factor: 1.759
Figure 1Showing channel placement of the 256 sensor net according to 10-10 international system
Figure 2General machine learning pipeline for classification of positive and negative samples. In experiment 1, schizophrenia was considered a positive group, and healthy was considered as a negative group. In experiment 2, positive symptom was considered positive and negative symptom was considered a negative group
Different types of support vector machine classifier and the parameters used in this study
| Classification method | Kernel type | Description |
|---|---|---|
| Linear SVM | Linear kernel | |
| Quadratic SVM | Polynomial kernel | |
| Cubic SVM | Polynomial kernel | |
| Fine Gaussian SVM | Gaussian radial basis function | |
| Medium Gaussian SVM | Gaussian radial basis function | |
| Course Gaussian SVM | Gaussian radial basis function |
f is the kernel function, xi and xj are the input feature vectors and σ is kernel parameter. SVM – Support vector machine
Different performance measures used for evaluation of support vector machine classifier
| Measure | Description | Mathematical expression |
|---|---|---|
| Accuracy | Percentage of correctly classified samples | |
| Sensitivity | Percentage of correctly classified samples belonging to schizophrenia (experiment 1)/positive symptoms (experiment 2) | |
| Specificity | Percentage of correctly classified samples belonging to the healthy group (experiment 1)/positive symptoms (experiment 2) | |
| Area under receiver operating characteristic curve | A common measure of sensitivity and specificity |
tp – True positives (number of correctly classified positive samples), tn – True negatives (number of correctly classified negative samples), fp – False positives (number of wrongly classified positive samples), fn – False negatives (number of wrongly classified negative samples). In experiment 1 – Schizophrenia was considered as positive group and healthy was considered as negative group. In experiment 2 – Positive symptom was considered as positive group and negative symptom was considered as negative group. SVM – Support vector machine; AUC – Area under receiver operating curve
Comparison of sociodemographic and clinical variables across the groups
| Variables | SCZ ( | HC ( | PS ( | NS ( | ||
|---|---|---|---|---|---|---|
| Age (years) | 31.56±7.05 | 29.95±3.78 | 0.93 | 32.50±8.71 | 29.30±4.71 | 1.07 |
| Marital status | ||||||
| Unmarried | 15 (39.5) | 8 (40) | 0.01 | 8 (44.4) | 3 (30) | 0.56 |
| Married | 23 (60.5) | 12 (60) | 10 (55.6) | 7 (70) | ||
| Religion | ||||||
| Hindu | 28 (73.7) | 18 (90) | 2.17 | 12 (66.7) | 8 (80) | 0.58 |
| Non-Hindu | 10 (26.3) | 2 (10) | 6 (33.3) | 2 (20) | ||
| Educationf | ||||||
| Illiterate/primary | 13 (34.2) | 7 (35) | 4.45 | 3 (16.7) | 4 (40) | 5.72 |
| Secondary | 7 (18.4) | 0 (0) | 7 (38.9) | 0 (0) | ||
| Graduate | 18 (47.4) | 13 (65) | 8 (44.4) | 6 (60) | ||
| Employment | ||||||
| Unemployed | 30 (78.9) | 6 (30) | 13.33*** | 5 (27.8) | 2 (20) | 0.20 |
| Employed | 8 (21.1) | 14 (70) | 13 (72.2) | 8 (80) | ||
| Socioeconomic statusf | ||||||
| Lower | 16 (42.1) | 3 (15) | 4.40 | 5 (27.8) | 5 (50) | 1.74 |
| Middle | 21 (55.3) | 16 (80) | 12 (66.7) | 5 (50) | ||
| Higher | 1 (2.6) | 1 (5) | 1 (5.6) | 0 (0) | ||
| Habitatf | ||||||
| Rural | 23 (60.5) | 5 (25) | 8.20* | 9 (50) | 7 (70) | 2.14 |
| Semi urban | 3 (7.9) | 6 (30) | 3 (16.7) | 0 (0) | ||
| Urban | 12 (31.6) | 9 (45) | 6 (33.3) | 3 (30) | ||
| Duration of illness (months) | 47.18±26.04 | 43.22.±27.40 | 55.20±31.44 | 1.05 | ||
| Chlorpromazine equivalents | 425.66±163.65 | 438.33±127.67 | 418.92±198.90 | 0.19 | ||
| PANSS | ||||||
| Positive | 21.84±5.23 | 25.83±3.36 | 16.90±2.51 | 7.31*** | ||
| Negative | 20.94±8.11 | 15.56±3.28 | 30.70±5.54 | 9.14*** | ||
| General psychopathology | 29.24±8.36 | 27.61±5.91 | 33.50±9.74 | 2.00 | ||
| Total | 72.03±13.31 | 69.00±8.32 | 81.10±12.06 | 3.14** |
*P<.05; **P<0.01; ***P<0.001; fFisher exact test used. SCZ – Schizophrenia patients; HC – Healthy controls; PS – Schizophrenia patients with predominantly positive symptoms; NS – Schizophrenia patients with predominantly negative symptoms; PANSS – Positive and negative syndrome scale; SD – Standard deviation
Significant features found statistically significant between the groups
| A. Experiment-1: SCZ ( | |||||
|---|---|---|---|---|---|
| Regions | Frequency | Feature | Mean±SEM | ||
| SCZ | HC | ||||
| Left IFG | Theta | Entropy | 2.7352±0.07582 | 2.4468±0.06875 | 0.003 |
| Variance | 92.1948±57.55483 | 121.8455±81.11564 | 0.033 | ||
| SD | 5.7851±1.75544 | 6.0636±1.43535 | 0.038 | ||
| Power | 92.1886±57.55100 | 121.8374±81.11023 | 0.033 | ||
| Gamma | Skewness | 0.1112±0.06864 | −0.0382±0.02244 | 0.003 | |
| Right IFG | Delta | Skewness | −0.3342±0.22691 | 0.2836±0.08275 | 0.016 |
| Beta | Skewness | 0.1463±0.10157 | −0.0264±0.02857 | 0.039 | |
| Gamma | Skewness | 0.1203±0.07139 | −0.0278±0.03549 | 0.014 | |
| Left DLPFC | Delta | Skewness | −0.6782±0.20258 | −0.0545±0.09706 | 0.008 |
| Crest form | 3.0215±0.13749 | 3.6768±0.17393 | 0.023 | ||
| Gamma | Skewness | 0.1165±0.06885 | −0.0054±0.01480 | 0.015 | |
| Right DLPFC | Delta | Skewness | −0.3277±0.26277 | 0.3257±0.10590 | 0.024 |
| Beta | Skewness | 0.1520±0.10229 | 0.0255±0.02659 | 0.010 | |
| Gamma | Skewness | 0.1208±0.07167 | −0.0170±0.00951 | 0.002 | |
| Left IPL | Theta | Form factor | 1955.7576±1398.23182 | −552.9008±365.59427 | 0.012 |
| Gamma | Mean | −0.0001±0.00009 | 0.0000±0.00004 | 0.036 | |
| Right IPL | Alpha | Variance | 32.3363±14.53552 | 32.4062±25.70312 | 0.031 |
| SD | 4.2728±0.85405 | 3.1815±0.77514 | 0.041 | ||
| Power | 32.3342±14.53455 | 32.4041±25.70141 | 0.031 | ||
| Beta | Variance | 44.6849±16.87896 | 33.8945±20.78437 | 0.036 | |
| SD | 5.3190±0.91786 | 3.8982±0.70642 | 0.039 | ||
| Minimum | −58.7084±21.69012 | −25.9060±9.70250 | 0.010 | ||
| Maximum | 61.7134±23.12278 | 26.2544±10.16121 | 0.005 | ||
| Range | 120.4218±44.76786 | 52.1604±19.86322 | 0.010 | ||
| Power | 44.6819±16.87784 | 33.8922±20.78298 | 0.036 | ||
| Gamma | Variance | 43.6071±29.07713 | 15.9474±12.94389 | 0.008 | |
| SD | 3.9643±1.20929 | 2.1990±0.54751 | 0.011 | ||
| Maximum | 61.7252±27.53687 | 19.6251±9.35317 | 0.036 | ||
| Range | 124.1787±55.65012 | 38.7074±18.20058 | 0.048 | ||
| Power | 43.6041±29.07519 | 15.9463±12.94303 | 0.008 | ||
| Left STG | Theta | Form factor | 734.5629±1152.24726 | −339.0403±789.73024 | 0.048 |
| Gamma | Variance | 374244.9815±374230.11109 | 15.8634±12.85715 | 0.033 | |
| SD | 64.7792±61.99634 | 2.1937±0.54428 | 0.039 | ||
| Power | 374220.0319±374205.16242 | 15.8624±12.85629 | 0.033 | ||
| Right STG | Beta | Maximum | 61.5651±23.25161 | 27.9247±10.31370 | 0.041 |
| Range | 119.5335±45.01290 | 55.2043±20.07156 | 0.048 | ||
| Left IFG | Delta | Mean | −0.0038±0.00208 | 0.0011±0.00064 | 0.021 |
| Crest factor | 4.1242±0.29585 | 2.9817±0.18331 | 0.014 | ||
| Theta | Mean | 0.0031±0.00157 | −0.0012±0.00055 | 0.027 | |
| Form factor | 953.9041±1738.82570 | −1961.1192±816.33388 | 0.002 | ||
| Right IFG | Delta | Mean | −0.0016±0.00131 | 0.0020±0.00090 | 0.024 |
| Variance | 36089.0665±9247.95745 | 2906.2327±1010.58574 | 0.035 | ||
| SD | 134.2456±23.89547 | 43.8882±8.09026 | 0.049 | ||
| Minimum | −463.3971±82.96620 | −139.9365±23.01746 | 0.027 | ||
| Range | 1090.9231±209.12290 | 287.8238±56.07730 | 0.039 | ||
| Power | 36086.6608±9247.34097 | 2906.0389±1010.51837 | 0.035 | ||
| Theta | Mean | 0.0003±0.00115 | −0.0021±0.00080 | 0.031 | |
| Left DLPFC | Delta | Form factor | −11239.8840±12182.92907 | 8501.4417±3611.42483 | 0.007 |
| Theta | Form factor | 581.4022±241.65339 | −1426.4185±361.01432 | <.001 | |
| Beta | Kurtosis | 16.1049±10.66669 | 40.2326±35.88180 | 0.017 | |
| Right DLPFC | Delta | Mean | −0.0051±0.00199 | 0.0022±0.00140 | 0.007 |
| Entropy | 1.1264±0.07445 | 1.2432±0.04452 | 0.049 | ||
| Theta | Mean | 0.0029±0.00132 | −0.0021±0.00118 | 0.010 | |
| Alpha | Mean | 0.0031±0.00108 | −0.0002±0.00028 | 0.035 | |
| Beta | Mean | −0.0009±0.00028 | 0.0000±0.00019 | 0.027 | |
| Left IPL | Delta | Mean | 0.0143±0.00337 | −0.0002±0.00137 | 0.007 |
| Theta | Mean | −0.0085±0.00195 | 0.0009±0.00122 | 0.004 | |
| Kurtosis | 16.1318±4.67997 | 4.4192±0.34235 | 0.027 | ||
| Crest factor | 5.8362±0.50224 | 4.2454±0.14854 | 0.017 | ||
| Alpha | Skewness | −0.4935±0.18339 | −0.0008±0.00808 | 0.006 | |
| Right IPL | Alpha | Form factor | 957.9047±361.44474 | −1626.4591±1132.25393 | 0.019 |
| Beta | Kurtosis | 5.8536±1.82523 | 4.8403±0.32138 | 0.049 | |
| Crest factor | 4.9846±0.38732 | 5.7063±0.31853 | 0.027 | ||
| Left STG | Alpha | Form factor | −4635.7358±4093.85566 | 1340.2171±666.57497 | 0.017 |
SEM – Standard error of mean; IFG – Inferior frontal gyrus; DLPFC – Dorsolateral prefrontal cortex; IPL – Inferior parietal lobule; STG – Superior temporal gyrus; SCZ – Schizophrenia patients; HC – Healthy controls; PS – Schizophrenia patients with predominant positive symptoms; NS – Schizophrenia patients with predominant negative symptoms
Classification results using significant features
| A. Experiment-1: SCZ ( | ||||||||
|---|---|---|---|---|---|---|---|---|
| 10-fold method | SVM Model | Hold-out method | ||||||
| ACC (%) | SEN (%) | SPE (%) | AUC (%) | ACC (%) | SEN (%) | SPE (%) | AUC (%) | |
| 72.41 | 86.84 | 45.00 | 65.92 | Linear SVM | 73.68 | 92.31 | 33.33 | 62.82 |
| 70.69 | 78.95 | 55.00 | 66.97 | Quadratic SVM | 78.95†† | 92.31 | 50.00 | 71.15 |
| 63.79 | 68.42 | 55.00 | 61.71 | Cubic SVM | 68.42 | 76.92 | 50.00 | 63.46 |
| 70.69 | 97.37 | 20.00 | 58.68 | Fine Gaussian SVM | 68.42 | 100.00 | 0.00 | 50.00 |
| 63.79 | 92.11 | 10.00 | 51.05 | Medium Gaussian SVM | 73.68 | 100.00 | 16.67 | 58.33 |
| 65.52 | 97.37 | 5.00 | 51.18 | Coarse Gaussian SVM | 68.42 | 100.00 | 0.00 | 50.00 |
| 85.71 | 88.89 | 80.00 | 84.44 | Linear SVM | 88.89 | 100.00 | 75.00 | 87.50 |
| 82.14 | 83.33 | 80.00 | 81.67 | QUADRATIC SVM | 88.89 | 100.00 | 75.00 | 87.50 |
| 82.14 | 83.33 | 80.00 | 81.67 | Cubic SVM | 88.89 | 100.00 | 75.00 | 87.50 |
| 64.29 | 100.00 | 0.00 | 50.00 | Fine Gaussian SVM | 55.56 | 100.00 | 0.00 | 50.00 |
| 89.29†† | 100.00 | 70.00 | 85.00 | Medium Gaussian SVM | 77.78 | 100.00 | 50.00 | 75.00 |
| 64.29 | 100.00 | 0.00 | 50.00 | Coarse Gaussian SVM | 55.56 | 100.00 | 0.00 | 50.00 |
†† Model showing highest accuracy. SVM – Support vector machine; ACC – Accuracy; SEN – Sensitivity; SPE – Specificity; AUC – Area under receiver operating curve; SCZ –Schizophrenia patients; HC – Healthy controls; PS – Schizophrenia patients with predominant positive symptoms; NS – Schizophrenia patients with predominant negative symptoms