Literature DB >> 33631640

Artificial Intelligence-Based Prediction of Covid-19 Severity on the Results of Protein Profiling.

Şeyma Yaşar1, Cemil Çolak1, Saim Yoloğlu1.   

Abstract

BACKGROUND: COVID-19 progresses slowly and negatively affects many people. However, mild to moderate symptoms develop in most infected people, who recover without hospitalization. Therefore, the development of early diagnosis and treatment strategies is essential. One of these methods is proteomic technology based on the blood protein profiling technique. This study aims to classify three COVID-19 positive patient groups (mild, severe, and critical) and a control group based on the blood protein profiling using deep learning (DL), random forest (RF), and gradient boosted trees (GBTs).
METHODS: The dataset consists of 93 samples (60 COVID-19 patients, 33 control), and 370 variables obtained from an open-source website. The current dataset contains age, gender, and 368 protein, used to predict the relationship between disease severity and proteins using DL and machine learning approaches (RF, GBTs). An evolutionary algorithm tunes hyperparameters of the models and the predictions are assessed through accuracy, sensitivity, specificity, precision, F1 score, classification error, and kappa performance metrics.
RESULTS: The accuracy of RF (96.21%) was higher as compared to DL (94.73%). However, the ensemble classifier GBTs produced the highest accuracy (96.98%). TGB1BP2 in the cardiovascular II panel and MILR1 in the inflammation panel were the two most important proteins associated with disease severity.
CONCLUSIONS: The proposed model (GBTs) achieved the best prediction of disease severity based on the proteins compared to the other algorithms. The results point out that changes in blood proteins associated with the severity of COVID-19 may be used in monitoring and early diagnosis/treatment of the disease.
Copyright © 2021. Published by Elsevier B.V.

Entities:  

Keywords:  Artificial Intelligence; COVID-19; Deep Learning; Gradient Boosted Trees; Random Forest

Mesh:

Year:  2021        PMID: 33631640      PMCID: PMC7882428          DOI: 10.1016/j.cmpb.2021.105996

Source DB:  PubMed          Journal:  Comput Methods Programs Biomed        ISSN: 0169-2607            Impact factor:   5.428


Introduction

The novel coronavirus disease (COVID-19) has spread rapidly across the globe affecting billions of people's everyday lives. The disease can lead to serious pneumonia, which can lead to death [1]. The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) outbreak that occurred in December 2019 manifests itself in different situations in infected patients. While it appears as a mild respiratory infection on some infected patients, it may progress to severe pneumonia and acute respiratory distress syndrome (ARDS), resulting in multiple organ failure or even death in some. On the other hand, in some patients, the disease progresses without symptoms. Hence, it is very difficult to determine the percentage of people with COVID-19 severity. However, according to the World Health Organization (WHO), it is estimated that 80% of infections are asymptomatic or mild, 15% are serious infections requiring oxygen support, 5% are critical infections requiring ventilation, and 3% are fatal [2]. Patients classified as COVID-19 clinically severe are diagnosed based on some clinical features such as respiratory rate and mean oxygen saturation. However, when these clinical signs appeared, the patients reached a clinically serious stage. Therefore, patients are either taken to intensive care or can die quickly. Therefore, considering all these negativities, it is very important to detect early which cases may become clinically serious and develop new approaches to prevent deaths from COVID-19. Therefore, many studies have been conducted for early diagnosis and diagnosis. While most of these studies are on the clinical and epidemiological features of COVID-19, one of the most frequently used approaches recently is proteomics technology. Proteomics technology is the study of all proteins in a biological system and is increasingly being used by clinical researchers to identify disease biological markers [3]. Therefore, the detection of molecularly changed proteins in the blood of an infected individual with proteomics technology and the discovery of biomarkers are thought to play an active role in the development of the diagnosis and treatment of COVID-19. In recent years, machine learning and deep learning-based studies have gained importance in terms of being a decision support mechanism for clinicians for early diagnosis and diagnosis of diseases in the field of health. Therefore, interest in studies involving machine learning algorithms combined with data obtained from methods used in the early diagnosis and diagnosis of COVID-19 has increased considerably. Machine learning is a sub-branch of artificial intelligence, consisting of modeling and algorithms that make inferences from existing data using mathematical and statistical methods and make predictions about the unknown with these inferences [4]. On the other hand, machine learning is an area at the intersection of statistics, artificial intelligence, computers, predictive analysis, and statistical learning about obtaining information from available data [5]. Deep learning, a sub-branch of machine learning, is the name of the system that allows multi-layered neural networks, unlike machine learning [6]. The main factor that distinguishes deep learning from ANN is that deep neural networks consist of many more layers than ANN. It is done by increasing the number of hidden layers to obtain more features from the data to be processed and to make learning better [7]. The relationships between COVID-19 severity and protein profiling technology can be modeled to discover important proteins associated with the pandemic by deep learning and artificial modeling approaches. Therefore, this study aims to classify three COVID-19 positive patient groups (mild, severe, and critical) and a control group based on the blood protein profiling using deep learning and machine learning models (i.e., Random Forest and Gradient Boosted Tree).

Material And Methods

Dataset

The data set used in this study includes age, gender, 368 proteins, obtained from blood protein profiling belong to 93 subjects which are 59 positive COVID-19 cases [mild (n = 26; group 1), severe (n = 9; group 2), critical (n = 24; group 3)] and 28 control groups (control group). In the OLINK proteomics of cardiovascular, immune, inflammation, and neurology panels, each containing 92 proteins, the protein profiles of 87 samples were examined, resulting in 368 protein measurements per subject blood protein profiling. One of the reasons for the data set's missing values was that a sample of the mild symptom group failed the immune and neurology panels' analysis. However, it was not excluded from the analysis for analyzes involving cardiovascular and inflammation panels. Another reason for the missing values in the data set was that in more than 50% of the samples in all four disease groups, thirteen proteins were missing Normalized Protein eXpression (NPX) values or NPX values below the protein-specific detection limit (LOD). Therefore, these 13 proteins were excluded, and 355 proteins remained; 344 were unique in that protein replication in four panels [8].

Data Preprocessing

Missing values in the data set often complicate the statistical analysis of multivariate data. Therefore, the data set's missing values are complemented by the multiple imputation method using Fully Conditional Specification (FCS), as it is thought that it will negatively affect the model training process [9]. In this method, each variable containing a missing value is determined by a separate model and assigned to that variable with that model [10]. Similarly, since the number of data in each group is unbalanced in the data set, all the classes have been balanced with the "Sample (balance)" operator in Rapidminer Studio. This operator functions the same way as the (absolute) combination of multiplying and sampling works. A fixed number of examples are selected for a given input dataset for each included mark. Current examples are multiplied if insufficient examples are included [11]. Random Forest-Recursive Feature Elimination (RF-RFE) algorithm is used as the variable selection method. In this algorithm, the data set is first trained with a machine learning algorithm, in which variables such as Random Forest (RF), logistic regression, Support Vector Machines (SVM) have certain weights. Then, the variable with the smallest coefficient is removed, and the system is retrained with the remaining variables. This process continues until all features are eliminated, and the variable subset giving the best result is selected. RFE was originally proposed to enable support vector machines to perform feature selection by iteratively training a model, grading features, and then removing the lowest-rated features [12]. However, recently this method has been similarly applied to Random forest (RF), and it is useful in the presence of related features [13], [14], [15].

Deep Learning

Deep Learning is a machine learning technique developed for machine feature extraction, perception, and learning. It performs its operations using multiple consecutive layers. Each consecutive layer receives the output formed in the previous layer as input [16]. Besides the deep learning algorithm also performs data-based learning, the learning process works with calculations based on network diagrams expressed as a neural network, not a single mathematical model as in standard machine learning algorithms. The deep learning architecture in the study was constituted by using a multi-layer feed-forward neural network with stochastic gradient descent using the backpropagation approach. Epsilon, rho, L1 regularization, L2 regularization, max w2, and dropout hyperparameters were tuned using the Evolutionary optimization algorithm to increase model performance.

Machine Learning

Random Forest

The random forest (RF) algorithm establishes multiple decision trees on data samples, and one estimate is obtained from each. Afterward, if the problem is a regression problem, the results obtained are averaged, and if the problem is the classification problem, the prediction with the highest number of votes is selected. RF can reduce overfitting, one of the biggest problems of machine learning algorithms. Since this algorithm gives positive results in large data sets, it can work harmoniously in large data sets. The RF provides an advantage by not ignoring outlier observations [17]. The hyperparameters of maximal depth, minimal leaf size, minimal size for the split, and the number of pre pruning alternatives were tuned using the Evolutionary optimization algorithm to increase model performance.

Gradient Boosted Trees (GBTs)

Gradient Boost is a type of ensemble method used in machine learning. The foundations of this algorithm are based on the studies of Friedman et al. [18], [19], [20]. The main disadvantage of decision trees is that it generates a large "bias" in simple trees and a large variance in complex trees. "Bootstrap" is a method of selecting a random number of data from a data set. It is mostly used to reduce the variance of the tree. In this algorithm, a community of stronger learners is produced, usually by compensating for each other's weaknesses, such as decision trees. According to the Gradient Boosted Tree (GBTs) algorithm, a prediction function is constructed in the first iteration. A loss function is obtained from these differences by calculating the difference between estimates and observations. In the second iteration, the difference between the repetitions and observations is calculated by combining the estimation and loss functions. Thus, the success of the prediction function is tried to be increased by constantly adding on it. The difference between the predictions and the observations obtained is as close to zero as possible [21]. The maximum depth and learning rate hyperparameters were tuned using the Evolutionary optimization algorithm to increase the model performance.

Performance Evaluation Metrics

A 10-fold cross-validation method was used for the validity of the model. In the 10-fold cross-validation method, all data are divided into ten equal parts. The first part is used as the test set, and the remaining nine parts are used as the training data set, and this process is repeated for each part. In this technique, the general accuracy rate of the model is determined by averaging the accuracy values. Performance metrics for all models are given with accuracy, sensitivity, specificity, precision, F1 score, classification error, and kappa statistics.

Data Analysis

The compliance of quantitative variables to normal distribution was checked with the Shapiro Wilk test. Quantitative variables fulfilling the normal distribution assumption were summarized with mean and standard deviation and quantitative variables that did not show normal distribution with median and min-max. In statistical analysis, the Kruskal Wallis test was used for variables that did not show normal distribution, and the Conover test was used for pairwise comparisons of variables with differences (p<0.05). One-way analysis of variance (One-way ANOVA) test was used for variables with normal distribution and Tukey test in case of variance homogeneity in multiple comparisons, and Tamhane T2 test when variances were not homogeneous. In this study, in addition to artificial intelligence modeling and basic comparisons, the effect size was calculated to evaluate the effects of each protein on COVID-19 severity and control groups. The effect size is defined as the magnitude of the difference between groups [22]. Generally, the interpretation values in reported literature are small effect between 0.01-0.06, moderate effect between 0.06-0.14, and large effect more than 0.14 [23]. P <0.05 was considered statistically significant. The programming languages of "Statistical Analysis Software" [24], RStudio Version 3.6.2 [25], and RapidMiner Studio Version 9.8 [26] were used in data analysis.

Results

Baseline characteristic of the original data

In the study, 93 subjects are included, which are 34 (36.6%) are female, 59 (63.4%) are male, and the average age is 58.6 (±15.3). Descriptive statistics regarding the COVID-19 positive and control group based on age and gender variables of the data set are given in Table 1 .
Table 1

escriptive statistics of gender and age variables by COVID-19 positive and control.

VariableControl (n=33)Mild (n=26)Severe (n=9)Critical (n=25)Effect Sizep-value
GenderFemale16a (48.5%)13a (50.0%)3a,b(33.3%)2b (8.0 %)0,61 (Large)0.005*
Male17a (51.5%)13a (50.0%)6a,b(66.7%)23b (92.0)%
Mean Age ± SD61.1a ± 18.251.3b ± 13.564.8a ± 12.860.6a ± 11.30.09 (Large)0.039⁎⁎

a, b: Different characters in each row show a statistically significant difference (p <0.05)

Pearson Chi-square test

Kruskal-Wallis test.

escriptive statistics of gender and age variables by COVID-19 positive and control. a, b: Different characters in each row show a statistically significant difference (p <0.05) Pearson Chi-square test Kruskal-Wallis test. Descriptive statistics of 92 proteins in the cardiovascular II panel according to the COVID-19 positive and control group are given in Table 2 . Additionally, the effect sizes for each protein are estimated in Table 2.
Table 2

Descriptive statistics of proteins in the cardiovascular II panel by groups.

Protein NamesGroups
Effect Sizep-value
ControlMildSevereCritical
Median (Min-Max)Median (Min-Max)Median (Min-Max)Median (Min-Max)
BMP-64.91a(2.57-6.16)4.74a(3.68-5.69)5.11a(4.46-5.64)5.07a(4.22-6.21)0.03 (Large)0.121*
ADM8.19a(6.91-9.07)8.47a,b(6.62-9.14)8.92b,c(8.22-9.79)9.52c(7.83-10.16)0.41 (Large)<0.0001*
CD40-L6.96a(4.87-8.66)4.06b(2.44-7.53)5.02b(3.36-6.92)3.98b(2.92-7.34)0.41 (Large)<0.0001*
PGF7.42a(6.92-8.27)7.33a(6.1-8.09)7.53a,b(7.04-8.25)7.92b(6.99-9.66)0.22 (Large)<0.0001*
ADAM-TS135.03 a,b(4.77-5.39)5.15a(4.74-5.52)4.86a,b(4.68-5.19)4.9b(4.27-6.3)0.18 (Large)<0.0001*
BOC3.79a(2.96-4.53)3.76a(2.75-4.36)2.99b(2.56-3.44)2.73b(2.48-3.59)0.49 (Large)<0.0001*
IL-4RA2.1a(1.57-2.57)2.05a(1.3-3.35)2.09a(1.69-4.41)2.97b(1.84-5.11)0.32 (Large)<0.0001*
IL-1ra4.97a(3.9-6.96)5.12a(3.97-7.47)6.64b(5.45-8.12)7.72c(5.99-8.16)0.55 (Large)<0.0001*
TNFRSF10A2.75a(2.34-3.78)2.64a(1.94-3.89)3.89b(3.01-4.89)4.44b(3.06-6.52)0.56 (Large)<0.0001*
STK46.23a(4.87-6.61)2.5b(0.41-5.67)3.4b(1.36-4.25)3.12b(1.44-5.12)0.58 (Large)<0.0001*
PAR-19.77a(9.21-10.32)8.44b(7.72-9.24)8.66b(7.79-9.34)8.82b(5.73-9.79)0.54 (Large)<0.0001*
PRSS279.32a(8.17-9.77)8.95a(7.95-10.36)8.21b(7.9-8.53)8.04b(7.34-9.03)0.42 (Large)<0.0001*
TF5.92a(4.92-6.59)5.82a(4.63-6.13)6.1a,b(5.21-6.94)6.26b(5.59-7.51)0.19 (Large)<0.0001*
IL1RL24.51a,b(3.47-4.87)4.52a(3.43-5.16)4b(2.8-4.96)4.4a,b(3.44-5.92)0.03 (Large)0.103*
PDGF subunit B11.1a(10.12-11.44)9.77b(7.93-10.85)10.38b,c(8.78-11.27)10.38c(8.01-11.08)0.51 (Large)<0.0001*
IL-17D2.6a(1.8-4.53)2.33a,b(1.56-3.02)2.3a,b(1.7-3.33)2.08b(1.75-2.83)0.07 (Large)0.023*
LOX-16.94a(6.31-8.34)6.77a(5.2-7.84)7.84b(7.15-9.38)8.27b(6.45-9.81)0.32 (Large)<0.0001*
IL189.08a,b(8.14-10.62)8.7a(7.49-10.27)a9.73b,c(8.97-11.09)9.85c(8.98-13.73)0.41 (Large)<0.0001*
PIgR5.76a(5.43-6.21)5.69a(5.31-6.01)5.67a(5.54-5.76)5.66a(5.44-6.23)0.02 (Large)0.171*
RAGE13a(12.01-14.07)a13.38a(11.32-14.32)13.88b(13.55-14.92)14.58b(12.6-15.05)0.43 (Large)<0.0001*
SOD210.27a(9.89-10.52)10.1b(9.69-10.3)10.17a(10.11-10.45)10.15a(10.05-10.33)0.33 (Large)<0.0001*
FGF-232.89a(1.89-4)2.57a(1.54-5.22)2.69a(2.08-3)2.95a(1.8-9.17)<0.001 (Small)0.378*
SPON28.17a(7.72-8.63)8.39a,b(7.32-8.59)8.63c(8.19-8.76)8.74c(8.47-8.96)0.54 (Large)<0.0001*
GLO16.54a(5.56-7.55)5.23b(4.32-6.63)6.67a(5.78-7.42)6.01a(5.35-8.88)0.38 (Large)<0.0001*
SERPINA123.01a,b(1.68-4.45)3.26a(1.27-6.7)1.85a,b(1.24-7.91)2.23b(0.59-4.64)0.12 (Large)0.002*
TM9.53a,b(9.24-10.15)9.55a,b(8.54-10.21)9.32a(8.81-10.09)9.76b(9.18-11.02)0.05 (Large)0.045*
PRELP8.27a(7.86-8.74)8.35a(7.63-8.75)8.34a(8-8.92)8.38a(7.56-9.1)0.02 (Large)0.215*
HO-111.67a(9.6-12.57)11.64a(10.41-12.77)12.51b(11.4-12.95)12.69b(10.97-13.02)0.36 (Large)<0.0001*
XCL14.63a(3.81-6.76)5.07a(3.79-5.66)5.39a,b(4.22-5.73)5.47b(4.42-7.38)0.22 (Large)<0.0001*
CEACAM84.25a(2.95-5.28)4.11a(2.47-5.69)4.73a,b(3.89-5.95)4.91b(3.62-7.73)0.23 (Large)<0.0001*
PTX33.75a(3.15-5.35)3.94a(2.6-5.66)5.17b(4.58-6.14)5.62b(3.94-6.22)0.52 (Large)<0.0001*
PSGL-14.66a(4.28-5.22)4.55a,b(4.08-5.4)4.39b(3.84-4.76)4.37b,c(3.85-4.9)0.19 (Large)<0.0001*
MMP78.14a(7.07-10.11)9.6b(8.49-10.02)9.25b,c(8.95-10.15)9.91c(8.98-12.18)0.48 (Large)<0.0001*
ITGB1BP28a(6.41-9.32)2.53b(1.62-5.53)3.71b(1.95-4.15)3.09b(1.86-5.21)0.60 (Large)<0.0001*
DCN4.27a(3.6-5.03)4.46a(3.4-4.95)4.28a,b(3.99-5.37)4.86b(3.93-5.7)0.11 (Large)0.005*
Dkk-19.74a(7.8-10.47)8.29b(7.19-10.09)9.05a(8.2-9.76)9.28a(7.82-10.52)0.37 (Large)<0.0001*
HB-EGF3.8a(1.61-4.64)0.8b(0.28-1.49)0.77b(0.36-1.68)1.61c(0.45-2.53)0.71 (Large)<0.0001*
BNP1.11a(0.56-2.24)1.29a,c(0.72-3.29)3.38b(1.78-6.24)1.77c(0.69-5.87)0.21 (Large)<0.0001*
ACE23.27a(2.5-5.02)3.24a(2.57-5.12)4.44b(3.19-5.61)4.8b(3.08-6.42)0.36 (Large)<0.0001*
CTSL17.21a(6.59-7.73)7.44a(6.43-9.71)8.75b(8.19-9.74)9.53c(8.62-10.24)0.61 (Large)<0.0001*
hOSCAR10.73a(10.34-11.19)10.76a(9.8-11.08)11.02b(10.88-11.29)11.05b(10.67-11.34)0.28 (Large)<0.0001*
TGM28.85a(5.17-9.59)7.79b(6.16-8.92)8.22a,b(6.95-9.2)8.19a,b(6.53-10.06)0.19 (Large)<0.0001*
CA5A2.08a(1.35-4.81)2.61a(1.39-5.34)3.85b(2.63-6)4.67b(2.21-7.38)0.44 (Large)<0.0001*
PARP-15.05a(3.83-8.26)3.95b(2.88-5.69)5.14a,c(4.3-6.97)5.83c(4.45-8.92)0.03 (Large)<0.0001*
Mean ± SDMean ± SDMean ± SDMean ± SD
ANGPT19.36a ± 0.648.21b ± 1.048.69a,b ± 0.828.30b ± 0.910.26 (Large)<0.0001⁎⁎
SLAMF73.04a ± 0.663.35a,b ± 0.583.91b,c ± 0.724.60c ± 0.770.48 (Large)<0.0001⁎⁎
SRC7.48a ± 0.345.54b ± 1.215.91b ± 0.695.93b ± 0.980.47 (Large)<0.0001⁎⁎
IL63.16a ± 0.753.13a ± 16.56b ± 1.228.29c ± 1.80.79 (Large)<0.0001⁎⁎
IDUA5.94a ± 0.475.59a,b ± 0.425.25b ± 0.675.43b,c ± 0.560.19 (Large)0.001⁎⁎
TNFRSF11A5.82a ± 0.465.53a ± 0.416.07a,b ± 0.66.52b ± 0.820.32 (Large)<0.0001⁎⁎
TRAIL-R26.10a ± 0.425.93a ± 0.357.09b ± 0.597.97c ± 1.020.66 (Large)<0.0001⁎⁎
TIE27.36a ± 0.257.34a ± 0.327.19a ± 0.247.43a ± 0.250.06 (Large)0.141⁎⁎
IL-275.78a ± 0.356.05a ± 0.427.01b ± 0.446.89b ± 0.630.54 (Large)<0.0001⁎⁎
CXCL111.78a ± 0.559.38b ± 0.9810.53c ± 0.9310.67c ± 0.870.58 (Large)<0.0001⁎⁎
Gal-97.99a ± 0.458.14a ± 0.479.08b ± 0.299.31b ± 0.270.69 (Large)<0.0001⁎⁎
GIF6.98a ± 0.856.98a ± 0.666.10b ± 0.876.60a,b ± 0.840.12 (Large)0.039⁎⁎
SCF9.39a ± 0.359.34a ± 0.377.85b ± 0.928.12b ± 0.970.51 (Large)<0.0001⁎⁎
FGF-216.57a ± 1.096.83a,b ± 1.326.83a,b ± 2.097.98b ± 2.650.10 (Large)<0.0001⁎⁎
CTRC10.17a ± 0.7710.52a ± 0.759.72a,b ± 0.799.29b ± 1.370.21 (Large)0.279⁎⁎
GH8.20a,b ± 2.026.94a ± 2.189.26b ± 0.848.28a,b ± 1.420.14 (Large)<0.0001⁎⁎
FS10.76a ± 0.5510.74a ± 0.5210.79a ± 0.4811.05a ± 0.680.05 (Large)0.008⁎⁎
CD845.84a ± 0.434.59b ± 0.374.58b ± 0.414.68b ± 0.420.67 (Large)0.296⁎⁎
PAPPA3.19a ± 0.62.77a ± 0.712.77a ± 0.593.01a ± 0.670.07 (Large)<0.0001⁎⁎
REN6.65a ± 0.676.26a ± 0.555.99a ± 0.767.30b ± 0.890.30 (Large)0.12⁎⁎
DECR17.92a ± 1.063.22b ± 0.743.89b,c ± 0.344.39c ± 0.770.85 (Large)<0.0001⁎⁎
MERTK6.31a ± 0.46.48a,b ± 0.496.83b,c ± 0.67.16c ± 0.320.40 (Large)<0.0001⁎⁎
KIM18.22a ± 0.898.14a ± 0.789.13a,b ± 0.589.42b ± 1.290.27 (Large)<0.0001⁎⁎
THBS25.50a ± 0.2a5.54a ± 0.15a5.68a,b ± 0.18a.b5.75b ± 0.2b0.25 (Large)<0.0001⁎⁎
VSIG23.87a ± 0.593.69a ± 0.423.54a ± 0.614.00a ± 0.780.06 (Large)<0.0001⁎⁎
AMBP7.60a ± 0.237.60a ± 0.217.52a ± 0.287.57a ± 0.250.01 (Moderate)0.249⁎⁎
IL166.73a ± 0.626.55a ± 0.617.01a,b ± 0.67.21b ± 0.670.16 (Large)0.008⁎⁎
SORT18.96a ± 0.388.36b ± 0.378.65a,b ± 0.348.75a ± 0.230.34 (Large)<0.0001⁎⁎
CCL1710.95a ± 0.888.47b ± 0.988.16b ± 0.878.23b ± 1.770.52 (Large)<0.0001⁎⁎
CCL36.66a,b ± 0.846.06a ± 0.637.30b,c ± 0.947.65c ± 0.890.39 (Large)<0.0001⁎⁎
IgG Fc receptor II-b3.03a ± 0.892.96a ± 0.73.51a ± 0.543.34a ± 0.670.07 (Large)0.08⁎⁎
LPL9.53a ± 0.4710.08b ± 0.379.47a ± 0.469.42a ± 0.470.30 (Large)<0.0001⁎⁎
PRSS88.96a ± 0.398.78a ± 0.449.08a ± 0.259.05a ± 0.350.09 (Large)0.103⁎⁎
AGRP5.19a ± 0.365.44a,b ± 0.595.74b,c ± 0.576.12c ± 0.570.35 (Large)<0.0001⁎⁎
GDF-29.17a ± 0.658.80a ± 0.517.54b ± 0.677.60b ± 0.870.52 (Large)<0.0001⁎⁎
FABP28.04a ± 0.998.02a ± 0.867.29a,b ± 1.077.09b ± 1.350.15 (Large)0.009⁎⁎
THPO4.50a ± 0.543.65b ± 0.443.70b,c ± 0.544.20a,c ± 0.660.32 (Large)<0.0001⁎⁎
MARCO6.83a ± 0.256.92a ± 0.26.89a ± 0.186.91a ± 0.280.03 (Large)0.389⁎⁎
GT2.32a ± 0.632.59a ± 0.622.09a ± 0.942.43a ± 1.080.03 (Large)0.259⁎⁎
MMP127.13a ± 0.866.89a ± 0.56.25a ± 0.836.51a ± 1.240.11 (Large)0.038⁎⁎
PD-L23.49a ± 0.323.44a ± 0.423.51a ± 0.53.65a ± 0.420.04 (Large)0.248⁎⁎
TNFRSF13B9.64 ± 0.42a9.73 ± 0.45a.b10.22 ± 0.6b.c10.26 ± 0.550.26 (Large)<0.0001⁎⁎
LEP6.03a ± 1.565.91a ± 1.335.56a ± 1.76.22a ± 0.80.02 (Large)0.817⁎⁎
HSP 2710.33a ± 0.319.57b ± 0.7710.10a,b ± 0.3210.25a ± 0.390.29 (Large)<0.0001⁎⁎
CD44.46a ± 0.344.78b ± 0.385.05b,c ± 0.345.27c ± 0.540.38 (Large)<0.0001⁎⁎
NEMO8.08a ± 0.84.55b ± 0.845.27b,c ± 0.655.52c ± 0.950.76 (Large)<0.0001⁎⁎
VEGFD7.79a ± 0.47.76a ± 0.287.80a,b ± 0.247.42b ± 0.530.15 (Large)0.011⁎⁎
HAOX15.03a ± 1.265.72a ± 1.226.33a,b ± 1.67.62b ± 1.350.39 (Large)<0.0001⁎⁎

a, b,c: Different characters in each row show a statistically significant difference (p <0.05)

Kruskal-Wallis test

One-way analysis of variance.

Descriptive statistics of proteins in the cardiovascular II panel by groups. a, b,c: Different characters in each row show a statistically significant difference (p <0.05) Kruskal-Wallis test One-way analysis of variance. According to Table 2, the difference between groups in terms of protein in the cardiovascular panel except for BMP-6, IL1RL2, PIgR, FGF-23, PRELP, TIE2, CTRC, CD84, REN, AMBP, IgG Fc receptor II-b, PRSS8, MARCO, GT, PD-L2, and LEP proteins is statistically significant (p<0.05). According to the effect size findings, the two proteins most prominently affecting the severity of COVID-19 and control groups in the cardiovascular II panel are DECR-1 (0.85) and IL-6 (0.79) proteins. Similarly, the descriptive statistics of 92 proteins in the immune response panel according to the COVID-19 positive and control group are given in Table 3 .
Table 3

Descriptive statistics of proteins in the immune response panel by groups.

Protein NamesGroups
Effect Sizep-value
ControlMildSevereCritical
Median (Min-Max)Median (Min-Max)Median (Min-Max)Median (Min-Max)
PPP1R9B6.7a(5.3-7.29)1.22b(0.74-3.72)1.83b(0.63-2.45)1.77b(0.73-3.62)0.63 (Large)<0.0001*
GLB13.8a(1.61-4.64)0.8b(0.28-1.49)0.77b(0.36-1.68)1.61c(0.45-2.53)0.71 (Large)<0.0001*
PSIP14.48a(2.72-6.71)2.16b(1.54-4.56)3.87a(3.16-5.43)4.43a(3.47-7.4)0.49 (Large)<0.0001*
ZBTB164.09a(2.53-8.12)0.4b(-0.07-2.25)1.2b,c(0.42-2.69)1.45c(0.54-3.28)0.71 (Large)<0.0001*
IRAK46.74a(5-7.89)1.56b(0.85-3.69)2.35b(1.29-3.51)2.04b(1.12-4.29)0.64 (Large)<0.0001*
HCLS18.19a(6.97-8.74)3.67b(2.46-6)5.37c(4.15-6.71)5.33c(4.08-7.51)0.69 (Large)<0.0001*
CNTNAP21.97a(1.39-3.32)1.46b(0.65-1.99)1.39b(0.69-1.76)1.09b(0.56-2.3)0.37 (Large)<0.0001*
CLEC4G3.34a(2.58-4.47)3.4a(2.85-4.53)3.88a,b(3.08-4.86)4.32b(3.37-6.35)0.36 (Large)<0.0001*
IRF93.11a(2.09-4.31)1.33b(0.85-1.87)1.74b,c(1.15-2.85)2.21c(1.23-5.42)0.58 (Large)<0.0001*
EDAR4.22a(2.3-5.71)1.72b(1.11-3.92)1.36b(0.9-1.7)1.47b(0.92-2.68)0.63 (Large)<0.0001*
IL62.21a(1.47-3.79)1.9a(1.16-4.87)5.02b(4.25-8.14)7.13c(2.91-10.47)0.64 (Large)<0.0001*
DGKZ1.44a(0.58-2.65)0.39b(-0.1-0.68)0.6b(0.29-0.89)0.57b(0.02-3.48)0.54 (Large)<0.0001*
CLEC4C3.88a,b(2.58-4.88)3.98a(3.3-5.44)3.51b,c(2.41-4.31)3.05c(2.07-4.27)0.25 (Large)<0.0001*
IRAK14.82a(3.32-6.07)1.6b(1.13-2.97)2.19b,c(1.68-2.81)2.39c(1.9-4.37)0.72 (Large)<0.0001*
CLEC4A4.19a(3.05-4.52)3.84b(2.85-4.24)2.98c(2.14-3.61)2.94c(2.31-4)0.60 (Large)<0.0001*
PRDX15.32a(3.86-6.62)1.24b(0.71-6.25)2.14b,c(1.28-4.48)2.81c(1.83-5.08)0.67 (Large)<0.0001*
PRDX32.4a(-0.07-4.38)0.63b(-0.98-0.43)0.26b,c(-0.65-0.91)0.25c(-0.32-1.47)0.70 (Large)<0.0001*
FGF23.64a(2.03-5.74)0.24b(-0.13-1.31)0.98b(0.15-1.83)0.44b(-0.07-1.94)0.64 (Large)<0.0001*
PRDX57.65a(7.06-8.44)3.27b(2.53-6.61)4.55c(3.79-6.31)5.19c(4.04-7.05)0.70 (Large)<0.0001*
TRIM53.83a(2.42-5.49)1.2b(0.92-2.47)1.66b,c(1.43-3.49)2.29c(1.57-4.61)0.67 (Large)<0.0001*
DCTN17.42a(5.65-8.63)2.81b(1.54-4.47)3.37b,c(2.46-3.9)3.6c(2.6-7.18)0.63 (Large)<0.0001*
ITGA65.1a(1.43-6.34)0.88b(0.03-3.98)0.56b(0.12-1.17)0.69b(-0.11-2.09)0.59 (Large)<0.0001*
CDSN3.16a(2.28-3.96)2.6a(1.47-6.23)2.29a(1.89-3.65)2.59a(1.54-4.55)0.13 (Large)0.002*
FXYD52.25a(0.77-4.13)0.02b(-0.51-0.85)0.32b,c(0.03-1.25)0.56c(-0.7-3.06)0.64 (Large)<0.0001*
TRAF23.19a(2.16-5.3)1.31b(0.86-4.42)1.81b(1.08-2.55)1.73b(1.09-3.7)0.54 (Large)<0.0001*
LILRB43.05a(2.09-4.62)3.42a(2.65-5.38)4.86b(4.19-6.22)5.47c(4.62-7.38)0.64 (Large)<0.0001*
NTF41.89a(1.31-2.81)1.91a(1.4-3.54)1.54a(1.33-1.95)1.58a(0.95-2.53)0.09 (Large)0.011*
KRT192.17a(1.17-4.07)1.84a(0.08-4.19)4.84b(2.69-7.31)5.57b(3.11-7.72)0.63 (Large)<0.0001*
HNMT9.12a(7.7-10.2)8.96a(6.91-10.66)9.55a,b(8.9-11.63)9.91b(9.13-14.96)0.31 (Large)<0.0001*
CCL117.23a(6.32-8.56)7.41a(5.32-8.33)6.69a(6.21-7.59)7.11a(6.23-8.04)0.06 (Large)0.036*
EGLN11.77a(1.03-2.59)1.16b(0.86-1.98)1.69a(0.76-2.61)1.88a(1.19-4.68)0.32 (Large)<0.0001*
NFATC30.94a(0.43-2.78)0.57b(0.12-2.1)0.81a,b(0.32-2.1)1.38a(0.53-2.45)0.26 (Large)<0.0001*
EIF5A0.25a(-0.17-1.95)0.03b(-0.53-0.45)0.01b,c(-0.3-0.39)0.24a,c(-0.63-1.4)0.20 (Large)<0.0001*
EIF4G17.51a(6.32-8.13)2.57b(1.51-5.26)4.07c(2.79-4.86)4.41c(3.12-6.49)0.72 (Large)<0.0001*
CD281.61a(1.22-4.38)1.65a(0.94-3.75)1.37a(0.98-1.77)1.57a(0.92-2.3)0.03 (Large)0.130*
PTH1R3.9a(3.28-5.38)4.09a(3.47-4.92)3.6a(3.45-4.4)3.85a(3.26-4.78)0.08 (Large)0.019*
BIRC22.13a(1.16-4.28)0.31b(-0.01-0.86)0.59b,c(0.3-1.01)0.71c(0.18-1.58)0.70 (Large)<0.0001*
HSD11B13.1a,b(2.55-3.8)3.17a(2.64-4.36)2.59b(2.12-3.74)2.89b,c(1.96-4.06)0.11 (Large)0.005*
NF24.01a(2.22-5.11)1.72b(-2.48-0.4)1.31b(-1.98–0.62)1.19b(-2.32-1.56)0.63 (Large)<0.0001*
SH2B36.72a(5.05-8.86)2.7b(1.53-6.8)3.23b(2.46-3.65)2.79b(1.67-5)0.59 (Large)<0.0001*
FCRL31.08a(0.48-1.91)0.96a(0.46-1.74)0.79a(0.63-2.11)0.92a(0.55-1.88)0.01 (Moderate)0.293*
CKAP44.74a(3.79-5.3)4.71a(3.65-5.47)5.82b(5.26-7.72)6.7c(5.16-9.94)0.64 (Large)<0.0001*
JUN0.55a(-0.4-5.39)0.23a(-0.76-2.76)0.2a(-0.57-1.3)0.59a(-0.39-3.84)0.03 (Large)0.121*
HEXIM18.48a(5.94-9.71)3.86b(2.64-5.69)5.07c(4.51-6.96)5.67c(4.68-7.8)0.78 (Large)<0.0001*
CLEC4D3.36a(2.49-4.54)3.49a(2.02-4.47)3.82a,b(3.48-5.27)4.42b(3.09-7.93)0.29 (Large)<0.0001*
PRKCQ1.86a(0.8-5.11)0.4b(-0.3-1.2)0.56b(-0.09-0.93)0.57b(0.04-4.82)0.57 (Large)<0.0001*
CXADR2.22a(1.57-3.45)2.18a(1.29-3.3)2.5a,b(1.6-3.67)2.77b(1.95-4.84)0.19 (Large)<0.0001*
IL103.51a(2.86-6.58)3.7a(2.84-7.15)5.1b(4.24-6.77)5.92b(4.35-7.35)0.53 (Large)<0.0001*
SRPK25.57a(2.85-7.63)0.19b(-0.15-1.87)1.01b,c(0.41-2.37)1.55c(0.74-4.52)0.76 (Large)<0.0001*
KLRD16.44a(5.61-8.36)6.62a,b(5.71-7.56)7.15a,b(5.98-7.59)7.15b(5.83-8.87)0.13 (Large)0.002*
BACH12.79a(1.46-5.36)0.73b(0.22-3.18)1.49b,c(1.17-2.98)2.12c(1.33-3.79)0.61 (Large)<0.0001*
PIK3AP14.88a(3.49-7.4)2.51b(1.32-3.67)3.05b,c(2.28-4.72)3.18c(1.94-5.86)0.54 (Large)<0.0001*
SPRY27.1a(4.38-9.22)1.9b(0.76-7.81)2.4b(1.16-2.71)2.2b(1.18-5.02)0.56 (Large)<0.0001*
STC15.55a(4.39-6.49)7.01b(5.55-7.72)7.29b,c(6.65-7.54)7.52c(6.43-7.72)0.66 (Large)<0.0001*
ARNT0.84a,b(0.21-2.32)0.36a(-0.3-1.63)0.63a,b(0.39-3.91)0.83b(-0.07-5.86)0.17 (Large)<0.0001*
FAM3B4.88a(3.96-5.64)4.87a(3.16-5.58)4.31b(3.21-4.87)4.45a,b(3.34-6.53)0.13 (Large)0.002*
DFFA7.31a(5.91-8.94)3.8b(2.91-5.09)4.61c(4.44-5.93)5.24c(4.26-8.38)0.73 (Large)<0.0001*
DAPP19.03a(7.08-9.75)2.33b(1.29-5.55)2.87b(0.8-3.57)2.4b(1.45-5.48)0.60 (Large)<0.0001*
PADI21.41a,c(0.74-2.77)0.59b(0.25-3.05)0.79a,b(0.52-1.59)1.36c(0.51-4.41)0.28 (Large)<0.0001*
CLEC7A3.33a(-0.07-4.19)3.19a(2.1-3.66)3.36a(2.88-4.28)3.35a(-0.16-6.42)0.01 (Moderate)0.265*
IL12RB12a(1.57-5.67)2.25a(1.73-2.98)2.65a,b(2.34-3.33)3.05b(2.1-5.01)0.46 (Large)<0.0001*
TANK3.71a(1.94-5.94)1.06b(0.31-2.2)1.5b(0.81-2.56)1.44b(0.09-4.18)0.57 (Large)<0.0001*
KPNA11.18a(0.05-2.16)0.83b(-1.59-0.16)0.57b,c(-1.19-1.41)0.45c(-1.17-3.03)0.57 (Large)<0.0001*
LAG32.39a(1.79-4.03)2.77b(2.2-4.4)3.16b,c(2.68-4.04)3.28c(2.32-4.64)0.34 (Large)<0.0001*
IL50.63a(0.18-3.5)0.55a(0.11-6.91)0.9a(0.2-2.99)0.66a(0.07-4.38)0.01 (Moderate)0.259*
CD832.91a(2.41-3.61)2.91a(2.17-3.51)2.69a(2.36-3.36)2.91a(2.18-4.72)<0.0001 (Small)0.397*
ITGB63.07a,b(2.21-3.72)2.99a,b(1.85-4.06)2.85a(2.2-3.02)a3.24b(2.32-5.26)b0.08 (Large)0.018*
Mean ± SDMean ± SDMean ± SDMean ± SD
TPSAB14.81a ± 0.64.51a ± 0.534.70a ± 0.594.46a ± 0.620.06 (Large)0.121⁎⁎
DPP101.66a ± 0.431.32b ± 0.351.02b ± 0.331.44a,b ± 0.530.18 (Large)0.002⁎⁎
GALNT32.63a ± 0.462.99a ± 0.583.00a ± 0.453.59b ± 0.560.35 (Large)<0.0001⁎⁎
TRIM215.93a ± 0.632.52b ± 0.793.40c ± 0.984.19c ± 0.930.75 (Large)<0.0001⁎⁎
ITM2A2.36a ± 0.422.28a ± 0.951.84a ± 0.832.59a ± 1.290.05 (Large)0.202⁎⁎
MILR12.82a ± 0.463.16a ± 0.433.28a,b ± 0.613.68b ± 0.520.33 (Large)<0.0001⁎⁎
LY753.11a ± 0.352.86a,c ± 0.452.42b ± 0.352.66b,c ± 0.360.27 (Large)<0.0001⁎⁎
PLXNA45.69a ± 0.913.48b ± 0.824.14b ± 0.793.62b ± 0.98b0.56 (Large)<0.0001⁎⁎
MGMT7.33a ± 0.593.34b ± 1.063.96b,c ± 0.984.33c ± 0.970.79 (Large)<0.0001⁎⁎
TREM12.36a ± 0.572.33a ± 0.513.11b ± 0.493.20b ± 0.670.34 (Large)<0.0001⁎⁎
SH2D1A3.23a ± 1.051.66b ± 0.511.99b ± 0.52.15b ± 0.620.43 (Large)<0.0001⁎⁎
ICA13.74a ± 0.910.78b ± 0.311.20b,c ± 0.381.27c ± 0.490.82 (Large)<0.0001⁎⁎
DCBLD27.84a ± 0.367.96a ± 0.357.94a ± 0.367.89a ± 0.440.02 (Large)0.580⁎⁎
FCRL63.07a ± 0.553.62b ± 0.652.97a,b ± 0.563.46a,b ± 0.770.14 (Large)0.014⁎⁎
NCR13.48a,b ± 0.413.27a ± 0.363.61a,b ± 0.473.83b ± 0.780.14 (Large)0.004⁎⁎
CXCL121.55a ± 0.251.35b ± 0.181.65a ± 0.271.72a ± 0.240.29 (Large)<0.0001⁎⁎
AREG3.39 ± 0.49a3.21 ± 0.63a4.99 ± 0.77b5.78 ± 1.03c0.71 (Large)<0.0001⁎⁎
IFNLR12.43a ± 0.372.58a ± 0.283.02b ± 0.393.12b ± 0.440.40 (Large)<0.0001⁎⁎
SIT11.79a ± 0.332.22a ± 0.653.24b ± 0.992.93b ± 1.070.34 (Large)<0.0001⁎⁎
MASP12.27a ± 0.331.96b ± 0.331.28c ± 0.21.42c ± 0.280.62 (Large)<0.0001⁎⁎
LAMP34.24a ± 0.614.68a ± 0.915.53b ± 0.666.07b ± 0.470.56 (Large)<0.0001⁎⁎
CLEC6A1.94a ± 0.562.26a ± 0.513.01b ± 0.613.26b ± 0.650.49 (Large)<0.0001⁎⁎
DDX584.34a ± 0.752.81b ± 0.824.09a ± 0.984.94a ± 1.30.43 (Large)<0.0001⁎⁎
ITGA113.21a ± 0.362.92a ± 0.422.12b ± 0.342.18b ± 0.60.52 (Large)<0.0001⁎⁎
BTN3A23.21a,b ± 0.532.83a ± 0.393.39b ± 0.594.24c ± 0.620.53 (Large)<0.0001⁎⁎

a, b,c: Different characters in each row show a statistically significant difference (p <0.05)

Kruskal-Wallis test

One-way analysis of variance.

Descriptive statistics of proteins in the immune response panel by groups. a, b,c: Different characters in each row show a statistically significant difference (p <0.05) Kruskal-Wallis test One-way analysis of variance. Considering Table 3, except for CD28, FCRL3, JUN, CLEC7A, IL5, CD83, TPSAB1, ITM2A, DCBLD2 proteins, the difference in protein between groups in the immune response panel and IL12 was statistically significant ( p<0.05). Considering the effect size findings, the two proteins that most remarkably affect the COVID-19 severity and control groups in the immune response panel are ICA1 (0.82) and MGMT (0.79) proteins. Descriptive statistics of 92 proteins in the inflammation panel according to the COVID-19 positive and control group are given in Table 4 .
Table 4

Descriptive statistics of proteins in the inflammation panel by groups.

Protein NamesGroups
Effect Sizep-value
ControlMildSevereCritical
Median (Min-Max)Median (Min-Max)Median (Min-Max)Median (Min-Max)
IL85.46a(4.6-7.13)4.34b(3.29-6.57)6.11a(5.12-6.9)7.36c(5.96-9.01)0.64 (Large)<0.0001*
CD8A9.28a(7.84-10.83)9.9b(8.75-11.66)9.58a,b(8.27-11.22)9.91b(8.68-11.79)0.09 (Large)0.011
MCP-31.77a(1.03-4.31)1.66a(0.85-4.33)4.65b(2.25-6.67)5.87c(3.83-8.06)0.64 (Large)<0.0001*
GDNF2a(1.09-2.7)2.03a,b(1.29-3.29)1.93a,b(1.61-3.23)2.29b(1.74-3.29)0.09 (Large)0.013
CD2445.95a(5.4-7.28)5.96a(5.08-6.63)5.55b(4.96-6.11)5.54b(5.06-6.26)0.23 (Large)<0.0001*
IL73.77a(2.39-5.61)1.83b(1.25-3.46)2.62b,c(1.46-3.35)2.8c(1.34-4.35)0.52 (Large)<0.0001*
uPA9.58a(9.12-10.47)9.74a(8.53-10.61)9.69a(9.34-10.45)10.12b(9.57-11.17)0.20 (Large)<0.0001*
IL62.5a(1.42-4.27)1.96a(1.08-5.28)5.42b(4.58-8.32)7.42c(3.21-10.62)0.64 (Large)<0.0001*
IL-17C2.12a(1.22-4.96)2.18a(1.16-4.03)2.53a,b(2.15-5.09)3.39b(2.02-5.6)0.34 (Large)<0.0001*
IL-17A1.52a,b(0.75-2.93)1.38a(0.68-2.87)1.8b5(1.33-2.92)1.72a,b(0.91-2.56)0.08 (Large)0.015
AXIN17.42a(5.71-10.85)2.39b(1.15-4.52)3.25b(1.86-3.55)2.75b(1.93-4.95)0.63 (Large)<0.0001*
TRAIL7.54a(6.88-8.1)7.87a(6.99-8.82)7.18a,b(7.01-7.95)6.95b(6.03-8.27)0.30 (Large)<0.0001*
IL-20RA0.87a(0.51-1.35)0.79a(0.25-3.01)1.02a(0.5-1.82)0.97a(0.54-2.47)0.07 (Large)0.031
CXCL97.21a(5.64-10.71)7.02a(5.82-8.45)8.17b(7.96-10.67)8.9b(7.47-11.01)0.37 (Large)<0.0001*
CST56.64a(5.83-8.33)6.33a(5.63-7.68)6.34a(5.54-7.04)6.08a(5.51-8.58)0.03 (Large)0.116
IL-2RB0.71a(0.1-1.43)0.57b(-0.09-1.02)0.54a,b(0.25-0.87)0.71a(0.17-1.74)0.10 (Large)0.007
IL-1 alpha0.02a(-1.34-1.75)-0.74b(-1.21-3.14)-0.58a,b(-1.12–0.18)-0.84b(-1.14-0.2)0.23 (Large)<0.0001*
IL20.7a(0.44-1.02)0.55a(0.24-1.08)0.94b(0.58-1.37)0.6a(0.22-1.08)0.17 (Large)<0.0001*
TSLP1.11a(0.16-1.83)1a(0.47-2.66)0.95a(0.6-6.43)1.15a(0.19-4.12)0.28 (Small)0.931
CCL46.09a(5.28-8.11)5.59b(4.09-6.85)6.44a(5.46-8.1)6.41a(5.01-8.26)0.19 (Large)<0.0001*
IL189.34a,b(8.24-10.83)8.89a(7.75-10.45)9.88b,c(9.27-11.26)10.06c(9.17-14.12)0.41 (Large)<0.0001*
TGF-alpha2.54a(2.09-3.29)2.96a(1.59-3.55)3.39b(2.91-4.94)4.19c(2.62-6.04)0.59 (Large)<0.0001*
CCL117.59a(6.63-9.25)7.79a(5.7-8.72)7.17a(6.57-7.96)7.48a(6.68-8.57)0.06 (Large)0.046
TNFSF146.15a(4.57-7.45)4.2b(2.91-5.41)6.08a(5.24-7.06)6.42a(4.37-7.22)0.52 (Large)<0.0001*
FGF-232.18a(1.48-3.34)1.79a(0.83-4.18)2.05a(1.35-2.46)2.17a(1.08-8.45)0.01 (Moderate)0.285
IL-10RA0.89a(0.55-2.49)0.82a(0.58-3.03)0.96a(0.7-2.44)1.01a(0.46-2.7)0.04 (Large)0.073
FGF-51.03a(0.8-1.64)1.03a(0.6-1.39)1.16a(0.86-1.45)1.04a(0.56-1.6)0.008 (Small)0.517
LIF-R4.09a(3.55-4.55)4.24a(3.05-4.94)4.33a,b(3.93-5.29)4.66b(4.13-5.1)0.35 (Large)<0.0001*
CCL199.39(8.05-11.76)a9.41(8.16-11.8)a9.98(9.31-11.93)a11.15(9.87-12.56)b0.46 (Large)<0.0001*
IL-15RA1.24a(0.89-1.87)1.36a(0.58-1.75)1.6a,b7(1.14-2.25)1.63b(1.02-3.15)0.14 (Large)0.002
IL-22 RA11.66a(0.91-3.67)1.49a(0.65-2.43)1.73a(0.84-2.72)1.55a(0.61-2.44)0.01 (Moderate)0.284
Beta-NGF0.02a(-0.75-0.64)-0.12b(-0.41–0.01)-0.11a,b(-0.3-0.17)-0.08a,b(-0.27-0.12)0.25 (Large)<0.0001*
CXCL513.2a9(11.96-13.91)10.16b(7.59-12.72)11.08b(7.34-12.58)10.31b(6.78-12.77)0.58 (Large)<0.0001*
TRANCE4.55a(3.71-5.59)4.82a(3.2-6.49)2.99b(2.5-4.33)2.66b(1.46-5.15)0.54 (Large)<0.0001*
HGF8.57a(7.71-10.01)8.11a(6.28-8.94)9.42b(8.85-10.62)10.19c(8.73-12.55)0.61 (Large)<0.0001*
IL-240.92a(-0.07-2.11)0.74a(-0.1-2.63)1.61b(0.98-2.55)1.95b(0.46-4.28)0.31 (Large)<0.0001*
IL130.71a(0.23-3.82)0.62a(0.03-1.73)0.52a(0.2-0.83)0.7a(0.24-4.28)0.03 (Large)0.116
ARTN0.88a(0.23-1.44)0.84a,b(0.46-2.24)0.94a,b(0.32-1.21)1.07b(0.59-2.95)0.04 (Large)0.106
IL103.29a(2.55-6.25)3.55a(2.74-7.07)4.93b(4.01-6.37)5.66b(4.09-7.13)0.53 (Large)<0.0001*
TNF2.74a(1.71-4.27)2.82a(2-3.8)3.58b(3.25-4.62)4.05b(3.09-5.33)0.51 (Large)<0.0001*
CCL2310.33a(9.05-11.25)10.37a(9.43-10.99)11.47b(9.19-12.09)11.76b(9.92-12.68)0.45 (Large)<0.0001*
CD55.4a(4.28-6.35)5.38a(4.46-6.37)5.28a(4.41-6.02)5.09a(4.58-6.22)0.005 (Small)0.469
CXCL610.45a(8.98-11.75)7.45b(4.32-9.61)8.63b,c(7.1-9.48)8.38c(7.37-10.59)0.60 (Large)<0.0001*
CXCL109.85a(8.61-12.55)9.64a(8.48-13.4)12.52b(11.58-13.64)13.49b(10.41-13.75)0.56 (Large)<0.0001*
IL-200.62a(0.43-1.11)0.46b(-0.03-0.61)0.71a(0.45-1.02)0.55a(0.38-0.78)0.31 (Large)<0.0001*
SIRT28.66a(7.03-9.76)3b(2.27-5.58)4.07c(3.18-5.74)4.48c(3.51-8.07)0.70 (Large)<0.0001*
DNER8.74a(8.23-9.38)8.68a(7.51-9.04)8.14b(7.72-8.81)8.09b(7.54-8.57)0.43 (Large)<0.0001*
EN-RAGE2.3a(1.56-4.9)2.3a(1.35-3.89)4.68b(3.13-5.89)5.11b(3.37-7.51)0.61 (Large)<0.0001*
CD4011.81a(10.92-13.1)10.95b(9.84-11.51)11.58a(10.98-12.45)11.9a(11.14-14.07)0.48 (Large)<0.0001*
IL330.77a(0.49-1.16)0.62a(0.27-1.26)1.12b(0.69-1.56)1.13b(0.64-2.06)0.36 (Large)<0.0001*
IFN-gamma6.89a(5.29-11.09)6.82a(5.42-11.77)8.88b(7.7-11.8)10.45b(6.12-13.42)0.43 (Large)<0.0001*
IL4-0.1a(-1.21-2.78)-0.41b(-1.56-0.44)-0.15a,b(-0.8-0.24)-0.09a,b(-0.83-1.13)0.06 (Large)0.041
LIF-0.02a(-0.39-1.8)-0.12a(-0.42-0.28)0.26a,b(-0.09-0.64)0.51b(-0.13-2.75)0.42 (Large)<0.0001*
NRTN0.55a(0.17-1.06)0.37b(0.16-0.7)0.58a,b(0.37-0.87)0.57a(0.2-1.33)0.12 (Large)0.003
MCP-29.65a(7.47-11.47)9.21a(8.22-11.96)9.22a,b(8.44-12.73)10.84b(8.92-12.19)0.15 (Large)0.001
CASP-86.7a(5.69-7.7)2.41b(1.62-4.24)3.36b,c(2.82-4.8)3.86c(2.89-7.28)0.70 (Large)<0.0001*
CCL206.96a(6.39-12.02)7.47a(5.44-9.36)7.9a(7.1-10.35)9.82b(7.02-12.25)0.33 (Large)<0.0001*
ST1A16.47a(5.55-7.18)1.21b(0.6-3.48)1.88b(0.76-2.65)1.58b(0.98-4.49)0.64 (Large)<0.0001*
STAMBP8.21a(6.87-9.92)4.19b(3.29-5.83)4.82c(4.51-5.86)4.93c(3.77-7.12)0.66 (Large)<0.0001*
IL50.77a(0.14-3.82)0.48a(0.16-7.15)0.98a(0.35-3.32)0.78a(0.22-3.9)0.09 (Large)0.011
ADA6.08a(5.31-7.43)5.33b(4.59-6.36)5.84a,b(5.66-6.66)6.11a(5.09-7.5)0.21 (Large)<0.0001*
Mean ± SDMean ± SDMean ± SDMean ± SD
VEGFA11.41a ± 0.5910.77b ± 0.5311.86a,c ± 0.2712.11c ± 0.550.50 (Large)<0.0001⁎⁎
CDCP12.74a ± 0.692.69a ± 0.653.63b ± 0.354.31b ± 0.910.50 (Large)<0.0001⁎⁎
OPG10.14a,b ± 0.5310.00a ± 0.3810.54b ± 0.4111.08c ± 0.570.46 (Large)<0.0001⁎⁎
LAP TGF-beta-17.59a ± 0.746.03b ± 0.56.67c ± 0.626.65c ± 0.480.53 (Large)<0.0001⁎⁎
MCP-111.28a ± 0.4611.34a ± 0.7112.16b ± 0.6113.30c ± 0.920.62 (Large)<0.0001⁎⁎
CXCL119.19a ± 0.988.13b ± 0.9810.38c ± 0.6710.53c ± 0.630.57 (Large)<0.0001⁎⁎
OSM4.17a ± 0.774.06a ± 0.886.30b ± 1.056.38b ± 1.050.60 (Large)<0.0001⁎⁎
CXCL110.92a ± 0.578.70b ± 0.989.77c ± 0.919.89c ± 0.830.55 (Large)<0.0001⁎⁎
CD66.07a,b ± 0.646.35a ± 0.725.85a,b ± 0.735.69b ± 0.570.14 (Large)0.005⁎⁎
SCF9.34a ± 0.359.26a ± 0.377.80b ± 0.928.05b ± 0.940.51 (Large)<0.0001⁎⁎
SLAMF12.37a ± 0.452.40a ± 0.313.06b ± 0.553.00b ± 0.590.31 (Large)<0.0001⁎⁎
MCP-415.62a ± 0.7214.05b ± 0.7614.28b ± 0.6914.50b ± 0.80.44 (Large)<0.0001⁎⁎
MMP-115.08a ± 0.8813.92b ± 0.9915.33a ± 0.9714.79a ± 0.870.25 (Large)<0.0001⁎⁎
FGF-215.30a ± 1.025.49a ± 1.245.55a ± 1.856.49a ± 2.440.08 (Large)0.325⁎⁎
IL-10RB6.22a ± 0.316.07a ± 0.46.29a ± 0.316.28a ± 0.450.05 (Large)0.357⁎⁎
IL-18R18.38a ± 0.418.39a ± 0.619.27b ± 0.539.75b ± 0.380.63 (Large)<0.0001⁎⁎
PD-L15.51a ± 0.55.48a ± 0.586.51b ± 0.386.93b ± 0.540.61 (Large)<0.0001⁎⁎
IL-12B6.50a ± 0.756.56a ± 0.646.74a ± 0.516.81a ± 0.720.04 (Large)0.405⁎⁎
MMP-109.20a,b ± 0.689.03a ± 0.349.04a,b ± 0.659.61b ± 0.60.15 (Large)0.006⁎⁎
CCL35.81a ± 0.645.38a ± 0.566.71b ± 0.727.24b ± 0.830.56 (Large)<0.0001⁎⁎
Flt3L9.31a ± 0.459.10a,b ± 0.498.77a,b ± 0.558.89b ± 0.680.11 (Large)0.030⁎⁎
4E-BP110.45a ± 0.887.91b ± 0.969.25c ± 0.989.07c ± 0.780.57 (Large)<0.0001⁎⁎
CCL282.65a ± 0.652.36a,b ± 0.532.38a,b ± 0.432.21b ± 0.460.10 (Large)0.093⁎⁎
FGF-198.79a ± 0.88.63a ± 0.917.55b ± 1.138.42a,b ± 0.990.14 (Large)0.023⁎⁎
CCL256.22a ± 0.626.26a ± 0.626.10a ± 0.676.43a ± 0.430.03 (Large)0.451⁎⁎
CX3CL14.10a ± 0.44.10a ± 0.554.46a,b ± 0.724.91b ± 0.550.32 (Large)<0.0001⁎⁎
TNFRSF96.77a ± 0.556.64a ± 0.396.50a ± 0.476.65a ± 0.880.02 (Large)0.806⁎⁎
NT-32.61a ± 0.452.04b ± 0.51.92b ± 0.51.77b ± 0.510.34 (Large)<0.0001⁎⁎
TWEAK9.10a ± 0.378.77b ± 0.448.24c ± 0.248.08c ± 0.510.51 (Large)<0.0001⁎⁎
TNFB4.57a,b ± 0.554.89a ± 0.374.37b ± 0.324.35b,c ± 0.490.19 (Large)0.001⁎⁎
CSF-110.13a ± 0.2110.14a ± 0.3710.67b ± 0.1810.78b ± 0.140.59 (Large)<0.0001⁎⁎

a, b,c: Different characters in each row show a statistically significant difference (p <0.05)

Kruskal-Wallis test

One-way analysis of variance.

Descriptive statistics of proteins in the inflammation panel by groups. a, b,c: Different characters in each row show a statistically significant difference (p <0.05) Kruskal-Wallis test One-way analysis of variance. When Table 4 is examined, except for CST5, TSLP, FGF-23, IL-10RA, FGF-5, IL-22 RA1, IL13, ARTN, CD5, FGF-21, IL-10RB, IL-12B, CCL28, CCL25, and TNFRSF9 proteins, the difference in protein between the groups in the inflammation panel, and IL12 is statistically significant (p <0.05). When the effect size values are examined, the two proteins that most markedly affect the COVID-19 severity and control groups in the inflammation panel are SIRT2 (0.70) and CASP-8 (0.70) proteins. Finally, descriptive statistics of 92 proteins in the neurology panel according to the COVID-19 positive and control group are given in Table 5 .
Table 5

Descriptive statistics of proteins in the neurology panel by groups.

Protein NamesGroups
Effect Sizep-value
ControlMildSevereCritical
Median (Min-Max)Median (Min-Max)Median (Min-Max)Median (Min-Max)
NMNAT13.01a(1.85-7.05)2.87a(1.7-5.38)3.81a,b(2.05-4.72)4.5b(3.86-8.47)0.34 (Large)<0.0001*
NRP28.23a,b(7.94-8.44)8.21a(7.93-8.36)8.26a,b(7.91-8.33)8.33b(7.93-8.54)0.09 (Large)0.012*
MAPT0.09a,b(-0.49-0.88)0.19a(-0.6-0.73)0.29b(-0.25-0.77)0.09b,c(-0.44-1.33)0.16 (Large)0.001*
CADM33.66a(2.57-4.31)3.26a(1.72-3.89)2.62a(2.45-4.36)2.92a(2.07-5.46)0.12 (Large)0.004*
GDNF1.92a(1.09-2.55)1.88a,b(1.38-3.24)2.09a,b(1.4-3.03)2.17b(1.6-3.23)0.07 (Large)0.022*
UNC5C4.62a(3.91-5.3)4.39a(2.9-4.95)4.44a(3.78-5.15)4.61a(3.91-5.59)0.02 (Large)0.193*
VWC25.55a(4.57-7.29)5.41a(4.41-6.5)5.88a(5.13-6.47)5.46a(4.17-6.6)0.02 (Large)0.193*
Siglec-94.92a(4.36-5.48)4.86a(4.37-5.42)5.06a,b(4.45-5.4)5.29b(4.28-5.94)0.16 (Large)0.001*
CLM-65.86a(5.34-6.61)5.87a(5.23-6.57)6.1a,b(5.8-6.82)6.42b(5.9-7.63)0.31 (Large)<0.0001*
NBL15.07a(4.72-5.3)4.88b(4.44-5.03)4.93a,b(4.67-5.14)4.92b(4.72-5.17)0.22 (Large)<0.0001*
EFNA42.9a(2.43-3.61)2.8b(1.99-3.3)3.04a,b(2.78-3.83)3.38b(2.61-5.99)0.29 (Large)<0.0001*
SCARB24.5a(3.94-5.32)4.63a,b(3.77-6.02)5.14b,c(4.53-7.29)5.72c(4.55-7.97)0.43 (Large)<0.0001*
ROBO25.58a(5.1-6.53)5.7a(3.99-6.18)4.58b(4.16-4.95)4.6b(3.88-5.55)0.54 (Large)<0.0001*
CRTAM5.04a(4.32-7.05)5.47a(4.48-6.52)5.23a(4.67-6.57)5.5a(4.01-6.43)0.03 (Large)0.131*
RGMA10.96a(10.34-11.45)11a(9.68-11.39)10.4b(9.71-10.67)10.14b(9.28-11.16)0.40 (Large)<0.0001*
MSR16.55a(4.91-7.61)6.62a(5.31-7.54)6.72a,b(6.35-7.6)7.08b(6.14-7.75)0.18 (Large)<0.0001*
Alpha-2-MRAP9.75a(8.34-10.68)7.9b(6.8-8.4)8.18b,c(7.95-8.95)8.74c(8.07-10.48)0.63 (Large)<0.0001*
sFRP-35.51a,b(4.74-6.01)5.33a(4.24-5.8)5.41a,b(4.9-6.19)5.93b(3.93-6.33)0.16 (Large)0.001*
EPHB64.04a(3.35-4.69)3.85a(2.43-4.43)3.57a(3.31-4.4)3.81a(3.17-4.93)0.06 (Large)0.041*
CNTN54.9a(4.23-5.8)4.74a(3.6-5.67)3.88b(2.78-4.62)3.44b(2.98-4.95)0.52 (Large)<0.0001*
MATN38.94a(7.91-10.34)8.84a(7.56-9.82)9.66b(8.95-13.73)10.58b(9.67-12.18)0.59 (Large)<0.0001*
RSPO12.79a(2.34-4.02)2.51b(1.49-3.24)2.98a,b,c(2.31-4.76)3.31c(2.42-5.11)0.30 (Large)<0.0001*
GAL-88.6a(6.87-9.84)5.49b(4.7-6.53)6.38c(5.65-7.04)6.6c(6.12-7.86)0.76 (Large)<0.0001*
LAYN5.45a(4.53-6.3)5.22a(4.17-5.94)5.32a,b(4.49-6.48)5.65b(5.09-8.93)0.09 (Large)0.010*
NEP2.17a(1.5-4.21)2.31a(1.31-3.92)3.26a(1.39-5.65)3.1a(1.27-5.3)0.04 (Large)0.097*
THY 110.06a(9.49-10.53)9.86b(8.63-10.29)9.65b(8.96-10.25)9.86a,b(9.09-10.88)0.16 (Large)0.001*
TMPRSS52.79a(1.93-3.6)2.7a(2.1-3.57)2.2b(1.32-2.52)2.19b(1.6-3.02)0.35 (Large)<0.0001*
GM-CSF-R-alpha5.66a(3.3-6.84)5.84a(3.96-6.71)4.96a(4.52-6.05)5.91a(4.12-6.32)0.01 (Moderate)0.309*
Beta-NGF1.2a(0.86-2.64)1.17a(0.8-1.86)1.41a,b(1.12-2.36)1.69b(1.27-2.66)0.44 (Large)<0.0001*
CD2006.41a(5.67-7.28)6.5a(5.19-6.84)5.8b(4.89-6.64)6.03a,b(5.21-7.05)0.15 (Large)0.001*
G-CSF3.01a(2.14-4.17)2.71a(1.86-4.74)2.62a(1.81-4.94)3.74b(1.52-6.1)0.26 (Large)<0.0001*
DRAXIN3.16a(2.5-4.67)3.24a(1.84-4.45)3.09a(2.63-4.44)4.02b(2.73-6.97)0.10 (Large)0.008*
PVR8.14a(7.31-8.61)8.17a(6.6-9.09)8.77b(8.1-9.26)9.07b(8.56-9.47)0.56 (Large)<0.0001*
TNFRSF12A4.97a(4.11-5.91)4.82a(3.12-6.03)4.71a(4.51-6.2)5.67b(4.34-7.51)0.25 (Large)<0.0001*
SKR36.9a(6.37-7.6)6.79a(5.96-7.44)6.98a,b(6.49-8.16)7.24b(6.48-9.92)0.14 (Large)0.001*
FLRT22.35a(1.76-2.94)2.34a,b(1.44-2.68)2.08b(1.86-2.18)2.32a,b(1.65-3.03)0.10 (Large)0.007*
MDGA15.04a(4.23-6.04)3.39b(1.98-5.03)3.82b(2.3-4.37)4.3c(3.37-5.87)0.19 (Large)<0.0001*
CDH64.78a(3.93-5.23)4.61b(3.48-5.02)3.73c(3.09-4.51)3.89c(2.57-5.11)0.44 (Large)<0.0001*
DDR17.36a(6.93-7.81)7.28a(6.3-7.65)6.9b(6.17-7.35)7.26a(6.61-7.9)0.12 (Large)0.003*
JAM-B8.06a(7.53-8.75)7.73a,b(6.79-8.38)7.27b(6.97-8.07)7.51b,c(6.68-9.81)0.20 (Large)<0.0001*
NAAA3.58a(2.66-4.46)3.25a(2.79-4.02)3.05a(1.89-4.87)3.07a(1.92-5.3)0.10 (Large)0.006*
N2DL-23.06a(2.31-4.19)2.89a(1.98-3.38)3.51a(2.86-4.23)4.17b(2.94-7.95)0.42 (Large)<0.0001*
PLXNB11.99a(1.43-2.65)1.6a(1.23-4.49)2.01a,b(1.78-2.74)2.62(1.79-3.65)0.38 (Large)<0.0001*
Dkk-43.7a(2.57-5.05)3.35a(2.24-4.32)3.42a(2.98-4.55)3.39a(2.72-5.88)0.01 (Moderate)0.304*
EDA2R4.7a,b(0.79-6.12)4.23a(2.84-4.79)4.7a,b(3.88-5.74)5.13b(3.55-7.91)0.15 (Large)0.001*
LAT10.04a(8.89-10.56)5.03b(3.31-7.04)6.4c(5.33-7.52)6.47c(4.11-8.3)0.71 (Large)<0.0001*
NTRK37.54a(6.81-8.27)7.36a(6.22-7.72)6.93b(6.38-7.08)6.66b(5.18-7.55)0.49 (Large)<0.0001*
LAIR-25.05a(0.66-8.4)4.77a(3.28-7.55)5.1a(4.27-8.78)5.37a(4.38-8.87)0.01 (Moderate)0.320*
Nr-CAM9.66a(9.26-10.05)9.57b(8.75-9.83)9.21b(8.96-9.52)9.42b(9.13-9.8)0.23 (Large)<0.0001*
Mean ± SDMean ± SDMean ± SDMean ± SD
EZR3.92a ± 0.33.75a ± 0.374.45b ± 0.354.84c ± 0.370.64 (Large)<0.0001⁎⁎
SMOC27.96a ± 0.437.69a,b ± 0.537.40b ± 0.467.56a,b ± 0.660.12 (Large)0.006⁎⁎
NCAN8.51a ± 0.338.60a ± 0.318.28a ± 0.458.48a ± 0.410.06 (Large)0.147⁎⁎
PRTG6.63a ± 0.246.55a ± 0.356.20b ± 0.166.31b ± 0.310.23 (Large)<0.0001⁎⁎
PLXNB35.49a ± 0.473.91b ± 0.523.81b ± 0.193.99b ± 0.420.73 (Large)<0.0001⁎⁎
CPA210.18a ± 0.569.97a ± 0.839.05b ± 0.619.03b ± 0.930.32 (Large)<0.0001⁎⁎
CD385.57a ± 0.565.76a,b ± 0.416.19b,c ± 0.446.66c ± 0.540.46 (Large)<0.0001⁎⁎
SMPD14.39a ± 0.434.58a ± 0.395.04b ± 0.545.27b ± 0.350.46 (Large)<0.0001⁎⁎
RGMB6.02a ± 0.335.84a,c ± 0.45.41b ± 0.425.65b,c ± 0.430.21 (Large)<0.0001⁎⁎
SIGLEC15.81a ± 0.516.25a,b ± 0.846.89b,c ± 0.767.34c ± 0.410.51 (Large)<0.0001⁎⁎
ADAM 224.48a ± 0.414.27a ± 0.433.67b ± 0.373.85b ± 0.620.29 (Large)<0.0001⁎⁎
CLEC1B12.34a ± 0.559.52b ± 0.7810.92c ± 0.811.19c ± 0.90.69 (Large)<0.0001⁎⁎
ADAM 234.26a ± 0.453.97a,b ± 0.73.59b ± 0.533.54b,c ± 0.530.23 (Large)<0.0001⁎⁎
HAGH7.84a ± 0.615.48b ± 1.046.56c ± 1.316.11b,c ± 1.240.49 (Large)<0.0001⁎⁎
LXN2.79a ± 0.51.63b ± 0.181.84b ± 0.41.75b ± 0.340.66 (Large)<0.0001⁎⁎
BCAN4.50a ± 0.424.22a ± 0.403.69b ± 0.343.71b ± 0.590.36 (Large)<0.0001⁎⁎
GDF-83.66a ± 0.563.64a ± 0.792.41b ± 0.342.53b ± 1.070.35 (Large)<0.0001⁎⁎
WFIKKN13.52a ± 0.363.69a ± 0.473.07b ± 0.183.10b ± 0.520.27 (Large)<0.0001⁎⁎
CDH37.19a ± 0.277.29a ± 0.377.06a ± 0.437.36a ± 0.570.05 (Large)0.217⁎⁎
GFR-alpha-16.79a ± 0.366.95a ± 0.467.78b ± 0.548.18b ± 0.590.62 (Large)<0.0001⁎⁎
SCARA58.72a ± 0.38.52a,b ± 0.388.20b ± 0.348.35b,c ± 0.380.21 (Large)<0.0001⁎⁎
NTRK26.64a ± 0.226.32b ± 0.335.76c ± 0.275.66c ± 0.440.62 (Large)<0.0001⁎⁎
GZMA5.85a ± 0.446.28b ± 0.466.51b,c ± 0.466.71c ± 0.470.37 (Large)<0.0001⁎⁎
SCARF26.39a ± 0.496.04b ± 0.435.63b ± 0.565.74b ± 0.480.28 (Large)<0.0001⁎⁎
GDNFR-alpha-35.14a ± 0.354.96a ± 0.434.10b ± 0.314.29b ± 0.510.49 (Large)<0.0001⁎⁎
CPM6.80a ± 0.196.66a ± 0.216.14b ± 0.326.15b ± 0.460.48 (Large)<0.0001⁎⁎
CLEC10A5.20a ± 0.365.14a,b ± 0.74.58b,c ± 0.54.47c ± 0.560.28 (Large)<0.0001⁎⁎
GCP55.26a ± 0.734.74b ± 0.634.12b ± 0.754.23b ± 0.570.32 (Large)<0.0001⁎⁎
BMP-44.33a ± 0.534.99b ± 0.624.60a,b ± 0.54.18a ± 0.570.26 (Large)<0.0001⁎⁎
FcRL24.89a ± 0.545.01a,b ± 0.565.11a,b ± 0.485.45b ± 0.650.14 (Large)0.009⁎⁎
IL-5R-alpha4.02a ± 0.773.70a ± 0.555.15b ± 0.75.14b ± 0.940.42 (Large)<0.0001⁎⁎
PDGF-R-alpha5.25a ± 0.315.28a ± 0.365.42a,b ± 0.465.77b ± 0.390.28 (Large)<0.0001⁎⁎
CTSC5.03a ± 0.413.59b ± 0.743.65b ± 0.634.33c ± 0.60.52 (Large)<0.0001⁎⁎
CTSS5.61a ± 0.245.62a ± 0.245.95b ± 0.46.22b ± 0.230.53 (Large)<0.0001⁎⁎
N-CDase3.72a ± 0.613.61a,b ± 0.693.00b ± 0.483.31a,b ± 0.820.11 (Large)0.014⁎⁎
TNFRSF218.28a ± 0.248.22a ± 0.298.01a ± 0.348.10a ± 0.320.09 (Large)0.054⁎⁎
CLM-16.08a ± 0.986.28a,b ± 0.827.19b,c ± 0.637.56c ± 0.840.36 (Large)<0.0001⁎⁎
SPOCK12.38a ± 0.272.42a ± 0.272.55a,b ± 0.332.68b ± 0.350.16 (Large)0.009⁎⁎
IL128.30a ± 0.778.35a ± 0.698.54a ± 0.558.53a ± 0.690.02 (Large)0.579⁎⁎
MANF8.98a ± 0.076.85b ± 1.147.36b ± 0.957.50b ± 0.980.52 (Large)<0.0001⁎⁎
TN-R4.27a ± 0.364.36a ± 0.554.00a,b ± 0.383.86b ± 0.530.17 (Large)0.003⁎⁎
CD200R14.28a,b ± 0.334.37a ± 0.334.04a,b ± 0.354.11b ± 0.320.13 (Large)0.008⁎⁎
KYNU9.04a ± 0.748.35b ± 0.69.37a,c ± 0.479.62c ± 0.580.39 (Large)<0.0001⁎⁎

a, b, c: Different characters in each row show a statistically significant difference (p <0.05)

Kruskal-Wallis test

One-way analysis of variance.

Descriptive statistics of proteins in the neurology panel by groups. a, b, c: Different characters in each row show a statistically significant difference (p <0.05) Kruskal-Wallis test One-way analysis of variance. According to Table 5, the difference between groups in terms of protein in the neurology panel except for proteins UNC5C, VWC2, CRTAM, NEP, GM-CSF-R-alpha, Dkk-4, LAIR-2, NCAN, CDH3, TNFRSF21, and IL12 is statistically significant (p<0.05). As the effect sizes are appreciated, the two proteins that most markedly affect the COVID-19 severity and control groups in the neurology panel are PLXNB3 (0.73) and GAL-8 (0.70).

Results of Preprocessing data

In the data set, a total of 7 missing values were determined in other variables except for gender and age variables. As a result of the assignment made using the Fully Conditional Specification method instead of the determined missing values, the missing values in the data set were completed. The class imbalance problem is solved using the "Sample (balance)" operator on the data set that does not contain any missing value. As a result, the data set became balanced, with 33 people in each group. As a result of the feature selection with the Recursive Feature Elimination method to increase the model performance, the number of variables in the data set decreased to 138 proteins.

Artificial intelligence models and Performance Evaluation

In this study, artificial intelligence models (deep learning (DL), Random Forest (RF), Gradient Boosted Trees (GBTs) are constructed to classify three COVID-19 positive patient groups (mild, severe, and critical) and a control group based on the blood protein profiling. The hyperparameters for the models are tuned by the Evolutionary optimization method, which uses an evolutionary approach. As a result, the hyperparameters for the deep learning model were optimized as 1.0e-8 for epsilon, 0.99 for rho, 0.0 for L1, 0.17 for L2, and 10.0 for maxw2. Hyperparameters for the random forest model were tuned as 35.0 for maximal depth, 77.0 for minimal leaf size, 55.0 for minimal size for the split, and 28.0 for the number of pre pruning alternatives. Finally, the hyperparameters for the GBTs model have been optimized as 62.0 for maximal depth, 0.94 for learning rate, 20.0 for the number of bins, and 1.0e-5 for min split improvement. According to the model performance metric results in Table 6 , the GBTs classification algorithm gave the most successful result. The accuracy rate based on Random Forest (96.21%) was more successful than the accuracy rate based on classic machine learning (DL 94.73%). In kappa statistic, which measures the reliability of the statistical fit, the GBTs, RF, and DL approaches represent a perfect fit with the values of 0.96, 0.95, and 0.93.
Table 6

Performance metrics of the models.

ModelsGroupsSensitivity (%)Specificity (%)Precision (%)F1 score (%)Accuracy (%)Class. ErrorKappa
Gradient Boosted Trees (GBTs)Mild96.9797.9894.1295.5296.983.020.96
Severe1.0098.9997.0698.51
Critical90.9198.9996.7793.75
Control1.001.001.001.00
Random Forest (RF)Mild96.9798.9996.9796.9796.213.790.95
Severe1.0097.9894.2997.06
Critical87.8898.9996.6792.06
Control1.0098.9997.0698.51
Deep Learning (DL)Mild96.9796.9791.4394.1294.735.270.93
Severe1.0097.9894.2997.06
Critical87.8897.9893.5590.62
Control93.941.001.0096.88
Performance metrics of the models. Fig. 1 displays the pseudo-codes of the GBTs algorithm, which produces the best prediction in classifying the severity of COVID19 disease based on the proteomics data.
Fig. 1

The pseudo-codes of the GBTs algorithm.

The pseudo-codes of the GBTs algorithm. The graphic of variable importance values of the top ten proteins for the GBTs algorithms. Input: The training set , a differentiable loss function L(y,F(x)), number of iterations M. Initialize model with a constant value: For m = 1 to M: Compute so-called pseudo-residuals: Fit a base learner (or weak learner, e.g., tree) h(x) to pseudo-residuals, i.e., train it using the training set . Compute multiplier γ by solving the following one-dimensional optimization problem: Update the model: Output F(x). Figure 1. The pseudo-codes of the GBTs algorithm Table 7 and Figure 1 depict the importance levels of the top ten proteins in COVID-19 positive and control individuals on the severity of the disease in the GBTs modeling.
Table 7

Variable importance values of the top ten proteins for the GBTs algorithms.

Protein NamesRelative ImportancePercentage
ITGB1BP2 CVDII22.790.26
MILR1 INF18.520.21
MATN3 NEU15.500.18
ROBO2 NEU8.770.10
REN CVDII8.360.09
CLEC4C IR3.530.04
IL6 INF2.410.03
ZBTB16 IR1.810.02
PLXNB3 NEU0.710.01
LILRB4 IR0.610.01

CVDII: Cardiovascular II Panel; IR: Immune Response Panel; INF: Inflammation Panel; NEU: Neurology Panel

Variable importance values of the top ten proteins for the GBTs algorithms. CVDII: Cardiovascular II Panel; IR: Immune Response Panel; INF: Inflammation Panel; NEU: Neurology Panel ITGB1BP2 CVDII (22.79-0,26%), MILR1 INF (18,52-0,21%), MATN3 NEU (15,50-0,18%) and ROBO2 NEU (8,77-0,10%) provided the highest importance, while the lowest importance values were for PLXNB3 NEU (0,71-0,01%) and LILRB4 IR (0,61-0,01%) from the GBTs technique.

Discussion

COVID-19 is widespread, with high morbidity and high mortality in chronically ill patients all over the world. According to recent studies, older patients are more likely to become infected with COVID-19, particularly those with underlying diseases. The seriousness of cases puts great pressure on the deficiency of services for intensive care. Unfortunately, basic clinical characteristics of COVID-19 at distinct serious stages remain unknown to date. Several attempts are being made to develop automated systems to enable the early identification of the disease by medical experts based on medical images or -omics technologies. Prediction models that combine factors or characteristics to predict the likelihood of people being infected are helping clinicians cope with the outbreak of COVID-19 [27]. During the severe acute respiratory syndrome-new coronavirus-2 pandemic, the insufficiency of laboratory diagnostic tools and the take a long time led clinicians to more rapid diagnosis methods. Although COVID-19 can be effectively diagnosed at an early stage by the approaches based on proteomic analysis, the detection of serious COVID-19 patients before the manifestation of severe symptoms to reduce mortality is equally important. In this study, three positive (mild, extreme, and critical) COVID-19 patient groups and a control group may be separated based on deep learning and multiple machine learning models (i.e., Random Forest, Gradient Boosted Tree) related to blood protein profiling. According to the experimental results from the current study, it can be concluded that the models based on blood proteins generate promising prediction results in classifying COVID-19 (mild/severe/critical) severity levels and the control group. When the prediction results of the algorithms are compared according to the performance metrics (i.e., accuracy, sensitivity, specificity, precision, F1, kappa, and classification error), the GBTs algorithm slightly outperforms deep learning and random forest techniques on the classification problem under question. The ten top proteins, ITGB1BP2, MILR1, MATN3, ROBO2, REN, CLEC4C, IL6, ZBTB16, PLXNB3, and LILRB4, calculated from the best performing GBTs algorithm, can be used as biomarkers in the COVID-19 severity classification. A similar paper has been reported that six proteins (IL6, CKAP4, Gal-9, IL-1ra, LILRB4, and PD-L1) are associated with the severity of the COVID-19 disease, and complex variations in blood proteins associated with the severity of the disease may be used as early biomarkers to screen the severity of the disease in COVID-19 and act as future therapeutic targets [8]. On the other hand, when the effect sizes for all proteins are examined, the five proteins with the highest values are DECR-1 (0.85), ICA1 (0.82), IL-6 (0.79), MGMT (0.79), and PLXNB3 (0.73), respectively. Besides, the findings of the proposed GBTs model indicate that the two proteins (i.e., IL6 and LILRB4) are significantly related to COVID-19 severity, as reported by the previous work [8]. Different studies on the proteomics profiling of the COVID-19 pandemic have been reported to identify the differences in proteins. A novel paper [28] has performed RNA-seq and high-resolution mass spectrometry on 128 blood samples from COVID-19-positive and COVID-19-negative patients with various disease severities and outcomes and mapped 219 molecular features with high significance for the status and severity of COVID-19. Finally, the related study presents a web-based platform to be interactively explored and demonstrated through a machine learning approach (ExtraTrees classifier) to COVID-19 severity prediction [28]. Another research has profiled host responses to COVID-19 by studying plasma proteomics in a population of patients with COVID-19, including non-survivors and survivors emerging from moderate or extreme symptoms, and revealed numerous plasma protein alterations associated with COVID-19. To classify 11 proteins as biomarkers and a range of biomarker combinations validated by an independent cohort and precisely differentiated and projected COVID-19 outcomes, we developed a pipeline based on machine-learning (penalized logistic regression) [29]. A recent study has been described the development of a proteomic risk score (PRS) based on 20 blood proteomic biomarkers linked to progression to severe COVID-19 and established that using a machine learning model (Light Gradient Boosting Machine), a core group of gut microbiota could reliably predict the blood proteomic biomarkers of COVID-19 [30]. The study conducted by Gomila et al. [31] used matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI TOF MS) to analyze the mass spectra profiles of the sera from 80 COVID-19 patients, clinically classified as mild (33), severe (26), critical (21), and 20 healthy controls and they found a clear variability of the serum peptidome profile depending on COVID-19 severity. The two support vector machines discrete severe (severe and critical) and non-severe (mild) patients with 90% precision in the study of the resulting matrix of peak intensity and estimated correctly the non-negative outcome of the severe patients in 85% of the cases and the negative in 38% of the cases. Yet, the current work additionally encapsulates the use of a deep learning approach in the proteomics analysis of the COVID-19 severity, which is an important difference from the studies given earlier. To sum up, the proposed model (Gradient Boosted Tree) achieved the best prediction of disease severity based on the proteins compared to the other algorithms. The results point out that changes in blood proteins associated with the severity of the disease may be used in monitoring the severity of COVID-19 disease and in early diagnosis and treatment.

Declaration of Competing Interest

The authors declare that they have no conflict of interest.
  6 in total

Review 1.  The accuracy of machine learning approaches using non-image data for the prediction of COVID-19: A meta-analysis.

Authors:  Kuang-Ming Kuo; Paul C Talley; Chao-Sheng Chang
Journal:  Int J Med Inform       Date:  2022-05-13       Impact factor: 4.730

2.  Machine learning approaches in Covid-19 severity risk prediction in Morocco.

Authors:  Mariam Laatifi; Samira Douzi; Abdelaziz Bouklouz; Hind Ezzine; Jaafar Jaafari; Younes Zaid; Bouabid El Ouahidi; Mariam Naciri
Journal:  J Big Data       Date:  2022-01-06

3.  Gender Bias in Artificial Intelligence: Severity Prediction at an Early Stage of COVID-19.

Authors:  Heewon Chung; Chul Park; Wu Seong Kang; Jinseok Lee
Journal:  Front Physiol       Date:  2021-11-29       Impact factor: 4.566

4.  A Comprehensive Review of Artificial Intelligence in Prevention and Treatment of COVID-19 Pandemic.

Authors:  Haishuai Wang; Shangru Jia; Zhao Li; Yucong Duan; Guangyu Tao; Ziping Zhao
Journal:  Front Genet       Date:  2022-04-26       Impact factor: 4.772

5.  Combining Deep Phenotyping of Serum Proteomics and Clinical Data via Machine Learning for COVID-19 Biomarker Discovery.

Authors:  Antonio Paolo Beltrami; Maria De Martino; Emiliano Dalla; Matilde Clarissa Malfatti; Federica Caponnetto; Marta Codrich; Daniele Stefanizzi; Martina Fabris; Emanuela Sozio; Federica D'Aurizio; Carlo E M Pucillo; Leonardo A Sechi; Carlo Tascini; Francesco Curcio; Gian Luca Foresti; Claudio Piciarelli; Axel De Nardin; Gianluca Tell; Miriam Isola
Journal:  Int J Mol Sci       Date:  2022-08-15       Impact factor: 6.208

6.  Melatonin inhibits lipid accumulation to repress prostate cancer progression by mediating the epigenetic modification of CES1.

Authors:  Lijie Zhou; Cai Zhang; Xiong Yang; Lilong Liu; Junyi Hu; Yaxin Hou; Hong Tao; Haruhiko Sugimura; Zhaohui Chen; Liang Wang; Ke Chen
Journal:  Clin Transl Med       Date:  2021-06
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.