| Literature DB >> 35615376 |
Priyanka Srivastava1, Chitra Bamba1, Rakesh Kumar Pilania2, Anu Kumari1, Rajni Kumrah2, Archan Sil2, Surjit Singh2.
Abstract
Kawasaki disease (KD) is a common childhood systemic vasculitis with a special predilection for coronary arteries. Even after more than five decades of the initial description of the disease, the etiology of KD remains an enigma. This transcriptome data re-analysis study aimed to elucidate the underlying pathogenesis of KD using a bioinformatic approach to identify differentially expressed genes (DEGs) to delineate common pathways involved in KD. Array datasets from the Gene Expression Omnibus database were extracted and subjected to comparative meta-analysis for the identification of prominent DEGs. Fifteen hub genes with high connectivity were selected from these DEGs (IL1B, ITGAM, TLR2, CXCL8, SPI1, S100A12, MMP9, PRF1, TLR8, TREM1, CD44, UBB, FCER1G, IL7R, and FCGR1A). Of these 15 genes, five genes (CXCL8, FCGR1A, IL1B, TLR2, and TLR8) were found to be involved in neutrophil degranulation. To gain further insight into the molecular mechanism, a protein-protein network was established. Significantly enriched pathways based on the above-mentioned genes were mainly centered on biological regulation and signaling events. In addition, the pathway analysis also indicated that the majority of the DEGs in KD were enriched in systemic lupus erythematosus, suggesting a strong interplay between immunological and genetic factors in the pathogenesis of KD. These findings could significantly aid in identifying therapeutic targets and understanding KD biosignatures to design a biomarker panel for early diagnosis and severity of the disease.Entities:
Keywords: Kawasaki disease; bioinformatics; biomarkers; hub genes; in-silico analysis; microarray; transcriptomics analysis
Year: 2022 PMID: 35615376 PMCID: PMC9124956 DOI: 10.3389/fgene.2022.849834
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Search result GEO datasets using the ARGEOS web tool.
| Accession | Organism | Type | Platform |
|---|---|---|---|
| GSE73461 |
| Expression profiling by array | GPL10558 |
| GSE73463 |
| Expression profiling by array | GPL10558 |
| GSE63881 |
| Expression profiling by array | GPL10558 |
| GSE68004 |
| Expression profiling by array | GPL10558 |
| GSE73462 |
| Expression profiling by array | GPL6947 |
| GSE73464 |
| Expression profiling by array | GPL6947 |
| GSE48498 |
| Expression profiling by array | GPL570 |
| GSE16797 |
| Expression profiling by array | GPL570 |
| GSE9864 |
| Expression profiling by array | GPL6270 |
| GSE9863 |
| Expression profiling by array | GPL6271 |
| GSE18606 |
| Expression profiling by array | GPL6480 |
| GSE109351 |
| Expression profiling by array | GPL17586 |
| GSE73577 |
| Expression profiling by array | GPL4133 |
List of the datasets included in the study.
| Accession | Samples | Type | Summary | References |
|---|---|---|---|---|
| GSE73461 | 459 | Illumina HumanHT-12 V4.0 expression beadchip | Genome-wide analysis of transcriptional profiles in children <17 years of age with inflammatory diseases, bacterial or viral infections, or with clinical features suggestive of an infection | Wright et al. 2018 |
| GSE73463 | 233 | |||
| GSE63881 | 341 | Transcriptional profiles in KD patients at acute and convalescent phases with different clinical outcomes were investigated | Hoang et al., 2014 | |
| GSE68004 | 162 | 1) To define the transcriptional signature of KD that can aid in the diagnosis of complete and incomplete KD in children; 2) to identify specific biomarkers that objectively discriminate between KD and other mimicking conditions, including HAdV and 3) to test the prognostic utility of GEP to determine response to IVIG therapy and development of coronary artery abnormalities (CAAs) | Jaggi et al., 2018 |
Top 25 upregulated genes of the microarray meta-analyses along with their fold change values.
| Upregulated genes | ||||
|---|---|---|---|---|
| Gene symbol | Gene name | Log ratio combined | Fold change combined | FDR |
| PPM1A | Protein phosphatase, Mg2+/Mn2+ dependent 1A | 1.5612 | 36.408 | 0.001274 |
| KIR2DL5A | Killer cell immunoglobulin-like receptor, two domains, long cytoplasmic tail, 5A | 1.5063 | 32.085 | 0.004013 |
| PPFIBP1 | PTPRF interacting protein, binding protein 1 (liprin beta 1) | 1.4856 | 30.594 | 0.000258 |
| MBD3L5 | Methyl-CpG binding domain protein 3-like 5 | 1.4677 | 29.354 | 0.0185 |
| OSGIN1 | Oxidative stress–induced growth inhibitor 1 | 1.3628 | 23.057 | 0.0319 |
| FCAR | Fc fragment of IgA receptor | 1.0824 | 12.089 | 0.000323 |
| MCEMP1 | Mast cell-expressed membrane protein 1 | 1.0476 | 11.158 | 0 |
| HP | Haptoglobin | 1.0464 | 11.127 | 0 |
| FAM177A1 | Family with sequence similarity 177 member A1 | 1.0097 | 10.227 | 0.000461 |
| ANXA3 | Annexin A3 | 1.0074 | 10.171 | 0 |
| KCNJ15 | Potassium channel, inwardly rectifying subfamily J, member 15 | 0.9882 | 9.731 | 5.19E-13 |
| CEP170 | Centrosomal protein 170 kDa | 0.896 | 7.87 | 0.0192 |
| FCGR1A | Fc fragment of IgG, high-affinity Ia, receptor (CD64) | 0.8823 | 7.625 | 0 |
| HMX3 | H6 family homeobox 3 | 0.8658 | 7.342 | 0 |
| SOD2 | Superoxide dismutase 2, mitochondrial | 0.8651 | 7.33 | 2.84E-06 |
| S100A12 | S100 calcium-binding protein A12 | 0.8571 | 7.196 | 0 |
| H2AC20 | Histone cluster 2, H2ac | 0.8223 | 6.642 | 0 |
| LIMK2 | LIM domain kinase 2 | 0.8072 | 6.415 | 0.0275 |
| C19orf38 | Chromosome 19 open reading frame 38 | 0.7805 | 6.033 | 0.0396 |
| RBMS1 | RNA binding motif, single-stranded interacting protein 1 | 0.7598 | 5.751 | 0.00032 |
| CYSTM1 | Cysteine-rich transmembrane module containing 1 | 0.7549 | 5.687 | 0 |
| ALPL | Alkaline phosphatase, liver/bone/kidney | 0.7503 | 5.627 | 0 |
| IL1R2 | Interleukin 1 receptor, type II | 0.7208 | 5.258 | 0 |
| C1QB | Complement component 1, q subcomponent, B chain | 0.7202 | 5.251 | 0 |
| CCPG1 | Cell cycle progression 1 | 0.6979 | 4.988 | 0.0239 |
Top 25 downregulated genes of the microarray meta-analyses along with their fold change values.
| Downregulated genes | ||||
|---|---|---|---|---|
| Gene symbol | Gene name | Log ratio combined | Fold change combined | FDR |
| RPL17P43 | Ribosomal protein L17 pseudogene 43 | −2.1741 | 149.314 | 7.33E-05 |
| LRRC4B | Leucine-rich repeat-containing 4B | −1.9794 | 95.367 | 0.00244 |
| OXNAD1 | Oxidoreductase NAD-binding domain containing 1 | −1.8402 | 69.215 | 0.006463 |
| NR1I2 | Nuclear receptor subfamily 1 group I member 2 | −1.6318 | 42.835 | 0.008047 |
| ASS1P13 | Argininosuccinate synthetase 1 pseudogene 13 | −1.2125 | 16.312 | 0.0476 |
| GNLY | Granulysin | −0.9048 | 8.032 | 0.003075 |
| SAMD3 | Sterile alpha motif domain containing 3 | −0.8114 | 6.477 | 0.006021 |
| ITM2C | Integral membrane protein 2C | −0.7768 | 5.981 | 0.008414 |
| SIRPG | Signal-regulatory protein gamma | −0.7626 | 5.789 | 0.00083 |
| PYHIN1 | Pyrin and HIN domain family member 1 | −0.7397 | 5.492 | 0.0246 |
| IL7R | Interleukin 7 receptor | −0.6883 | 4.879 | 0.000688 |
| TLE5 | Amino-terminal enhancer of split | −0.594 | 3.927 | 0.000246 |
| TMEM204 | Transmembrane protein 204 | −0.5689 | 3.706 | 0.0214 |
| TXNDC5 | Thioredoxin domain containing 5 (endoplasmic reticulum) | −0.5602 | 3.633 | 0.0247 |
| GNLY | Granulysin | −0.5543 | 3.583 | 1.47E-10 |
| FAM83A-AS1 | FAM83A antisense RNA 1 | −0.5483 | 3.534 | 3.61E-06 |
| RNF213 | Ring finger protein 213 | −0.5095 | 3.232 | 6.75E-05 |
| SLAMF6 | SLAM family member 6 | −0.5078 | 3.219 | 0.0473 |
| WASH3P | WAS protein family homolog 3 pseudogene | −0.488 | 3.076 | 0.0071 |
| ADGRG1 | Adhesion G protein-coupled receptor G1 | −0.4751 | 2.986 | 1.35E-09 |
| GZMK | Granzyme K | −0.4613 | 2.893 | 3.27E-07 |
| POLR3GL | Polymerase (RNA) III (DNA directed) polypeptide G (32kD)-like | −0.4545 | 2.848 | 2.25E-05 |
| SERPINA13P | Serpin peptidase inhibitor, clade A (alpha-1 antiproteinase, antitrypsin), member 13, pseudogene | −0.4539 | 2.844 | 0.000964 |
| IL32 | Interleukin 32 | −0.4476 | 2.803 | 1.8E-06 |
| RUNDC3A | RUN domain containing 3A | −0.4169 | 2.612 | 0.0124 |
FIGURE 1The protein–protein interaction (PPI) network analysis of differentially expressed genes in KD. The network was constructed by Cytoscape based on the PPI correlations from the STRING database.
Top 15 hub genes with high degrees of connectivity in patients with KD.
| Genes | Betweenness centrality | Closeness centrality | Degree |
|---|---|---|---|
| IL1B | 0.19793249 | 0.43856333 | 54 |
| ITGAM | 0.08124394 | 0.40347826 | 44 |
| TLR2 | 0.10349151 | 0.41281139 | 42 |
| CXCL8 | 0.07645807 | 0.40773286 | 42 |
| SPI1 | 0.08767911 | 0.38538206 | 34 |
| S100A12 | 0.07193145 | 0.36535433 | 29 |
| MMP9 | 0.0400542 | 0.38410596 | 27 |
| PRF1 | 0.05125956 | 0.37001595 | 27 |
| TLR8 | 0.01113809 | 0.36535433 | 26 |
| TREM1 | 0.01784325 | 0.35474006 | 25 |
| CD44 | 0.08380828 | 0.38157895 | 25 |
| UBB | 0.10610165 | 0.33237822 | 24 |
| FCER1G | 0.01477434 | 0.33819242 | 23 |
| IL7R | 0.01549726 | 0.3625 | 23 |
| FCGR1A | 0.01234507 | 0.35474006 | 21 |
FIGURE 2Enrichment network of the shared DEGs based on biological processes. Biological process network of differentially expressed genes of KD patients using the BINGO app of Cytoscape. Large nodes indicate more genes involved, and the size of a node is proportional to the number of targets in the GO category. Yellow nodes indicate the genes playing a significant role in KD: p-value < 0.05.
List of top 20 significantly overrepresented GO categories derived from the BINGO analysis output, based on our data. The list has been arranged in descending order of node size.
| Name | Description GO | Average shortest path length | Betweenness centrality | Closeness centrality | Node size | Adjusted | No. of genes |
|---|---|---|---|---|---|---|---|
| 65,007 | Biological regulation | 4.07075472 | 0.41100745 | 0.24565469 | 23.7486842 | 0.0394 | 43 |
| 50,896 | Response to stimulus | 4.27830189 | 0.24678813 | 0.2337376 | 20.4939015 | 0.0000 | 40 |
| 23,052 | Signaling | 4.65566038 | 0.02528647 | 0.2147923 | 17.8885438 | 0.0043 | 50 |
| 6,950 | Response to stress | 5.17924528 | 0.03215335 | 0.19307832 | 16.3707055 | 0.0000 | 67 |
| 23,033 | Signaling pathway | 5.52358491 | 0.01811721 | 0.18104184 | 15.7480157 | 0.0009 | 105 |
| 23,060 | Signal transmission | 5.16509434 | 0.02117959 | 0.19360731 | 14.832397 | 0.0314 | 28 |
| 23,046 | Signaling process | 4.58018868 | 0.02435299 | 0.21833162 | 14.832397 | 0.0314 | 20 |
| 7,165 | Signal transduction | 5.52358491 | 0.02548615 | 0.18104184 | 14.4222051 | 0.0164 | 34 |
| 2,376 | Immune system process | 4.55660377 | 0.07892457 | 0.2194617 | 14.1421356 | 0.0000 | 62 |
| 42,221 | Response to chemical stimulus | 5.14622642 | 0.01607496 | 0.19431714 | 13.8564065 | 0.0009 | 48 |
| 65,008 | Regulation of biological quality | 4.95283019 | 0.03262659 | 0.20190476 | 13.4164079 | 0.0162 | 32 |
| 6,955 | Immune response | 5.0990566 | 0.03425987 | 0.19611471 | 13.114877 | 0.0000 | 42 |
| 7,166 | Cell surface receptor linked signaling pathway | 6.51886792 | 0 | 0.15340087 | 12.9614814 | 0.0031 | 80 |
| 6,952 | Defense response | 6.0754717 | 0.00547133 | 0.16459627 | 12.6491106 | 0.0000 | 37 |
| 23,034 | Intracellular signaling pathway | 6.10377358 | 0.00418892 | 0.16383308 | 12.1655251 | 0.0132 | 45 |
| 10,033 | Response to organic substance | 5.94811321 | 0.00434888 | 0.16812054 | 11.6619038 | 0.0009 | 52 |
| 35,556 | Intracellular signal transduction | 6.33490566 | 0.01587904 | 0.15785555 | 11.3137085 | 0.0019 | 55 |
| 9,611 | Response to wounding | 6.16509434 | 0.00429051 | 0.16220352 | 10.5830052 | 0.0000 | 55 |
| 6,468 | Protein amino acid phosphorylation | 8.49528302 | 3.87E-04 | 0.11771238 | 9.38083152 | 0.0438 | 141 |
| 6,954 | Inflammatory response | 7.06132075 | 4.47E-05 | 0.14161657 | 8.94427191 | 0.0001 | 22 |
FIGURE 3Graphical representation of significantly overrepresented individual categories of GO molecular function for the upregulated genes.
Significantly enriched GO Molecular functions of DEGs.
| Title | z-value | FDR | Fold enrichment | N genes | Genes |
|---|---|---|---|---|---|
| GO: 0043312, neutrophil degranulation | 17.3639 | 0 | 7.6743 | 50 | STXBP2, FCAR, CRISPLD2, LILRA3, RETN, S100P, CDA, TCN1, ITGAM, BST1, CAMP, OSCAR, PGLYRP1, CD55, MMP9, CEACAM8, LCN2, SLPI, CLEC4D, CYSTM1, FOLR3, MCEMP1, MMP25, QSOX1, RAB27A, VNN1, PGM2, GPR84, FTH1, CKAP4, S100A11, MAPK14, S100A12, ORM1, SIGLEC14, CANT1, TLR2, HP, QPCT, SIRPB1, CEACAM3, TNFAIP6, BPI, HSPA1A, FCER1G, CEACAM1, GYG1, ANXA3, RAB31, and CD44 |
| GO: 0035580, specific granule lumen | 15.6351 | 0 | 17.9727 | 15 | ORM1, CANT1, HP, QPCT, BPI, RETN, TCN1, CAMP, OSCAR, PGLYRP1, LCN2, SLPI, FOLR3, QSOX1, and RAB27A |
| GO: 1904724, tertiary granule lumen | 15.536 | 0 | 18.9095 | 14 | MMP9, PGLYRP1, OSCAR, QSOX1, FOLR3, CDA, CAMP, TCN1, TNFAIP6, ORM1, FTH1, QPCT, HP, and CANT1 |
Significantly enriched KEGG pathway of DEGs.
| Title | z-value | FDR | Fold enrichment | N genes | Genes |
|---|---|---|---|---|---|
| Systemic lupus erythematosus | 4.758 | 0 | 3.9336 | 10 | H2AC21, HLA-DRB5, H2BC18, C1QB, H2AC20, FCGR1A, H2BC21, H2AC18, H2BC5, and H3C4 |