| Literature DB >> 29979707 |
Cherif Ben Hamda1,2,3, Raphael Sangeda4, Liberata Mwita4, Ayton Meintjes5, Siana Nkya4, Sumir Panji5, Nicola Mulder5, Lamia Guizani-Tabbane2,6, Alia Benkahla1,2, Julie Makani3, Kais Ghedira1,2.
Abstract
A chronic inflammatory state to a large extent explains sickle cell disease (SCD) pathophysiology. Nonetheless, the principal dysregulated factors affecting this major pathway and their mechanisms of action still have to be fully identified and elucidated. Integrating gene expression and genome-wide association study (GWAS) data analysis represents a novel approach to refining the identification of key mediators and functions in complex diseases. Here, we performed gene expression meta-analysis of five independent publicly available microarray datasets related to homozygous SS patients with SCD to identify a consensus SCD transcriptomic profile. The meta-analysis conducted using the MetaDE R package based on combining p values (maxP approach) identified 335 differentially expressed genes (DEGs; 224 upregulated and 111 downregulated). Functional gene set enrichment revealed the importance of several metabolic pathways, of innate immune responses, erythrocyte development, and hemostasis pathways. Advanced analyses of GWAS data generated within the framework of this study by means of the atSNP R package and SIFT tool identified 60 regulatory single-nucleotide polymorphisms (rSNPs) occurring in the promoter of 20 DEGs and a deleterious SNP, affecting CAMKK2 protein function. This novel database of candidate genes, transcription factors, and rSNPs associated with SCD provides new markers that may help to identify new therapeutic targets.Entities:
Mesh:
Year: 2018 PMID: 29979707 PMCID: PMC6034806 DOI: 10.1371/journal.pone.0199461
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of included individual microarray dataset.
| GEO Accession Number | Tissue | Number of samples | Microarray Platform | Reference study |
|---|---|---|---|---|
| GSE11524 | Peripheral blood | 30(12/18) | Affymetrix Human Genome U133 plus 2.0 | [ |
| GSE16728 | Peripheral blood | 20(10/10) | Affymetrix Human Genome U133 plus 2.1 | [ |
| GSE53441 | Peripheral blood | 34(10/24) | Affymetrix Human Genome U133 plus 2.2 | [ |
| GSE31757 | Whole blood | 8(3/5) | Affymetrix Human Exon 1.0 ST | [ |
| GSE35007 | Whole blood | 251(61/190) | Illumina HumanHT-12 v4 | [ |
Inclusion criteria: Only samples homozygotes SS and submitted to Globin reduction were selected.
*: Consists of two studies with two datasets in total
MetaQC quantitative quality control measures for gene expression data.
| Dataset | IQC | EQC | CQCg | CQCp | AQCg | AQCp | Rank |
|---|---|---|---|---|---|---|---|
| GSE35007 | 2.53 | 3.7 | 301.34 | 307.65 | 145.57 | 220.18 | 2.17 |
| GSE11524 | 5.33 | 3.6 | 148.75 | 307.65 | 101.43 | 210.4 | 2.5 |
| GSE16728-A | 3.58 | 3.7 | 145.79 | 307.65 | 78.46 | 157.71 | 3 |
| GSE31757 | 3.97 | 1.17 | 102.06 | 307.65 | 44.78 | 138.97 | 3.92 |
| GSE53441 | 7.02 | 3.82 | 58.5 | 73.32 | 27.43 | 37.71 | 4 |
| GSE16728-B | 0.41 | 3.6 | 33.06 | 212.76 | 18.27 | 88.93 | 5.42 |
Inclusion Criteria: Dataset with good performance in at least five quality control criteria
*: Low performance
Top 20 shared DEGs identified in the meta-analysis ranked by average Log2FC.
| Entrez Gene ID | HGNC Gene symbol | Gene Description | Average Log2FC | FDR |
|---|---|---|---|---|
| 9829 | DnaJ heat shock protein family (Hsp40) member C6 | 4,938 | 0,017 | |
| 221687 | Ring finger protein 182 | 4,003 | 0,002 | |
| 759 | carbonic anhydrase I | 3,366 | 1,38E-18 | |
| 3045 | Hemoglobin subunit delta | 3,294 | 1,38E-18 | |
| 2994 | Glycophorin B (MNS blood group) | 3,246 | 1,38E-18 | |
| 9911 | Transmembrane and coiled-coil domain family 2 | 3,166 | 1,38E-18 | |
| 2993 | Glycophorin A (MNS blood group) | 3,070 | 1,38E-18 | |
| 8140 | Solute carrier family 7 (amino acid transporter light chain, L system), member 5 | 2,999 | 0,0004 | |
| 66008 | Trafficking protein, kinesin binding 2 | 2,985 | 0,023 | |
| 493856 | CDGSH iron sulfur domain 2 | 2,980 | 1,38E-18 | |
| 79971 | Wntless Wnt ligand secretion mediator | -1,766 | 0,036 | |
| 6934 | Transcription factor 7-like 2 (T-cell specific, HMG-box) | -1,562 | 0,016 | |
| 29909 | G protein-coupled receptor 171 | -1,523 | 0,027 | |
| 79668 | Poly(ADP-ribose) polymerase family member 8 | -1,344 | 0,002 | |
| 64167 | Endoplasmic reticulum aminopeptidase 2 | -1,343 | 0,048 | |
| 91526 | Ankyrin repeat domain 44 | -1,300 | 0,001 | |
| 5788 | Protein tyrosine phosphatase, receptor type, C | -1,275 | 0,005 | |
| 1362 | Carboxypeptidase D | -1,209 | 0,030 | |
| 50852 | T cell receptor associated transmembrane adaptor 1 | -1,169 | 0,009 | |
| 23224 | Spectrin repeat containing, nuclear envelope 2 | -1,159 | 0,047 | |
Fig 3Network-based meta-analysis of hub genes.
A: Protein interaction network analysis indicates a central role for SKP1, NAPA, EPB42, and ARPC5 in SCD anemia. All 335 genes served as input for the STRING database with the high confidence interaction score 0.7, and a network was built by means of Cytoscape. The network topology was analyzed by the Cytoscape NetworkAnalyzer tool, and then network topology measures such as the degree (represented by the node size scale), betweenness (represented by police size scale), closeness centrality, and clustering coefficient were calculated. B and C: The top-ranked subnetwork identified by the OH-PIN algorithm (threshold: 2, overlapping score 0.5) using CytoCluster (a Cytoscape plugin).
Fig 4Transcriptional regulatory subnetwork based on microarray meta-analysis.
Regulatory network analysis was performed using RNEA R/package to determine the regulation complexes upstream of DEGs identified in the meta-analysis. Genes carrying in their promoter, a significant regulatory SNPs are marked by a yellow star. Each node represents a DEG or enriched transcription factor, depending on their shapes. The node size indicate greater significance of the enrichment. The edges reflect the relationships between the nodes.