| Literature DB >> 35545758 |
Andi Liu1,2, Astrid M Manuel2, Yulin Dai2, Zhongming Zhao3,4,5.
Abstract
BACKGROUND: Multiple sclerosis (MS) is a debilitating immune-mediated disease of the central nervous system that affects over 2 million people worldwide, resulting in a heavy burden to families and entire communities. Understanding the genetic basis underlying MS could help decipher the pathogenesis and shed light on MS treatment. We refined a recently developed Bayesian framework, Integrative Risk Gene Selector (iRIGS), to prioritize risk genes associated with MS by integrating the summary statistics from the largest GWAS to date (n = 115,803), various genomic features, and gene-gene closeness.Entities:
Keywords: Bayesian framework; Multi-omics; Multiple sclerosis; Single-cell RNA-sequencing; Two-sample Mendelian randomization
Mesh:
Year: 2022 PMID: 35545758 PMCID: PMC9092676 DOI: 10.1186/s12864-022-08580-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Fig. 1Workflow of the study. Multi-omics data input for MS-associated risk gene prioritization and validation are labeled in blue. Primary analyses of MS-associated risk gene prioritization and validation are labeled in orange. Analyses on MS-PRGenes and validation gene sets are labeled in red
Summary of features used in the prioritization of MS-PRGs and validation
| Genetic variants in MS | 200 genetic variants from the currently largest MS GWAS with genome-wide significance |
| Functional Annotation of the Mammalian Genome 5 (FANTOM5) | Annotations of mammalian regulatory components, such as promoters, enhancers lncRNAs and miRNAs (provided by original iRIGS analysis) |
| Genome-scale chromosome conformation capture (Hi-C) | Brain Hi-C data including both short- and long-range interactions among genomic loci (provided by original iRIGS analysis) |
| Expression in MS brain tissue | Differentially expressed genes in MS brain tissue from 21 post-mortem brain samples (11 MS cases and 10 control subjects) |
| DNA methylation in MS brain tissue | Differentially methylated genes from epigenome-wide changes in DNA methylation levels of 28 MS cases and 19 control subjects |
| Gene–gene relationships | Gene interactions from Gene Ontology (GO) network (provided by original iRIGS analysis) |
| Expression quantitative trait loci (eQTL) | Top cis-eQTL of 19 tissues based on tissue expression data from the Genotype-Tissue Expression (GTEx) portal |
| Probability of loss of function (LoF) intolerant (pLI) scores | A high pLI score indicates the gene is more likely to be intolerant towards protein-truncating variant(s) |
| Evolutionary rate | The ratio of nonsynonymous over synonymous substitution rate (dN/dS) |
| Genes of human diseases | From OMIM and ClinVar databases |
| Single-nuclei RNA-sequencing (snRNA-seq) | Cell type-specificity enrichment analysis using a snRNA-seq dataset from the brain tissue of 4 progressive MS patients and 5 non-neurological controls |
| MS drug targets | 32 MS drug targets were collected from the DrugBank database, followed by the enrichment analysis |
| Connectivity Map (CMap) drug signatures | The co-expressed gene-set enrichment analysis (Cogena) R package was performed for the downregulated and upregulated 100 CMap gene sets |
Fig. 2Manhattan plots of results of iRIGS and 2SMR. A Manhattan plot of posterior probabilities of candidate genes from iRIGS. Highlighted genes are candidate risk genes with the highest posterior probability at each index SNP. Genes are labeled with high posterior probability (> 0.75). (B-D) Manhattan plots of 2SMR analyses on whole blood, spleen, and brain cerebellum. Highlighted and labeled genes are significant genes (FDR < 0.05) overlapped with MS-PRGenes
Fig. 3Heatmap of 2SMR analysis estimates of significant genes overlapped with MS-PRGenes in 19 tissues. Gene names are shown on the x-axis. Tissue names are shown in the y-axis ordered by the number of significant genes in each tissue. Red and blue are proportional to the effect size of each gene in each tissue
Fig. 4Exploration of the gene features of MS-PRGenes and MAGMA genes. (A) Comparing the boxplots of pLI scores between MS-PRGenes and MAGMA genes in a two-sided t-test. (B) Comparing the boxplots of pLI scores within the closest gene and non-closest gene groups of MS-PRGenes in a two-sided t-test. (C) Comparing the boxplots of pLI scores within the 2SMR validated gene and 2SMR unvalidated gene groups of MS-PRGenes in a two-sided t-test. (D) Comparing the boxplots of evolutionary rates between MS-PRGenes and MAGMA genes in a two-sided t-test. (E) Comparing the boxplots of evolutionary rates within the closest gene and non-closest gene groups of MS-PRGenes in a two-sided t-test. (F) Comparing the boxplots of evolutionary rates within the 2SMR validated gene and 2SMR unvalidated gene groups of MS-PRGenes in a two-sided t-test. (G) The proportional comparison of MS-PRGenes and MAGMA genes overlapped with disease genes parsed from OMIM in the chi-squared test. (H) The proportional comparison of MS-PRGenes and MAGMA genes overlapped with known genes with disease-associated variants parsed from ClinVar in the chi-squared test
Fig. 5Single-cell context-specific enrichment analysis. (A, B) Single-cell context-specific enrichment analysis of MS-PRGenes in MS snRNA-seq case and control panels, respectively. (C, D) Single-cell context-specific enrichment analysis of MAGMA genes in MS snRNA-seq case and control panels, respectively
Fig. 6CMap drug signatures enriched in the MS-PRGenes depict potential drug repositioning strategies. (A, B) Drugs enriched for the MS-PRGenes are shown on the y-axis. Enrichment scores represent -log2 (false discovery rate), as reported by the Cogena R package. The color is proportional to the enrichment score. (A) Drugs listed on the y-axis show the enrichment in the drug signatures of downregulated 100 CMap gene set. (B) Drugs show the enrichment from the drug signatures of upregulated 100 CMap gene set