| Literature DB >> 34916285 |
Vanessa Aguiar-Pulido1, Paul Wolujewicz1, Alexander Martinez-Fundichely2,3, Eran Elhaik4, Gaurav Thareja5, Alice Abdel Aleem6, Nader Chalhoub6, Tawny Cuykendall2,3, Jamel Al-Zamer7, Yunping Lei8, Haitham El-Bashir7, James M Musser9,10, Abdulla Al-Kaabi11, Gary M Shaw12, Ekta Khurana2,3, Karsten Suhre5, Christopher E Mason1,2,3, Olivier Elemento1,2,3,13, Richard H Finnell8,14,15, M Elizabeth Ross16.
Abstract
Spina bifida (SB) is a debilitating birth defect caused by multiple gene and environment interactions. Though SB shows non-Mendelian inheritance, genetic factors contribute to an estimated 70% of cases. Nevertheless, identifying human mutations conferring SB risk is challenging due to its relative rarity, genetic heterogeneity, incomplete penetrance, and environmental influences that hamper genome-wide association studies approaches to untargeted discovery. Thus, SB genetic studies may suffer from population substructure and/or selection bias introduced by typical candidate gene searches. We report a population based, ancestry-matched whole-genome sequence analysis of SB genetic predisposition using a systems biology strategy to interrogate 298 case-control subject genomes (149 pairs). Genes that were enriched in likely gene disrupting (LGD), rare protein-coding variants were subjected to machine learning analysis to identify genes in which LGD variants occur with a different frequency in cases versus controls and so discriminate between these groups. Those genes with high discriminatory potential for SB significantly enriched pathways pertaining to carbon metabolism, inflammation, innate immunity, cytoskeletal regulation, and essential transcriptional regulation consistent with their having impact on the pathogenesis of human SB. Additionally, an interrogation of conserved noncoding sequences identified robust variant enrichment in regulatory regions of several transcription factors critical to embryonic development. This genome-wide perspective offers an effective approach to the interrogation of coding and noncoding sequence variant contributions to rare complex genetic disorders.Entities:
Keywords: myelomeningocele; neural tube defects; pathway analysis; rare variant enrichment; whole-genome sequence
Mesh:
Substances:
Year: 2021 PMID: 34916285 PMCID: PMC8713748 DOI: 10.1073/pnas.2106844118
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.WGS analysis overview. (A) The admixture composition of the ethnically diverse cohort of 149 SB cases and 149 ancestry-matched controls used in the analysis. For brevity, the nine gene pools were collapsed by continent. (B) The strategy used to interrogate WGS data. (C) The proportion of variants found in the cohort by type.
Genes found in this machine learning strategy to have high discriminatory potential between SB cases and controls that were previously found to be differentially expressed in human fetal NTD versus healthy control amniocytes
| Gene | Expression up/down | Fold change (log2) | Adjusted |
|
| + | 2.82 | 0.02 |
|
| + | 3.15 | 0.00 |
|
| + | 2.92 | 0.01 |
|
| + | 2.69 | 0.04 |
|
| − | 2.96 | 0.01 |
|
| − | 2.37 | 0.02 |
|
| − | 2.69 | 0.02 |
|
| − | 2.81 | 0.00 |
|
| − | 2.23 | 0.04 |
Amniocyte data (log2 fold changes and adjusted P values) reported by Nagy et al., 2006 (34).
*These differentially expressed genes are also found in significantly overrepresented pathways obtained in our analysis.
†These differentially expressed genes have been associated with neuronal synapse assembly and axon pathfinding.
Fig. 2.The genes with the greatest potential to discriminate between SB cases and non-SB controls and their relationships in pathways. (A) The genes with high discriminatory potential to distinguish SB cases and controls significantly enrich an early progenitor class, gene coexpression module identified in a transcriptome WGCNA study of midgestation human cortex (35). The modules most highly enriched in rare variants found in individuals with developmental delay (DD, neuronal regulation module) or autism spectrum disorder (ASD, neuronal regulation and neurobehavior modules) (Walker et al.) are distinct from SB (this study, early neural progenitor proliferation module). (B) The pathways related to immunity are enriched with genes that contain LGD mutations in SB cases and impact the interferon arm of the HSV-1 pathway. (C) SB risk genes affect cytoskeletal regulation. The genes enriched with LGD variants in SB cases disrupt RhoGDI signaling and actin-myosin components of the cytoskeleton. Red stars in B and C indicate LGD-enriched genes.
Pathways enriched with genes of high discriminatory potential between SB cases and healthy controls
| Pathway | Adjusted | Genes |
| Carbon metabolism | 0.00081 |
|
| Cobalamin (Cbl, vitamin B12) transport and metabolism | 0.00099 |
|
| Glyoxylate and dicarboxylate metabolism | 0.00358 |
|
| Propanoate metabolism | 0.00449 |
|
| Herpes simplex virus 1 infection | 0.00685 |
|
| DNA damage | 0.00885 |
|
| ECM–receptor interaction | 0.01062 |
|
| RhoGDI pathway | 0.01321 |
|
| Codeine and morphine metabolism | 0.01358 |
|
| Sertoli-Sertoli cell junction dynamics | 0.01605 |
|
| Homologous DNA pairing and strand exchange | 0.01793 |
|
| NAD metabolism, sirtuins, and aging | 0.02171 |
|
| Pentose phosphate pathway | 0.02427 |
|
| PERK regulates gene expression | 0.02427 |
|
| Interaction between L1 and ankyrins | 0.02427 |
|
| Cell cycle checkpoints | 0.02468 |
|
| Valine, leucine, and isoleucine degradation | 0.02575 |
|
| Amino acid transport across the plasma membrane | 0.02638 |
|
| Aurora A signaling | 0.02638 |
|
| Oncogene-induced senescence | 0.02860 |
|
TF genes whose regulatory regions are significantly enriched with rare variants
| TF | Full name | Adjusted | Coordinates |
|
| Zinc finger protein 274 | 1.64E-11 | GeneHancer |
|
| Regulatory factor X5 | 5.25E-05, 7.56E-08 | hESC CTCF loops (naïve and primed) |
|
| MYC associated factor X | 6.37E-05, 0.018 | CTCF loop conserved across tissues and hESC (naïve) |
|
| JunD proto-oncogene, AP-1 TF | 0.026 | hESC CTCF loops (primed) |
Fig. 3.TF genes whose regulatory regions are enriched with rare noncoding SNVs and their interactions. (A) The location of rare noncoding variants within the CTCF loops spanning MAX and JUND in cases. (B) The pathways regulating cell processes are impacted by rare noncoding variants. The regulatory regions of MAX and JUND are enriched in rare SNVs, impacting the JNK and p38 signaling pathways. Red stars indicate rare variant enrichment of regulatory regions in SB cases. (C) The interaction partners of ZNF274 based on data from the Search Tool for the Retrieval of Interacting GenesProteins (STRING).