| Literature DB >> 28120152 |
Giovanna Morello1, Maria Guarnaccia1, Antonio Gianmaria Spampinato1, Valentina La Cognata1,2, Velia D'Agata2, Sebastiano Cavallaro3.
Abstract
Amyotrophic lateral sclerosis (ALS) is a devastating and still untreatable motor neuron disease. Despite the molecular mechanisms underlying ALS pathogenesis that are still far from being understood, several studies have suggested the importance of a genetic contribution in both familial and sporadic forms of the disease. In addition to single-nucleotide polymorphisms (SNPs), which account for only a limited number of ALS cases, a consistent number of common and rare copy number variations (CNVs) have been associated to ALS. Most of the CNV-based association studies use a traditional candidate-gene approach that is inadequate for uncovering the genetic architectures of complex traits like ALS. The emergent paradigm of "systems biology" may offer a new perspective to better interpret the wide spectrum of CNVs in ALS, enabling the characterization of the complex network of gene products underlying ALS pathogenesis. In this review, we will explore the landscape of CNVs in ALS, putting specific emphasis on the functional impact of common CNV regions and genes consistently associated with increased risk of developing disease. In addition, we will discuss the potential contribution of multiple rare CNVs in ALS pathogenesis, focusing our attention on the complex mechanisms by which these proteins might impact, individually or in combination, the genetic susceptibility of ALS. The comprehensive detection and functional characterization of common and rare candidate risk CNVs in ALS susceptibility may bring new pieces into the intricate mosaic of ALS pathogenesis, providing interesting and important implications for a more precise molecular biomarker-assisted diagnosis and more effective and personalized treatments.Entities:
Keywords: Amyotrophic lateral sclerosis (ALS); Copy number variations (CNVs); Genomics; Systems biology
Mesh:
Year: 2017 PMID: 28120152 PMCID: PMC5820374 DOI: 10.1007/s12035-017-0393-x
Source DB: PubMed Journal: Mol Neurobiol ISSN: 0893-7648 Impact factor: 5.590
Fig. 1Schematic representation of the complex mosaic of ALS pathogenesis
Fig. 2The systems biology approach: from integration of large-scale “omics” data to personalized medicine practice
Fig. 3The most widely used methods for genome-wide and targeted CNVs detection and analysis
Characteristics of the most significant CNVs and overlapped genes from different genome-wide studies showing association with ALS
| Chr | Position start–end (CNV length) | CNV type | CNV detection method | ALS patients | Controls | Overlapped genes¥ | Reference |
|---|---|---|---|---|---|---|---|
| 1 | 41,119,815–41,147,030 | Gain | SNP array | 8 | 1 | None | [ |
| 1 | Not reported | Deletion | SNP array | 3 | 0 | Intron of | [ |
| 1 | 52,994,160–53,423,907 | Gain | BAC array | 1 | 0 | ZYG11B, ECHDC2, SCP2, PODN, | [ |
| 3 | 2,944,819–2,947,844 | Gain | SNP array | 2 | 0 | Intron of | [ |
| 3 | 89,485,137–89,499,861 | Loss | SNP array | 2 | 11 |
| [ |
| 3 | 33,270,957–33,296,620 | Gain | SNP array | 3 | 18 | FBXL2 | [ |
| 3 | Not reported | Deletion | SNP array | 1 | 0 | Promoter of | [ |
| 3 | Not reported | Loss | SNP array | 1 | 0 | 52% of | [ |
| 3 | 60,357,746–60,604,845 | Gain | BAC array | 1 | 0 | FHIT | [ |
| 4 | 761,587–1,014,752 | Gain | SNP array | 16 | 4 |
| [ |
| 4 | Not reported | Loss | SNP array | 1 | 0 | 3% of | [ |
| 4 | Not reported | Deletion | SNP array | 1 | 0 | Promoter of | [ |
| 5 | 28,842,013–28,912,873 | Gain | SNP array | 7 | 0 | None | [ |
| 5 | 45,850,032–46,384,240 | Gain | SNP array | 264 | 174 |
| [ |
| 5 | 70,925,030–70,953,012 | Gain | MLPA assay | 121 | 68 |
| [ |
| Loss | qPCR | 14 | 4 | ||||
| 6 | 109,034,609–109,074,882 | Gain | SNP array | 1 | 0 | Intron of | [ |
| 6 | 123,569,244–124,360,902 | Gain | BAC array | 1 | 0 |
| [ |
| 7 | 153,031,806–154,276,435 | Loss and gain | SNP array | 10 | 13 |
| [ |
| 7 | 61,663,407–62,155,064 | Gain | SNP array | 177 | 132 | None | [ |
| 8 | 47,062,007–47,406,312 | Gain | SNP array | 30 | 8 |
| [ |
| 8 | 47,062,007–47,711,911 | Gain | SNP array | 3 | 18 | None | [ |
| 8 | 43,689,385–43,910,848 | Gain | SNP array | 74 | 52 | None | [ |
| 8 | 73,609,541–73,629,084 | Gain | SNP array | 1 | 0 | Region overlaps with 86.50% of | [ |
| 8 | 144,686,338–144,765,210 | Loss and gain | SNP array | 6 | 0 | ZC3H3, | [ |
| 10 | 1,050,000–1,090,000 | Gain | aCGH | 46 | 10 |
| [ |
| 11 | 50,545,00–50,586,426 | Loss | SNP array | 21 | 2 | None | [ |
| 11 | 539,119–652,407 | Loss | SNP array | 5 | 0 |
| [ |
| 12 | 36,528,296–36,801,139 | Gain | SNP array | 157 | 8 | None | [ |
| 14 | 103,232,016–103,721,150 | Gain | SNP array | 5 | 0 | KLC1, XRCC3, ZFYVE21, | [ |
| 14 | Not reported | Deletion | SNP array | 5 | 0 | None | [ |
| 15 | 20,387,566–44,672,396 | Loss and gain | SNP array | 12 | 34 | TUBGCP5, CYFIP1, NIPA2, | [ |
| 15 | Not reported | Deletion | SNP array | 2 | 0 | None | [ |
| 15 | 19,818,989–20,084,080 | Deletion | SNP array | 2 | 0 | LOC650137 (+), | [ |
| 16 | 76,578,045–77,657,555 | Loss | SNP array | 0 | 6 | CLEC3A, WWOX | [ |
| 16 | 87,957,353–87,971,263 | Gain | SNP array | 2 | 0 | Intron of | [ |
| 16 | 969,913–1,834,962 | Gain | SNP array | 8 | 15 | SOX8, SSTR5, C1QTNF8, | [ |
| 17 | Not reported (1159 bp) | Gain | SNP array | 1 | 0 |
| [ |
| 19 | 20,860,930–20,875,787 | Loss | SNP array | 15 | 2 | None | [ |
| 19 | 32,615,675-32,935,836 | Gain | SNP array | 165 | 122 | RDH13 | [ |
| 19 | Not reported | Loss | SNP array | 2 | 0 | Intron of | [ |
| 22 | 23,696,411–24,240,667 | Loss and gain | SNP array | 10 | 28 | CRYBB3, CRYBB2, LOC91353, LRP5L, CRYBB2P1 | [ |
| 22 | 21,011,312–21,394,287 | Loss | SNP array | 0 | 11 | ZNF280B, ZNF280A, PRAME, BCR, | [ |
| 22 | 29,489,697–29,489,738 | Deletion | SSCP | 5 | 2 |
| [ |
| X | 139,400,576–telomere | Deletion | BAC array | 1 | 0 | >100 (No ALS candidate genes) | [ |
| X | 139,526,743–139,942,807 | Gain | BAC array | 1 | 0 |
| [ |
The table shows the most significant CNV loci and relative genes that partially or completely fall within them. Chromosomal positions are referred to the human reference genome assembly corresponding to each individual study: in particular [115], [40], and [81] refer to the NCBI reference sequence build 36, [98] refers to the build 38 and [41] refers to UCSC Genome Browser, May 2004 Freeze. In the case of [71], the authors described deletion coordinates by using the numbering of the published sequence by Lees et al. EMBO J. 1988. Genes that may be reasonable ALS candidates are in bold
Chr Chromosome, SSCP single-strand conformation polymorphism analysis, Loss heterozygous deletion, Deletion homozygous deletion
¥Gene symbols correspond to the NCBI Refseq names
Genes overlapped with CNV loci with a potential relevance for ALS susceptibility
| Gene | Description | Chromosomal location | GO Processes Associated | Type of CNV |
|---|---|---|---|---|
| SMN1 and SMN2 | Survival motor neurons 1 and 2 | 5q13.2 | mRNA processing, spliceosomal complex assembly, synaptic transmission | Gain and loss |
| NEFH | Neurofilament heavy subunit | 22q12.2 | Cellular response to oxidative stress, neurofilament cytoskeleton organization | Deletion |
| IDI1 and IDI2 | Isopentenyl diphosphate delta isomerase 1 and 2 | 10p15.3 | Cholesterol biosynthesis | Gain |
| DPP6 | Dipeptidyl-peptidase 6 | 7q36.2 | Regulation of neuronal action potential, regulation of potassium ion transport, protein localization to plasma membrane, proteolysis | Gain and loss |
| NIPA1 | Non-imprinted in Prader-Willi/Angelman syndrome 1 | 15q11.2 | Transmembrane transport | Loss |
| EPHA3 | Ephrin type-A receptor 3 | 3p11.2 | Axon guidance, cell adhesion, cell migration, signaling, cytoskeleton organization | Loss |
| AATK | Apoptosis-associated tyrosine kinase | 17q25.3 | Brain development, regulation of axon extension, neuron apoptotic process | Gain |
| BAIAP2 | BAI1-associated protein 2 | 17q25 | G protein coupled receptor signaling pathway, neurotransmitter secretion | Gain |
| CHMP6 | Charged multivesicular body protein 6 | 17q25.3 | Endosomal transport, membrane organization, nucleus organization, protein transport | Gain |
| IGFALS | Insulin-like growth factor binding protein acid labile subunit | 16p13.3 | Cell adhesion, cellular protein metabolic process, signal transduction | Gain |
| CACNA1H | Calcium channel, voltage-dependent, T type, alpha 1H subunit | 16p13.3 | Axon guidance, calcium ion import, membrane depolarization during action potential, muscle contraction, regulation of membrane potential | Gain |
| MAPK8IP3 | Mitogen-activated protein kinase 8 interacting protein 3 | 16p13.3 | JNK cascade, axon guidance, neuron projection development, positive regulation of neuron differentiation, regulation of gene expression, vesicle-mediated transport | Gain |
| BAIAP3 | BAI1-associated protein 3 | 16p13.3 | G-protein coupled receptor signaling pathway, neurotransmitter secretion | Gain |
| UBE2I | Ubiquitin-conjugating enzyme E2I | 16p13.3 | DNA repair, cell cycle, cell division, cellular protein metabolic process, chromosome segregation, nucleotide-excision repair, post-translational protein modification, protein ubiquitination | Gain |
| SPSB3 | splA/ryanodine receptor domain and SOCS box containing 3 | 16p13.3 | Intracellular signal transduction, protein ubiquitination | Gain |
| KCNIP4 | Kv channel interacting protein 4 | 4p15.32 | Potassium ion transport | Deletion |
| KCNB2 | Potassium channel, voltage gated Shab related subfamily B, member 2 | 8q13.2 | Potassium ion transport, synaptic transmission | Gain |
| KCNQ5 | Potassium channel, voltage gated KQT-like subfamily Q, member 5 | 6q14 | Potassium ion transport, synaptic transmission | Loss |
| GSDMD | Gasdermin D | 8q24.3 | Cellular response to extracellular stimulus | Gain & Loss |
| GRIK1 | Glutamate receptor, ionotropic, kainate 1 | 21q22.11 | Central nervous system development, glutamate receptor signaling pathway, ion transport, membrane depolarization, regulation of synaptic transmission, regulation of synaptic plasticity | Loss |
| GRIK2 | Glutamate receptor, ionotropic, kainate 2 | 6q16.3 | Central nervous system development, glutamate receptor signaling pathway, ion transport, membrane depolarization, regulation of synaptic transmission, regulation of synaptic plasticity, regulation of neuron apoptotic process, regulation of JNK cascade | Gain |
| EEF1D | Eukaryotic translation elongation factor 1 delta (guanine nucleotide exchange protein) | 8q24.3 | Cellular protein metabolic process, gene expression, mRNA transcription, positive regulation of I-kappaB kinase/NF-kappaB signaling, regulation of cell death, signal transduction | Gain & Loss |
| ATXN1 | Ataxin 1 | 6p23 | RNA processing, excitatory postsynaptic potential, regulation of transcription, regulation of glial cell proliferation | Loss |
| ATXN3L | Ataxin 3-like | Xp22.2 | Cellular response to misfolded protein, protein deubiquitination, regulation of transcription | Gain |
| SLC1A7 | Solute carrier family 1 (glutamate transporter), member 7 | 1p32.3 | L-glutamate transmembrane transport, neurotransmitter secretion, synaptic transmission | Gain |
| TRDN | Triadin | 6q22.31 | Cellular calcium ion homeostasis, cytoplasmic microtubule organization, endoplasmic reticulum membrane organization, muscle contraction, transmembrane transport | Gain |
| CPLX1 | Complexin 1 | 4p16.3 | Exocytosis, glutamate secretion, regulation of exocytosis, synaptic transmission, synaptic vesicle exocytosis, transport | Gain |
| ANKRD11 | Ankyrin repeat domain 11 | 16q24.3 | Multicellular organism growth, tissue homeostasis | Gain |
| PPP1R13B | Protein phosphatase 1 regulatory subunit 13B | 14q32.33 | Apoptotic process | Gain |
| FOXO3 | Forkhead box O3 | 6q21 | DNA damage response, apoptotic process, cell differentiation, cellular response to oxidative stress, epidermal growth factor receptor signaling pathway, immune response, regulation of neuron differentiation, transcription, tumor necrosis factor-mediated signaling pathway | Gain |
| HFE | Hemochromatosis | 6p21.3 | Antigen processing and presentation, immune response, ion transport, regulation of proteasomal ubiquitin-dependent protein catabolic process, regulation of receptor activity, positive regulation of gene expression, regulation of protein binding, transport | Gain |
| GGTLC2 | Gamma-glutamyltransferase light chain 2 | 22q11.22 | Glutathione metabolic process | Loss |
| ATG7 | Autophagy related 7 | 3p25.3 | Autophagy, cellular homeostasis, cellular protein modification process, cellular response to hyperoxia, central nervous system neuron axonogenesis, cerebral cortex development, membrane organization, mitochondrion organization, regulation of apoptotic process, protein transport, protein ubiquitination | Loss |
| ANXA5 | Annexin A5 | 4q27 | Calcium ion transmembrane transport, regulation of apoptotic process, signal transduction | Loss |
| GEMIN6 | Gem nuclear organelle-associated protein 6 | 2p22.1 | Gene expression, mRNA processing, mRNA splicing | Loss |
| MTMR7 | Myotubularin-related protein 7 | 8p22 | Dephosphorylation | Loss |
| ACYP2 | Acylphosphatase 2, muscle type | 2p16.2 | Phosphate-containing compound metabolic process | Loss |
| ZFP14 | ZFP14 zinc finger protein | 19q13.12 | Regulation of transcription | Loss |
| FMN2 | Formin 2 | 1q43 | Actin filament assembly, cellular response to DNA damage stimulus, cellular response to hypoxia, intracellular signal transduction, intracellular transport, multicellular organismal development, regulation of apoptotic process, protein transport, vesicle-mediated transport | Deletion |
Loss heterozygous deletion; Deletion homozygous deletion
Fig. 4A representative illustration showing the functional correlation between ALS-associated CNV-affected genes and their biological processes. Interaction map represents the most promising candidate genes overlapping CNVs that have been consistently associated with ALS, grouped on the basis of the main biological processes associated with them. The map was created using the MetaCore Pathway Map Creator tool (GeneGo). Genes associated with CNV gain regions are labeled with red dots while genes associated with homozygous or heterozygous deleted CNVs are labeled with blue dots. The “checkerboard” color indicates genes displayed both CNV gains and losses. Detailed information about genes depicted in the figure and related biological processes are reported in Table 2. A detailed legend for the network objects is shown in Supplementary Fig. 1
Fig. 5The functional enrichment analysis of the most plausible candidate genes overlapping ALS-specific CNV loci reveals biological processes relevant to ALS pathogenesis. a Representation of the top 10 most significantly enriched (p value <0.05) canonical GO biological processes associated with genes significantly enriched in rare and novel ALS-specific copy number changes (not reported in controls of each of the individual studies and/or in >2500 controls present in DGV). The analysis was performed using the Gene Ontology and KEGG databases and the list is arranged in descending order with the most significant GO biological processes at the top. Detailed information about the entire list of genes affected by ALS-specific CNV loci are reported in Supplementary Table 1. b GO term pie chart of the top 10 enriched (p < 0.05) “Biological processes” for genes overlapping ALS-specific CNV loci. GO terms or biological features of candidate CNV-affected genes and the percentage of genes represented in each category are indicated
Fig. 6Functional network of known and predicted interactions between proteins encoded by genes affected by rare ALS-specific CNV loci and the most known causative ALS genes. The network was produced by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) v10 (http://string-db.org/) using default settings. Proteins are represented by spheres. Lines linking proteins indicate evidence for interactions: a red line indicates the presence of gene fusion (genes that are sometimes fused into single open reading frames); a green line gene neighborhood (genes that reside within 300 bp on the same strand in the genome); a blue line co-occurrence (gene families whose occurrence patterns across genomes show similarities); a purple line experimental evidence (interaction extracted from protein-protein interaction databases); a yellow line text mining (interaction extracted from scientific literature); a light blue line database (interaction extracted from curated databases); a black line co-expression (proteins whose genes are co-expressed in the same or in other species)