| Literature DB >> 29479499 |
Shalini Iyer1, K Ravi Acharya1, Vasanta Subramanian1.
Abstract
C9orf72 is associated with frontotemporal dementia (FTD) and Amyotrophic Lateral Sclerosis (ALS), both of which are devastating neurodegenerative diseases. Findings suggest that an expanded hexanucleotide repeat in the non-coding region of the C9orf72 gene is the most common cause of familial FTD and ALS. Despite considerable efforts being made towards discerning the possible disease-causing mechanism/s of this repeat expansion mutation, the biological function of C9orf72 remains unclear. Here, we present the first comprehensive genomic study on C9orf72 gene. Analysis of the genomic level organization of C9orf72 across select species revealed architectural similarity of syntenic regions between human and mouse but a lack of conservation of the repeat-harboring intron 1 sequence. Information generated in this study provides a broad genomic perspective of C9orf72 which would form a basis for subsequent experimental approaches and facilitate future mechanistic and functional studies on this gene.Entities:
Keywords: Amyotrophic Lateral Sclerosis (ALS); Bioinformatics; C9orf72; Frontotemporal Dementia (FTD); Genome
Year: 2018 PMID: 29479499 PMCID: PMC5822839 DOI: 10.7717/peerj.4391
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Alternate splice variants of C9orf72 gene in different species.
| Species | Gene ID number | Transcript ID number |
|---|---|---|
| Human ( | ENSG00000147894 | ENST00000380003 |
| ENST00000379997 | ||
| ENST00000379995 | ||
| ENST00000461679 | ||
| ENST00000488117 | ||
| Mouse ( | ENSMUSG00000028300 | ENSTMUST00000084724 |
| ENSTMUST00000108126 | ||
| ENSTMUST00000108127 | ||
| ENSTMUST00000130538 | ||
| ENSTMUST00000142628 | ||
| ENSTMUST00000149138 | ||
| ENSTMUST00000156472 | ||
| Fugu ( | ENSTRUG00000001050 | ENSTRUT00000002473 |
| ENSTRUT00000002474 | ||
| ENSTRUT00000002475 | ||
| ENSTRUT00000002476 | ||
| Tetraodon ( | ENSTNIG00000005644 | ENSTNIT00000008515 |
| Zebrafish ( | ENSDARG00000011837 | ENSDART00000015127 |
| ENSDART00000125180 | ||
| ENSDARKT00000126136 | ||
| Stickleback ( | ENSGACG00000005564 | ENSGACT00000007387 |
Details of the intron-exon structure of C9orf72 gene in different species.
| Human | Mouse | Fugu | Tetraodon | Zebrafish | Stickleback | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chromosome location | 9 Reverse strand | 4 Reverse strand | Scaffold_396 forward strand | 10 Reverse strand | 13 forward strand | GROUP XV forward strand | |||||||||||||
| No of transcripts | 5 | 7 | 4 | 1 | 3 | 1 | |||||||||||||
| No of exons | 11 exons 3,200 bp | 11 exons 3,192 bp | 10 exons 1,437 bp | 11 exons 1,437 bp | 10 exons 2,427 bp | 10 exons 2,051 bp | |||||||||||||
| 5 exons 1,873 bp | 10 exons 2,644 bp | 9 exons 1,389 bp | 9 exons 4,394 bp | ||||||||||||||||
| 5 exons 777 bp | 10 exons 3,414 bp | 7 exons 972 bp | 11 exons 2,401 bp | ||||||||||||||||
| 9 exons 954 bp | |||||||||||||||||||
| Length 5′ UTR (bp) | |||||||||||||||||||
| 64 | 124 | 76 | 152 | 95 | 254 | – | – | – | – | – | 197 | – | 173 | 136 | |||||
| Length of 3′ UTR (bp) | 1,690 | 1,079 | 32 | 1,896 | 1,595 | 1,536 | – | – | – | – | – | 841 | 3,005 | 12,546 | 526 | ||||
| Intron length (bp) | 6,266 | 6,622 | 6,559 | 6,871 | 8,515 | 7,012 | 188 | 576 | 576 | 166 | 300 | 517 | 81 | 517 | 687 | ||||
| 1,086 | 1,086 | 1,086 | 1,141 | 3,568 | 1,141 | 340 | 792 | 792 | 792 | 332 | 81 | 2,376 | 81 | 892 | |||||
| 3,054 | 3,054 | 3,054 | 3,568 | 475 | 3,568 | 792 | 681 | 681 | 681 | 250 | 2,376 | 97 | 2,376 | 564 | |||||
| 731 | 731 | 731 | 475 | 1,644 | 475 | 681 | 482 | 482 | 482 | 284 | 97 | 77 | 97 | 880 | |||||
| 1,285 | 1,644 | 5,407 | 1,644 | 482 | 703 | 703 | 703 | 574 | 77 | 3,459 | 77 | 680 | |||||||
| 1,619 | 5,407 | 8,559 | 5,407 | 703 | 144 | 5 | 144 | 678 | 3,459 | 4,066 | 3,459 | 604 | |||||||
| 1,694 | 8,559 | 2,778 | 8,559 | 144 | 701 | 701 | 604 | 4,066 | 87 | 2 | 123 | ||||||||
| 5,853 | 2,778 | 842 | 2,778 | 701 | 2,389 | 2,389 | 175 | 87 | 4,054 | 4,066 | 674 | 175 | 175 | 175 | |||||
| 1,983 | 842 | 116 | 842 | 2,389 | 658 | 4,054 | 87 | 2,130 | 658 | 658 | 658 | ||||||||
| 134 | 116 | 2,476 | 4,054 | 2,476 | 2,476 | 2,476 | |||||||||||||
| Transcription factor sites in intron 1 | 1,401 | 1,493 | 1,478 | 1,547 | 1,930 | 1,575 | 30 | 109 | 109 | 34 | 56 | 103 | 26 | 103 | 150 | ||||
Figure 1Cartoon representation of the C9orf72 gene structure.
Cartoon representation of the alternative splicing transcripts of C9orf72 in human (A), mouse (B) and Fugu (C). Exons are shown as boxes and lines connecting the boxes represent introns. Filled boxes are coding sequence, and empty, unfilled boxes are UTR (UnTranslated Region). The coloring scheme follows that used by Ensembl. A red transcript comes from either the Ensembl automatic annotation pipeline or manual curation by the VEGA/Havana project. A gold transcript is identical between the Ensembl automatic annotation pipeline and manual curation by the VEGA/Havana project. Non-coding transcripts are coloured blue.
Figure 2Phylogenetic analysis of C9orf72.
(A) A rooted phylogenetic tree of all the species that express C9orf72 generated by the Ensemble Compara server. The tree was drawn using TreeDyn (Chevenet et al., 2006). The branch-length scale represents substitutions per base pair. (B) A graph representing the percent of identical amino acids in the orthologues inferred from the gene trees constructed by Ensembl. The query % ID refers to the identity at the amino acid level of the gene of interest when compared with the orthologue in question. Target % ID refers to the percent of identical amino acids in the orthologue compared with the gene of interest. Both images have been downloaded from the Ensembl server.
Figure 3Conserved Synteny between human and mouse genomes.
Synteny Database was used to detect the region of conserved synteny (Catchen, Conery & Postlethwait, 2009). (A) A trace image of the region of conserved synteny between human and mouse genomes. The image, created using a 25-gene sliding window, displays 39 orthologous pairwise clusters. The genes are displayed on their physical locations on the chromosomes, showing them on the proper strand, actual length, and in the proper orientation. Lines connect orthologs together. Genes colored grey are peripheral genes that are located within the same region of the cluster, but are not members of the cluster. (B) A gene homology matrix representing homology of the gene clusters between the chromosomal segments in human and mouse. Genes from the human cluster are placed along the X-axis of the plot while the genes from the murine cluster are placed along the Y-axis. Orthologs are marked at their intersection with well-conserved synteny shown as yellow boxes in a diagonal line in the plot and inversions of genes between the two chromosomes shown perpendicular to one another (blue boxes).
Figure 4Genome graph for human C9orf72.
Schematic representation of the genome graph of C9orf72 as displayed by the UCSC Genome Browser (Kent et al., 2002). The figure shows basewise conservation of 100 vertebrates as well as 11 primates. Blue graphs represent conserved sites whilst red graphs represent faster evolving regions. The predicted CpG islands are also shown in green. The inset shows a region of ˜130 bp of higher conservation in Intron 1 with a transcription factor match to ETS1.
Figure 5Local alignments of intron 1 sequences.
Sequence alignment of Intron 1 sequences using EBI’s EMBOSS matcher (Rice, Longden & Bleasby, 2000) . (A) Hexanucleotide repeat site in human (200 bp upstream of the repeat unis, the repeat sequence themselves, and 200 bp downstream of the repeat units) aligned with the intron 1 sequence of transcript 2 in Fugu. (B) The hexanucleotide repeat site in human (same as above but with the repeat units removed) aligned with the intron 1 sequence of transcript 2 in Fugu. (C) The hexanucleotide repeat site in human aligned with the intron 1 sequence of transcript 2 in Fugu.
Transcription factors identified using MatInspector (Cartharius et al., 2005).
| Matrix family | |||
|---|---|---|---|
| V$ETSF | 0.00685405 | Human and murine ETS1 factors | Motor neuron axon guidance, neuron fate specification, neuron maturation, peripheral nervous system neuron development, positive regulation of neuron death and differentiation. |
| V$SORY | 0.00970298 | SOX/SRY-sex/testis determining and related HMG box factors | Brain development, Central, enteric, sympathetic and peripheral nervous system development, CNS neuron differentiation, myelination in peripheral nervous system, spinal cord development, spinal cord motor neuron differentiation and ventral spinal cord interneuron specification. |
| V$RXRF | 0.0124996 | RXR heterodimer binding sites | Brain development, central and peripheral nervous system development, CNS neuron differentiation, CNS projection neuron axonogenesis, dopaminergic and midbrain neuron differentiation, forebrain neuron development, neuron differentiation, migration and maturation, negative regulation of neuron apoptotic process, and positive regulation of neuron differentiation. |
| V$WHNF | 0.0208955 | Winged helix binding sites | Neuron fate commitment, ventral spinal cord interneuron fate commitment. |
| V$HOMF | 0.0231715 | Homeodomain transcription factors | Brain development, Central and enteric nervous system development, neuron differentiation, neuron migration, negative regulation of neuron apoptotic process and regulation of neuron differentiation. |
| V$BPTF | 0.0255628 | Bromodomain and PHD domain transcription factors | Brain development. |
| V$RP58 | 0.0257514 | RP58 (ZFP238) zinc finger protein | Cerebellum development, cerebral cortex development, hippocampus development and neuron development. |
| V$FKHD | 0.0346126 | Fork head domain factors | Brain development and morphogenesis, enteric, sympathetic and peripheral nervous system development, CNS neuron development, myelination in central nervous system, regulation of nervous system development, cell morphogenesis in neuron differentiation, dopaminergic neuron differentiation, motor neuron axon guidance, negative regulation of neuron differentiation, neuron development and differentiation, neuron fate determination and specification, neuron stem cell population maintenance, pyramidal neuron migration, positive regulation of neuron apoptotic process and differentiation, spinal cord development and ventral spinal cord interneuron specification. |
| V$NKXH | 0.0376881 | NKX homeodomain factors | Brain development, nervous system development, cerebral cortex, forebrain neuron differentiation, spinal cord motor neuron differentiation, motor neuron axon guidance, forebrain neuron fate commitment, negative regulation of neuron apoptotic process, and positive regulation of neuron differentiation, neuron fate commitment, neuron fate specification, neuron migration, spinal cord oligodendrocyte cell differentiation and fate specification, ventral spinal cord interneuron differentiation and fate determination. |
| V$IRXF | 0.0537914 | Iroquois homeobox transcription factors | CNS development, CNS neuron differentiation, negative regulation of neuron differentiation, neuron maturation, positive regulation of neuron differentiation and retinal bipolar neuron differentiation. |
| V$ATBF | 0.06225 | AT-binding transcription factor | Brain development, regulation of neuron differentiation. |
| V$ZICF | 0.0637499 | Members of ZIC-family, zinc finger protein of the cerebellum | Brain development, cell proliferation in forebrain, midbrain and hindbrain, CNS development, determination of right/left asymmetry in nervous system, forebrain morphogenesis, retinal ganglion cell axon guidance, spinal cord development. |
| V$ABDB | 0.0678214 | Abdominal-B type homeodomain transcription factors | Peripheral nervous system neuron development and spinal cord motor neuron cell fate specification. |
| V$SIXF | 0.0726783 | Sine oculis (SIX) homeodomain factors | Brain development, peripheral nervous system neuron development, generation of neurons, negative regulation of neuron apoptotic process and neuron differentiation, neuron fate specification. |
| V$BRNF | 0.0784254 | Brn POU domain factors | Brain development, CNS neuron differentiation, myelination in peripheral nervous system, peripheral nervous system neuron development and differentiation, forebrain neuron differentiation, neuron development, differentiation, neuron fate commitment and fate specification, neuron projection development and negative regulation of neuron apoptotic process. |
| V$HOXC | 0.0805504 | HOX-PBX complexes | Brain development and segmentation, CNS neuron differentiation, peripheral nervous system neuron development and differentiation, motor neuron axon guidance, positive & negative regulation of neuron differentiation, neuron development, differentiation and migration, dorsal spinal cord development and spinal cord motor neuron cell fate specification. |
| V$HBOX | 0.086678 | Homeobox transcription factors | Brain development, embryonic brain development, autonomic nervous system development, CNS development and neuron differentiation, cell morphogenesis involved in neuron differentiation, dopaminergic neuron differentiation, forebrain neuron development, motor neuron axon guidance, negative regulation of neuron death and neuron apoptotic process, neuron development, differentiation, death, fate commitment, migration, fate specification and projection morphogenesis, sensory neuron axon guidance, subpallium neuron fate commitment, spinal cord association neuron differentiation, spinal cord neuron cell fate specification, spinal cord motor neuron differentiation and spinal cord interneuron specification. |
| V$LHXF | 0.0890478 | Lim homeodomain factors | Brain development, CNS neuron development and differentiation, peripheral nervous system neuron axonogenesis and neuron development, GABAergic and dopaminergic neuron differentiation, forebrain neuron development, differentiation and fate commitment, lateral motor column neuron migration, medial motor column neuron differentiation, midbrain-hindbrain boundary development, motor neuron axon guidance, negative regulation of neuron apoptotic process and differentiation, neuron development, differentiation, fate commitment, fate specification, maturation and migration, positive regulation of Wnt-mediated midbrain dopaminergic neuron differentiation, spinal cord development, spinal cord association neuron differentiation, spinal cord motor neuron cell fate specification, spinal cord motor neuron differentiation, ventral spinal cord interneuron specification. and visceral motor neuron differentiation. |
| V$BRN5 | 0.0896913 | Brn-5 POU domain factors | Brain development, CNS development and ganglion mother cell fate determination. |
| V$DLXF | 0.095084 | Distal-less homeodomain transcription factors | Brain and nervous system development, neuron development and differentiation, negative regulation of neuron apoptotic process and regulation of transcription from RNA polymerase II promoter involved in forebrain neuron fate commitment. |
| V$HIFF | 0.0956065 | Hypoxia inducible factor, bHLH/PAS protein family | Brain development, CNS development, dopaminergic neuron differentiation, motor neuron axon guidance, negative regulation of oxidative-stress induced neuron intrinsic apoptotic signalling pathway and neuron apoptotic process. |
| V$TALE | 0.0960602 | TALE homeodomain class recognizing TG motifs | Brain morphogenesis, enteric nervous system development, neural crest cell migration involved in autonomic nervous system development and positive & negative regulation of neuron differentiation. |
| V$RORA | 0.0967563 | v-ERB and RAR-related orphan receptor alpha | Brain development. |
| V$HESF | 0.0988147 | Vertebrate homologues of enhancer of split complex | Brain development, CNS development, myelination in CNS, peripheral nervous system development, GABAergic neuron differentiation in basal ganglia, cell morphogenesis involved in neuron differentiation, positive & negative regulation of neuron differentiation, negative regulation of neuron projection development, neuron stem cell population maintenance, regulation of neuronal synaptic plasticity and regulation of timing of neuron differentiation. |
| V$HOXH | 0.0995363 | HOX-MEIS1 heterodimers | Enteric nervous system development, peripheral nervous system neuron development, neural crest cell migration involved in autonomic nervous system development, negative regulation of neuron differentiation and spinal cord motor neuron cell fate specification. |
| V$BCDF | 0.103333 | Bicoid-like homeodomain transcription factors | Brain development, CNS development, dopaminergic neuron differentiation, positive regulation of neuron apoptotic process, neuron development, neuron differentiation, neuron fate commitment, neuron fate determination and neuron proliferation in midbrain. |
| V$NKX6 | 0.105076 | NK6 homeobox transcription factors | CNS myelination and neuron differentiation, positive regulation of neuron differentiation, regulation of neuron migration, regulation of transcription from RNA polymerase II promoter involved in spinal cord motor neuron fate specification, spinal cord motor neuron differentiation and regulation of transcription from RNA polymerase II promoter involved in ventral spinal cord interneuron specification. |
| V$NEUR | 0.105134 | NeuroD, Beta2, HLH domain | Brain development, CNS development, CNS neuron development, enteric nervous system development, peripheral nervous system development, peripheral nervous system neuron development, sympathetic and parasympathetic nervous systems development, commitment of neuronal cell to specific neuron type in forebrain, dopaminergic neuron differentiation, forebrain neuron development and differentiation, generation of neurons, positive & negative regulation of neuron differentiation, noradrenergic neuron differentiation and neuron fate commitment, neuron development, neuron differentiation, neuron fate commitment, neuron fate determination, neuron migration, regulation of timing of subpallium neuron differentiation, spinal cord association neuron differentiation, spinal cord motor neuron cell fate specification, spinal cord motor neuron differentiation, spinal cord oligodendrocyte cell differentiation and fate specification, subpallium neuron fate commitment, sympathetic ganglion development, spinal reflex action, dorsal spinal cord development, spinal cord development, ventral spinal cord interneuron differentiation and fate commitment, vestibulocochlear nerve development and trigeminal nerve development. |
Figure 6The motifs identified by the MEME suite.
(A) The location of the top five motifs found by MEME (Bailey & Elkan, 1994) on the different species. The motifs are coloured differently; motif 1, red; motif 2, blue; motif 3, green; motif 4, yellow and motif 5, purple. (B) The top four motifs (motifs 1, 2, 3 and 4 from above) identified by MEME (Bailey & Elkan, 1994). Motif 5 is not shown because GOMo (Buske et al., 2010) could not predict any biological function that could be associated with the motif.