| Literature DB >> 30085012 |
Runsheng Li1, Xiaoliang Ren1, Yu Bi1, Qiutao Ding1, Vincy Wing Sze Ho1, Zhongying Zhao1,2.
Abstract
Mitochondrial genome (mtDNA) carries not only well-conserved protein coding, tRNA and rRNA genes, but also highly variable non-coding regions (NCRs). However, the NCRs show poor conservation across species, making their function and evolution elusive. Identification and functional characterization of NCRs across species would be critical for addressing these questions. To this end, we devised a computational pipeline and performed de novo assembly and annotation of mtDNA from 19 Caenorhabditis species using next-generation sequencing (NGS) data. The mtDNAs for 14 out of the 19 species are reported for the first time. Comparison of the 19 genomes reveals species-specific sampling of partial displacement-loop (D-loop) sequence as a novel NCR inserted into a unique tRNA cluster, suggesting an important role of the D-loop and the tRNA cluster in shaping NCR evolution. Intriguingly, RNA-Seq analysis suggests that a novel NCR resulting from a recent duplication of NADH dehydrogenase subunit 5 (ND5) could be utilized as a 3' UTR for up-regulation of its upstream gene. The expression analysis shows a species- and sex-specific expression of mitochondrial genes encoded by mtDNA and nucleus, respectively. Our analyses provide important insights into the function and evolution of mitochondrial NCRs and pave the way for further studying the function and evolution of mitochondrial genome.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30085012 PMCID: PMC6289777 DOI: 10.1093/dnares/dsy026
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Statistics of the mtDNA sequences in Caenorhabditis species and P. pacificus
| Species | Strain | Accessiono. | Size (bp) | Status | NCR1 | NCR2 | NCR3 | NCR4 |
|---|---|---|---|---|---|---|---|---|
| Newly assembled | ||||||||
| JU1286 | KY552909 | 14245 | Circular | 664 | 216 | 110 | – | |
| PS1010 | KY552902 | 13596 | Partial | 264 | – | 96 | – | |
| PB2801 | KY552900 | 14212 | Circular | 672 | 200 | 112 | – | |
| JU1956 | KY552910 | 13655 | Circular | 310 | – | 96 | – | |
| JU1771 | KY552904 | 13719 | Circular | 385 | – | 108 | – | |
| DF5081 | MF370516 | 13988 | Circular | 643 | – | 109 | – | |
| JU2083 | KY552908 | 13633 | Circular | 298 | – | 106 | – | |
| JU2079 | KY552907 | 13786 | Circular | 443 | – | 104 | – | |
| SB355 | KY552901 | 14248 | Circular | 929 | – | 101 | – | |
| PB4641 | KY552899 | 13610 | Partial | 291 | – | 107 | – | |
| JU2809 | KY552911 | 13762 | Circular | 428 | – | 100 | – | |
| JU1667 | KY552903 | 13350 | Partial | 62 | – | 106 | – | |
| JU1968 | KY552905 | 13661 | Partial | 344 | – | 99 | – | |
| JU1898 | KY552906 | 13705 | Circular | 372 | – | 109 | – | |
| JU1421 | KP259621 | 13856 | Circular | 518 | – | 109 | – | |
| AF16 | NC_009885 | 14420 | Circular | 548 | 222 | 109 | 324 | |
| N2 | NC_001328 | 13794 | Circular | 466 | – | 109 | – | |
| JU727 | EU407780 | 13886 | Circular | 383 | 234 | 109 | – | |
| JU1836 | KM403565 | 13874 | Circular | 410 | 130 | 107 | – | |
| PS312 | NC_015245 | 15954 | Circular | 2222 | 425 | 51 | – | |
Circular/complete mtDNA can be achieved during mtDNA assembly.
Size in bp for NCR1-4.
Located in a different tRNA cluster from NCR2 in Caenorhabditis species (Fig. 1C).
Previous mtDNA was partial and was rendered circular by Nanopore sequencing.
Figure 1.Species-specific sampling of ND5 or non-conserved part of D-loop (also called NCR-1) as a novel NCR2. (A) A phylogenetic tree of Caenorhabditis species inferred from the sequences of 12 mitochondrial proteins. Scale of branch length is indicated on the left. Tree was constructed with ML methods implemented in raxML using the concatenated 12 protein sequences from 19 Caenorhabditis species with P. pacifics as an outgroup. The bootstrap support values in percent from 1000 repeats are indicated on branch node. Five species that contain an NCR2 are highlighted in green. Elegans and Drosophilae supergroup are differentially shaded. The mtDNAs that were previously assembled are indicated with ‘*’. (B) Diagrams showing the origins of NCR2 or 4 (green ribbons) in the mtDNA of C. briggsae, respectively. Sampled sequence size and relative position are shown in scale. (C) Diagrams showing the origins of NCR2 (grey ribbons) in the mtDNA of C. sinica, C. tropicalis, C. brenneri, C. afra and P. pacificus. Mitochondrial genes are differentially colour coded based on the MRCs they are affiliated with as indicated on the top left. Consensus size in kb is indicated starting from the Phenylalanine tRNA gene (P). NCR, NCR. Note, only C. briggsae contains an NCR4 located in the boundary between ND3 and ND5. All six species carry an NCR2 located in a single tRNA gene cluster. The NCR1 and 3 are shared in all Caenorhabditis species.
Figure 2.Sequence conservation of NCRs in Caenorhabditis species. (A) Alignment of NCR1 sequences from the conserved 3' part of NCR1 in 18 Caenorhabditis species whose NCR1 sequence is over 200 bps in length. (B) Alignment of NCR3 full-length sequences from 19 Caenorhabditis species and an NCR from outgroup species P. pacificus. (C) Sequence alignment of NCR2s between five Caenorhabditis species. Note the sequences are barely alignable.
Figure 3.Association of the elevated expression of a recent ND5 duplication, i.e. NCR4 with up-regulation of ND3 expression in C. briggsae. (A) RNA-Seq tracks showing the read coverage (vertical axis) aggregated from both sexes of C. nigoni (top) and C. briggsae (bottom) along their respective reference genomes (two horizontal bars in the middle). Peaks and genes are colour coded based on their origin of respective respiratory chain complex and two C. briggsae unique NCRs are coloured in black (highlighted in box). Note the up-regulated expression of ND3 in C. briggsae versus that in its closest relative C. nigoni is highlighted with a red box. ND3 expression is significantly lower in C. nigoni. (B) Heatmap showing the normalized expression of mtDNA-encoded proteins with names indicated on the left. C. briggsae-specific higher expression of ND3 is highlighted in red boxes. Note that relatively low expression of various ND proteins versus that of other mitochondrial proteins in Caenorhabditis species. (C) Sequence alignment for RNA-Seq reads against a mitochondrial genomic region spanning ND3, NCR4 and ND5 (shown as thick bars on the top). Alignments of RNA-Seq reads are shown at the bottom with poly-A sequences coloured in green. Note that the most reads are continuous between the end of ND3 and NCR4 followed by poly-A tailing (shown in green on the right); whereas only a small portion of ND3 reads are poly-A tailed at its own end (green on the left). The NCR4 contain multiple stop codons which was annotated a pseudogene of ND5 (not shown).
Figure 4.RNA-Seq tracks showing the coverage of reads (vertical axis) derived from 12 Caenorhabditis species along their respective reference genomes (horizontal bars). Peaks are colour-coded as indicated in Fig. 3. Note the apparent expression of NCR2 (highlighted in black line) in Caenorhabditis species carrying an NCR2 except C. tropicalis.