| Literature DB >> 31396450 |
An Huang1,2, Shuo Liu3, Haijun Li1,2, Hongdi Luo1,2, Qingyong Ni1,2, Yongfang Yao4, Huailiang Xu4, Bo Zeng1,2, Ying Li1,2, Zhimin Wei5, Song Li3, Mingwang Zhang1,2.
Abstract
The mitochondrial genome (mitogenome) sequence of the tree frog Polypedates megacephalus (16,473 bp) was previously reported as having the unusual characteristic of lacking the ND5 gene. In this study, a new mitogenome of P. megacephalus (19,952 bp) was resequenced using the next-generation sequencing (NGS) and standard Sanger sequencing technologies. It was discovered that the ND5 gene was not lost but translocated to the control region (CR) from its canonical location between the ND4 and ND6 genes. In addition, a duplicated control region was found in the new mitogenome of this species. Conservative region identification of the ND5 gene and phylogenetic analysis confirmed that the ND5 gene was located between two control regions. The phylogenetic relationship among 20 related species of anura revealed a rearrangement of the ND5 gene during the evolutionary process. These results also highlighted the advantages of next-generation sequencing. It will not only decrease the time and cost of sequencing, but also will eliminate the errors in published mitogenome databases.Entities:
Keywords: Mitochondrial gene rearrangement; ND5; Phylogenetic relationships; Polypedates megacephalus
Year: 2019 PMID: 31396450 PMCID: PMC6679912 DOI: 10.7717/peerj.7415
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Sanger sequencing and next-generation sequencing coverage of P. megacephalus mitogenome.
(A) The Illumina reads coverage of the mitogenome represented by sequencing depth. The reads of CR2 were mapped to CR1 as they are similar and caused the NGS results could not align into a complete genome. The red region shows the coverage of ND5 gene is similar to the average coverage on rest of the mitogenome. (B) Schematic of fragment sizes based upon the sequence between COII and ATP6, and the presence of a single mitochondrial control region. The difference between this study and previous study is highlighted by dashed box. (C) A representative agarose gel of LA-PCR products sizes. Single letters C and M represent control and DNA marker, respectively.
PCR primer pairs, sequences used in this study.
| F1 | COIIF | GACTCACTCAAGCGTCTATTC | 7734 |
| ATP6R | TGTGGGCGGGTTTATT | 9012 | |
| F2 | CytbFow1 | GTYCTMCCNTGRGGHCAAATATCHTTYTG | 13510 |
| FND512800H | CCTATTTTDCGRATRTCYTGYTC | 16867 | |
| F3 | ND5F2 | CTCACCCCTCTATTACGACTT | 15877 |
| R16M1 | GGGTATCTAATCCCAGTTTG | 701 |
Notes.
Retrieved from this study.
Sano et al. (2005).
Zhang et al. (2013).
Sano et al. (2004).
Figure 2Mitochondrial gene organization of P. megacephalus.
Genes encoded by the L-strand are on the inner side. O represents the replication original area of L-strand.
Figure 3Evidences of the existence of ND5 gene in the mitogenome of P. megacephalus.
(A) Comparison of amino acids of the ND5 with the corresponding amino acid sequences of seven other Rhacophoridae species. The highlighted section in yellow indicates an amino acid conserved in all eight sequences; an arrow indicates amino acids conserved in 7/8 sequences, and dashes indicate indels. The functionally conserved regions of ND5 proteins are boxed. (B) Phylogenetic results of BI and ML analysis among 24 related frogs using single ND5 gene. NCBI accession number: P. leucomystax (MK622898), P. impresus (MK622901), P. braueri (MK6 87567), P. mutus (MK622900).
Figure 4Structures of P. megacephalus control regions.
(A) These regions consist of tandem repeat units, termination-associated sequences (TAS), H-stand origin of replication (O) and conserve sequence block 1, 2, 3 (CSB1, CSB2, CSB3). (B) The putative secondary structures of the replication original area of L-strand. The 5′-GCCGG-3′ sequence motif as the base of the stem within the tRNA. (C) Nucleotide sequences of conserved elements.
Figure 5Phylogenetic tree constructed based on the nucleotide dataset of 12 PCGs and 2 rRNAs using Bayesian inference and ML method.
The PCGs are shown in abbreviations. tRNA genes are represented by the standard single amino acid code, the rearranged ND5 and CR are in shed boxes.