| Literature DB >> 27994980 |
Siqi Yuan1, Yun Xia2, Yuchi Zheng2, Xiaomao Zeng2.
Abstract
Recent improvements in next-generation sequencing (NGS) technologies can facilitate the obtainment of mitochondrial genomes. However, it is not clear whether NGS could be effectively used to reconstruct the mitogenome with high gene rearrangement. These high rearrangements would cause amplification failure, and/or assembly and alignment errors. Here, we choose two frogs with rearranged gene order, Amolops chunganensis and Quasipaa boulengeri, to test whether gene rearrangements affect the mitogenome assembly and alignment by using NGS. The mitogenomes with gene rearrangements are sequenced through Illumina MiSeq genomic sequencing and assembled effectively by Trinity v2.1.0 and SOAPdenovo2. Gene order and contents in the mitogenome of A. chunganensis and Q. boulengeri are typical neobatrachian pattern except for rearrangements at the position of "WANCY" tRNA genes cluster. Further, the mitogenome of Q. boulengeri is characterized with a tandem duplication of trnM. Moreover, we utilize 13 protein-coding genes of A. chunganensis, Q. boulengeri and other neobatrachians to reconstruct the phylogenetic tree for evaluating mitochondrial sequence authenticity of A. chunganensis and Q. boulengeri. In this work, we provide nearly complete mitochondrial genomes of A. chunganensis and Q. boulengeri.Entities:
Keywords: Gene rearrangement; Illumina sequencing; Mitogenome; Neobatrachian; WANCY
Year: 2016 PMID: 27994980 PMCID: PMC5162401 DOI: 10.7717/peerj.2786
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Statistics and assembled result of SOAP denovo2 and Trinity.
| Species | Assembly method | Total number of contigs | Total number of length | N50 | Max contig length |
|---|---|---|---|---|---|
| K50 | 6,341,679 | 1,860,878,175 | 300 | 2,615 | |
| K55 | 6,086,855 | 1,875,157,178 | 314 | 2,880 | |
| K60 | 5,715,340 | 1,870,183,020 | 332 | 3,183 | |
| K65 | 5,460,129 | 1,843,817,246 | 338 | 2,817 | |
| K70 | 5,122,048 | 1,779,429,007 | 332 | 2,980 | |
| K75 | 4,926,588 | 1,723,696,265 | 318 | 2,779 | |
| K80 | 4,686,214 | 1,630,358,728 | 300 | 2,830 | |
| Trinity | 1,555,983 | 755,862,296 | 526 | 16,672 | |
| K50 | 5,072,290 | 1,417,075,271 | 300 | 2,537 | |
| K55 | 4,955,300 | 1,436,818,547 | 300 | 2,741 | |
| K60 | 4,746,605 | 1,436,171,325 | 300 | 2,416 | |
| K65 | 4,549,311 | 1,407,207,578 | 300 | 2,513 | |
| K70 | 4,301,826 | 1,350,614,260 | 300 | 2,527 | |
| K75 | 4,167,451 | 1,308,301,284 | 300 | 2,578 | |
| K80 | 4,007,361 | 1,244,363,080 | 300 | 4,366 | |
| Trinity | 1,017,500 | 448,075,968 | 469 | 16,795 |
Notes.
The k-mer sizes used in SOAP denovo2.
Figure 1Annotation and map of Amolops chunganensis and Quasipaa boulengeri.
Annotation of Amolops chunganensis (A) and Quasipaa boulengeri (B) mitogenome. PCGs are colored in red, tRNA-coding genes are in blue, rrnL and rrnS are in green. A pseudogene of trnK is located between cox2 and atp8 genes. Each gene is shown as an arrow indicating the transcription direction. The arrows on top of the black line correspond to genes coded on the H-strand, and those below show genes on the L-strand.
Figure 2The Bayesian Inference (BI) and Maximum Likelyhood (ML) tree (combined 13 PCGs, 11,502 bp).
The Bayesian Inference (BI) and Maximum Likelihood (ML) tree based on nucleotide sequences of the combined 13 protein-coding genes (11,502 in size). For the BI tree, the GTR + I + G model was selected, and two independent runs were performed for 1,000,000 generations with sampling frequency 0.001. The GenBank accession numbers of all species are shown. Numbers beside the nodes are Bayesian posterior probabilities and ML Bootstrap, respectively (showed in BPP/BS). between cox2 and atp8 genes. Each gene is shown as an arrow indicating the transcription direction. The arrows on top of the black line correspond to genes coded on the H-strand, and those below show genes on the L-strand.