| Literature DB >> 31346443 |
Zhichao Zhang1,2, Qiqun Cheng1, Yushuang Ge1,3.
Abstract
Rhynchocypris oxycephalus (Teleostei: Cyprinidae) is a typical small cold water fish, which is distributed widely and mainly inhabits in East Asia. Here, we sequenced and determined the complete mitochondrial genome of R. oxycephalus and studied its phylogenetic implication. R. oxycephalus mitogenome is 16,609 bp in length (GenBank accession no.: MH885043), and it contains 13 protein-coding genes (PCGs), two rRNA genes, 22 tRNA genes, and two noncoding regions (the control region and the putative origin of light-strand replication). 12 PCGs started with ATG, while COI used GTG as the start codon. The secondary structure of tRNA-Ser (AGN) lacks the dihydrouracil (DHU) arm. The control region is 943bp in length, with a termination-associated sequence, six conserved sequence blocks (CSB-1, CSB-2, CSB-3, CSB-D, CSB-E, CSB-F), and a repetitive sequence. Phylogenetic analysis was performed with maximum likelihood and Bayesian methods based on the concatenated nucleotide sequence of 13 PCGs and the complete sequence without control region, and the result revealed that the relationship between R. oxycephalus and R. percnurus is closest, while the relationship with R. kumgangensis is farthest. The genus Rhynchocypris is revealed as a polyphyletic group, and R. kumgangensis had distant relationship with other Rhynchocypris species. In addition, COI and ND2 genes are considered as the fittest DNA barcoding gene in genus Rhynchocypris. This work provides additional molecular information for studying R. oxycephalus conservation genetics and evolutionary relationships.Entities:
Keywords: DNA barcoding; Rhynchocypris oxycephalus; conservation genetics; mitochondrial genome; phylogenetic analysis
Year: 2019 PMID: 31346443 PMCID: PMC6635945 DOI: 10.1002/ece3.5369
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Photograph of Rhynchocypris oxycephalus
The 16 primer combinations for amplifying the complete mitochondrial DNA of Rhynchocypris oxycephalus
| Primer name | Primer sequence(5'–3') |
|---|---|
| Rhynchocypris‐1F | GACGAGGAGCGGGCATCAGG |
| Rhynchocypris‐1R | CGGGGTATCAAACTAAAGGTC |
| Rhynchocypris‐2F | CCAACACCACAAACTAAACCAT |
| Rhynchocypris‐2R | TCTAGCCATTCATACAGGTCTCT |
| Rhynchocypris‐3F | CAACGAACCAAGTTACCCAAG |
| Rhynchocypris‐3R | GTGCCCAAAAATAGTACGACTG |
| Rhynchocypris‐4newF | AACCTGTTCGCCCCTCTACCT |
| Rhynchocypris‐4newR | GGCAAGGAAGGCTGCGGATGT |
| Rhynchocypris‐5F | CCTCTTAACGGCCTTTGGACT |
| Rhynchocypris‐5R | TTCCAAACCCTCCAATAAGAA |
| Rhynchocypris‐6F | GTGACAGCCGTCCTTCTCCTC |
| Rhynchocypris‐6R | GTAAGTTTGGTTGAGACTATCGC |
| Rhynchocypris‐7NEWF | ACCCCTGTATGTCTTGAGCTC |
| Rhynchocypris‐7NEWR | ATTAGTTGATTGGTAAATCGGTTC |
| Rhynchocypris‐8F | ATAARACTGACTCCTGAACCTGA |
| Rhynchocypris‐8R | GCCTGGAGAGCGGTAAAATAA |
| Rhynchocypris‐9F | AGGAGTTATTACGCTGGACCC |
| Rhynchocypris‐9R | GTTRAGGTTTTGTAGGCGGTC |
| Rhynchocypris‐10NEWF | GGTTAGCATTTCATCGCACACA |
| Rhynchocypris‐10NEWR | TGGGTTCGTTCATAGGCTGT |
| Rhynchocypris‐11F | TGCCTACGACAAACAGACCTTA |
| Rhynchocypris‐11R | GTGTAATCATGGCTACCAAGAA |
| Rhynchocypris‐12F | GCGTTCGACACAAACATTAGCT |
| Rhynchocypris‐12R | AATGGATTGTCCTCGCTGAT |
| Rhynchocypris‐13F | TRGCACTGACAGGCACCCCAT |
| Rhynchocypris‐13R | GTTYTAATTGTGGGTTTAATTGCT |
| Rhynchocypris‐14F | AAAGRACGAGGGATAAGAAGGA |
| Rhynchocypris‐14R | CCCTGTCTCGTGTAGAAAGAGCA |
| Rhynchocypris‐15F | AGACCTCCTTGGCTTTGTAGTA |
| Rhynchocypris‐15R | TGTTGGGTAACGAGGAGTATG |
| Rhynchocypris‐16F | ATGATAGAACCAGGGACACAAT |
| Rhynchocypris‐16NEWR | TATTGCTCCTCCTAACCACCC |
Figure 2Gene map of the Rhynchocypris oxycephalus mitochondrial genome
Characteristics of the mitochondrial genome of Rhynchocypris oxycephalus
| Feature | Length/bp | Position | Start codon | Stop codon | Anticodon | Intergenic nucleotide | Number of amino acid | Strand |
|---|---|---|---|---|---|---|---|---|
| tRNAPhe | 69 | 1–69 | GAA | 0 | H | |||
| 12S rRNA | 957 | 70–1026 | 0 | H | ||||
| tRNAVal | 72 | 1,027–1098 | TAC | 0 | H | |||
| 16S rRNA | 1690 | 1,099–2788 | 0 | H | ||||
| tRNALeu | 76 | 2,789–2864 | TAA | 1 | H | |||
| ND1 | 975 | 2,866–3840 | ATG | TAA | 4 | 324 | H | |
| tRNAIle | 72 | 3,845–3916 | GAT | −2 | H | |||
| tRNAGln | 71 | 3,915–3985 | TTG | 1 | L | |||
| tRNAMet | 69 | 3,987–4055 | CAT | 0 | H | |||
| ND2 | 1,045 | 4,056–5100 | ATG | T | 0 | 348 | H | |
| tRNATrp | 71 | 5,101–5171 | TCA | 1 | H | |||
| tRNAAla | 69 | 5,173–5241 | TGC | 1 | L | |||
| tRNAAsn | 73 | 5,243–5315 | GTT | 0 | L | |||
| OL | 36 | 5,316–5351 | −3 | L | ||||
| tRNACys | 68 | 5,349–5416 | GCA | 1 | L | |||
| tRNATyr | 71 | 5,418–5488 | GTA | 1 | L | |||
| COI | 1551 | 5,490–7040 | GTG | TAA | 0 | 516 | H | |
| tRNASer | 71 | 7,041–7111 | TGA | 3 | L | |||
| tRNAAsp | 74 | 7,115–7188 | GTC | 13 | H | |||
| COII | 691 | 7,202–7892 | ATG | T | 0 | 230 | H | |
| tRNALys | 76 | 7,893–7968 | TTT | 1 | H | |||
| ATP8 | 165 | 7,970–8134 | ATG | TAG | −7 | 54 | H | |
| ATP6 | 684 | 8,128–8811 | ATG | TAA | −1 | 227 | H | |
| COIII | 784 | 8,811–9594 | ATG | T | 0 | 261 | H | |
| tRNAGly | 71 | 9,595–9665 | TCC | 0 | H | |||
| ND3 | 349 | 9,666–10014 | ATG | T | 0 | 116 | H | |
| tRNAArg | 69 | 10,015–10083 | TCG | 0 | H | |||
| ND4L | 297 | 10,084–10380 | ATG | TAA | 0 | 98 | H | |
| ND4 | 1,382 | 10,374–11755 | ATG | TA | −7 | 460 | H | |
| tRNAHis | 69 | 11,756–11824 | GTG | 0 | H | |||
| tRNaSer | 68 | 11,825–11892 | GTC | 0 | H | |||
| tRNALeu | 73 | 11,894–11966 | TAG | 1 | H | |||
| ND5 | 1836 | 11,967–13802 | ATG | TAA | 0 | 611 | H | |
| ND6 | 522 | 13,799–14320 | ATG | TAA | −4 | 173 | L | |
| tRNAGlu | 68 | 14,321–14388 | TTC | 0 | L | |||
| Cytb | 1,141 | 14,391–15531 | ATG | T | 2 | 380 | H | |
| tRNAThr | 72 | 15,532–15603 | TGT | 0 | H | |||
| tRNAPro | 71 | 15,603–15673 | TGG | −1 | L | |||
| D‐loop | 936 | 15,674–16609 | 0 | H |
Codon usage in Rhynchocypris oxycephalus mitochondrial protein‐coding genes
| Codon | Count | RSCU | Codon | Count | RSCU | Codon | Count | RSCU | Codon | Count | RSCU |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| UCU(S) | 44 | 1.07 |
|
|
| UGU(C) | 14 | 1 |
| UUC(F) | 112 | 1 | UCC(S) | 58 | 1.41 | UAC(Y) | 37 | 0.65 | UGC(C) | 14 | 1 |
| UUA(L) | 137 | 1.33 |
|
|
| UAA( | 6 | 3.43 |
|
|
|
| UUG(L) | 31 | 0.3 | UCG(S) | 13 | 0.32 | UAG( | 1 | 0.57 | UGG(W) | 25 | 0.42 |
| CUU(L) | 142 | 1.37 | CCU(P) | 43 | 0.8 | CAU(H) | 38 | 0.75 | CGU(R) | 15 | 0.79 |
| CUC(L) | 88 | 0.85 | CCC(P) | 70 | 1.31 |
|
|
| CGC(R) | 16 | 0.84 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| CUG(L) | 54 | 0.52 | CCG(P) | 24 | 0.45 | CAG(Q) | 17 | 0.34 | CGG(R) | 8 | 0.42 |
|
|
|
| ACU(T) | 41 | 0.55 |
|
|
| AGU(S) | 14 | 0.34 |
| AUC(I) | 78 | 0.59 | ACC(T) | 111 | 1.5 | AAC(N) | 50 | 0.89 |
|
|
|
|
|
|
|
|
|
|
|
|
| AGA( | 0 | 0 |
| AUG(M) | 59 | 0.64 | ACG(T) | 23 | 0.31 | AAG(K) | 19 | 0.49 | AGG( | 0 | 0 |
| GUU(V) | 79 | 1.25 | GCU(A) | 62 | 0.72 | GAU(D) | 27 | 0.7 | GGU(G) | 34 | 0.55 |
| GUC(V) | 38 | 0.6 |
|
|
|
|
|
| GGC(G) | 71 | 1.15 |
|
|
|
| GCA(A) | 109 | 1.27 |
|
|
|
|
|
|
| GUG(V) | 39 | 0.62 | GCG(A) | 28 | 0.33 | GAG(E) | 27 | 0.55 | GGG(G) | 58 | 0.94 |
Stop codons; the letters in brackets are abbreviations of each amino acid, preferred codons in bold.
Figure 3Codon distribution in Rhynchocypris oxycephalus. CDspT‐codons per thousand codons
Figure 4The secondary structures of the tRNA‐Ser(AGY) genes in Rhynchocypris oxycephalus
Figure 5Schematic map characterizing of the control region of Rhynchocypris oxycephalus. ETAS‐extended termination‐associated sequence, CSB‐conserved sequence blocks
Figure 6The secondary structures of the putative origin of light‐strand replication gene in Rhynchocypris oxycephalus
Mitochondrial genome of the Rhynchocypris species used in this study
| Species | Length (bp) | A + T % | AT‐skew | GC‐skew | Accession number | Reference |
|---|---|---|---|---|---|---|
|
| 16,609 | 56.0 | 0.030 | 0.196 |
| This study |
|
| 16,608 | 55.8 | 0.034 | 0.204 |
| Unpublished |
|
| 16,603 | 55.7 | 0.039 | 0.205 |
| Unpublished |
|
| 16,602 | 57.9 | 0.030 | 0.200 |
| Saitoh et al. ( |
|
| 16,599 | 58.0 | 0.033 | 0.214 |
| Imoto et al. ( |
|
| 16,604 | 54.5 | 0.038 | 0.221 |
| Yun, Yu, Kim, and Kwak ( |
|
| 16,605 | 55.7 | 0.025 | 0.193 |
| Yu, Kim, and Kim ( |
|
| 16,607 | 55.8 | 0.025 | 0.191 |
| Imoto et al. ( |
Some feature of mitochondrial genomes and different regions of 8 Rhynchocypris species
| Length of consensus sequence | Amount of variable sites | Kimura 2‐Parameter distance | Ts/Tv ratios | Base composition | ||||
|---|---|---|---|---|---|---|---|---|
| T | C | A | G | |||||
| Mitochondrion genome | 15,684 | 3,417 (21.8%) | 0.097 | 4.72 | 26.9 | 26.6 | 28.8 | 17.6 |
| ND1 | 975 | 297 (30.5%) | 0.141 | 4.18 | 29.8 | 27.7 | 25.7 | 16.8 |
| ND2 | 1,047 | 360 (34.4%) | 0.163 | 4.13 | 26.0 | 31.1 | 27.6 | 15.4 |
| COI | 1,551 | 314 (20.2%) | 0.088 | 5.91 | 31.1 | 24.8 | 25.8 | 18.3 |
| COII | 691 | 139 (20.1%) | 0.086 | 7.14 | 28.5 | 25.2 | 29.1 | 17.2 |
| ATP8 | 165 | 40 (24.2%) | 0.107 | 7.34 | 23.9 | 29.0 | 34.8 | 12.3 |
| ATP6 | 684 | 181 (26.5%) | 0.117 | 4.09 | 31.2 | 26.2 | 28.1 | 14.6 |
| COIII | 785 | 150 (19.1%) | 0.090 | 7.01 | 29.7 | 26.8 | 25.5 | 18.0 |
| ND3 | 351 | 93 (26.5%) | 0.116 | 6.32 | 29.3 | 28.4 | 26.0 | 16.3 |
| ND4L | 297 | 60 (20.2%) | 0.086 | 5.65 | 28.0 | 29.9 | 26.4 | 15.7 |
| ND4 | 1,383 | 395 (28.6%) | 0.139 | 4.69 | 29.1 | 27.3 | 28.0 | 15.7 |
| ND5 | 1,839 | 555 (30.2%) | 0.145 | 4.51 | 28.8 | 27.8 | 28.5 | 14.9 |
| ND6 | 522 | 166 (31.8%) | 0.147 | 6.37 | 38.3 | 14.4 | 16.0 | 31.2 |
| Cytb | 1,141 | 275 (24.1%) | 0.112 | 3.93 | 30.2 | 27.0 | 26.8 | 16.0 |
| 12S rRNa | 957 | 71 (7.4%) | 0.029 | 4.94 | 19.5 | 26.3 | 30.2 | 24.0 |
| 16S rRNA | 1,693 | 177 (10.5%) | 0.040 | 2.78 | 21.1 | 23.1 | 34.4 | 21.3 |
| Combined sequences of tRNA genes | 2,650 | 249 (9.4%) | 0.036 | 3.23 | 20.5 | 24.3 | 32.9 | 22.3 |
| Combined sequences of rRNA genes | 1,566 | 124 (8.0%) | 0.028 | 7.12 | 26.6 | 21.1 | 28.6 | 23.7 |
Figure 7(a) Saturation plot for the substitutions of 13 protein‐coding genes; (b) saturation plot for the substitutions of the complete mitochondrial genome (excepted D‐loop)
Figure 8(a) The maximum likelihood analyses of phylogenetic relationship based on 12 PCGs (excepted ND6) of 8 Rhynchocypris species. (b) The maximum likelihood analyses of phylogenetic relationship based on 13 PCGs of 8 Rhynchocypris species. Acrossocheilus fasciatus was selected as outgroup to root the tree in both (a) and (b)
Mitochondrial genome of the Leuciscus species used in this study
| Genus | Species | Length (bp) | Accession number | Reference |
|---|---|---|---|---|
|
|
| 16,604 |
| Saitoh et al. ( |
|
|
| 16,602 |
| Imoto et al. ( |
|
| 16,598 |
| Imoto et al. ( | |
|
|
| 17,859 |
| Imoto et al. ( |
|
| 17,738 |
| Xu et al. (2013) | |
|
|
| 16,602 |
| Imoto et al. ( |
|
|
| 16,607 |
| Hinsinger et al. ( |
|
| 16,605 |
| Wang et al. ( | |
|
|
| 16,589 |
| Cheng et al. ( |
Figure 9(a) The phylogenetic relationship among 17 Leuciscus fishes based on the complete mitochondrial genome (excepted D‐loop) from maximum likelihood analyses. The bootstrap support values are shown above the branches. (b) The phylogenetic relationship among 17 Leuciscus species based on the complete mitochondrial genome (excepted D‐loop) from Bayesian analyses. Acrossocheilus fasciatus was selected as outgroup to root the tree in both (a) and (b)
Figure 10Kimura 2‐parameter distance among 6 selected protein‐coding genes. (a) interspecies (b) intraspecies
(a) The Wilcoxon test of interspecific divergence among Rhynchocypris species; (b) The Wilcoxon test of intraspecific divergence among Rhynchocypris species
| W+ | W− | Relative ranks, | Result |
|---|---|---|---|
| (a) | |||
| COI | COIII | W+ = 6,698, W− = 19,408, | COI < COIII |
| COI | ND2 | W+ = 0, W− = 26,106, | COI < ND2 |
| COI | ND4 | W+ = 0, W− = 26,106, | COI < ND4 |
| COI | ND5 | W+ = 0, W− = 26,106, | COI < ND5 |
| COI | Cytb | W+ = 399, W− = 25,707, | COI < Cytb |
| COIII | ND2 | W+ = 141, W− = 25,965, | COIII < ND2 |
| COIII | ND4 | W+ = 607, W− = 25,499, | COIII < ND4 |
| COIII | ND5 | W+ = 722, W− = 25,384, | COIII < ND5 |
| COIII | Cytb | W+ = 5,162, W− = 20,944, | COIII < Cytb |
| ND2 | ND4 | W+ = 25,965, W− = 141, | ND2 > ND4 |
| ND2 | ND5 | W+ = 25,808, W− = 198, | ND2 > ND5 |
| ND2 | Cytb | W+ = 24,235, W− = 1871, | ND2 > Cytb |
| ND4 | ND5 | W+ = 14,286, W− = 11,820, | ND4 = ND5 |
| ND4 | Cytb | W+ = 3,852, W− = 22,254, | ND4 > Cytb |
| ND5 | Cytb | W+ = 3,827, W− = 22,279, | ND5 > Cytb |
| (b) | |||
| COI | COIII | W+ = 48, W− = 142, | COI = COIII |
| COI | ND2 | W+ = 0, W− = 153, | COI < ND2 |
| COI | ND4 | W+ = 0, W− = 153, | COI < ND4 |
| COI | ND5 | W+ = 0, W− = 153, | COI < ND5 |
| COI | Cytb | W+ = 10, W− = 180, | COI < Cytb |
| COIII | ND2 | W+ = 22, W− = 168, | COIII < ND2 |
| COIII | ND4 | W+ = 34, W− = 156, | COIII < ND4 |
| COIII | ND5 | W+ = 37, W− = 153, | COIII < ND5 |
| COIII | Cytb | W+ = 26, W− = 184, | COIII < Cytb |
| ND2 | ND4 | W+ = 150, W− = 3, | ND2 > ND4 |
| ND2 | ND5 | W+ = 150, W− = 3, | ND2 > ND5 |
| ND2 | Cytb | W+ = 112, W− = 78, | ND2 = Cytb |
| ND4 | ND5 | W+ = 150, W− = 34, | ND4 > ND5 |
| ND4 | Cytb | W+ = 72, W− = 118, | ND4 = Cytb |
| ND5 | Cytb | W+ = 53, W− = 137, | ND5 = Cytb |
Figure 11Distribution of interspecific and intraspecific variations of Rhynchocypris species. (a) COI sequence; (b) COIII sequence; (c) Cytb sequence; (d) ND2 sequence; (e) ND4 sequence; (f) ND5 sequence