| Literature DB >> 32962212 |
Chen Wang1, Hao Chen2, Silin Tian1, Cheng Yang1, Xiao Chen1,3.
Abstract
Cynoglossus monopus, a small benthic fish, belongs to the Cynoglossidae, Pleuronectiformes. It was rarely studied due to its low abundance and cryptical lifestyle. In order to understand the mitochondrial genome and the phylogeny in Cynoglossidae, the complete mitogenome of C. monopus has been sequenced and analyzed for the first time. The total length is 16,425 bp, typically containing 37 genes with novel gene rearrangements. The tRNA-Gln gene is inverted from the light to the heavy strand and translocated from the downstream of tRNA-Ile gene to its upstream. The control region (CR) translocated downstream to the 3'-end of ND1 gene adjoining to inverted to tRNA-Gln and left a 24 bp trace fragment in the original position. The phylogenetic trees were reconstructed by Bayesian inference (BI) and maximum likelihood (ML) methods based on the mitogenomic data of 32 tonguefish species and two outgroups. The results support the idea that Cynoglossidae is a monophyletic group and indicate that C. monopus has the closest phylogenetic relationship with C. puncticeps. By combining fossil records and mitogenome data, the time-calibrated evolutionary tree of families Cynoglossidae and Soleidae was firstly presented, and it was indicated that Cynoglossidae and Soleidae were differentiated from each other during Paleogene, and the evolutionary process of family Cynoglossidae covered the Quaternary, Neogene and Paleogene periods.Entities:
Keywords: Cynoglossus monopus; divergence times; intramitochondrial recombination; mitochondrial genome; novel rearrangement; phylogenetic analyses
Mesh:
Substances:
Year: 2020 PMID: 32962212 PMCID: PMC7555148 DOI: 10.3390/ijms21186895
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Details of species and mitogenomes of Pleuronectiformes used in this study.
| Family | Genus | Species | Length (bp) | Accession ID | Reference |
|---|---|---|---|---|---|
| Cynoglossidae | Cynoglossus |
| 16,417 | JQ349004 | [ |
|
| 16,454 | JQ349000 | [ | ||
|
| 16,565 | KT809367 | [ | ||
|
| 17,262 | LC482306 | [ | ||
|
| 16,915 | JQ639062 | Unpublished | ||
|
| 16,428 | NC030256 | [ | ||
|
| 16,425 | MT798589 | This study | ||
|
| 17,130 | MT117229 | [ | ||
|
| 17,142 | JQ349003 | [ | ||
|
| 16,720 | LC482305 | [ | ||
|
| 16,565 | MN966658 | [ | ||
|
| 16,731 | EU366230 | [ | ||
|
| 16,519 | MH709122 | [ | ||
|
| 16,478 | JQ348998 | [ | ||
|
| 18,369 | KP057581 | [ | ||
|
| 16,569 | KJ433559 | [ | ||
| Paraplagusia |
| 16,985 | NC023227 | Unpublished | |
|
| 16,611 | JQ349002 | [ | ||
|
| 16,694 | JQ639066 | [ | ||
| Soleidae | Aesopia |
| 16,737 | KF000065 | [ |
| Aseraggodes |
| 16,944 | KJ601760 | [ | |
| Brachirus |
| 16,600 | KJ513134 | [ | |
| Heteromycteris |
| 17,111 | JQ639060 | [ | |
| Liachirus |
| 17,001 | KF573188 | [ | |
| Pardachirus |
| 16,573 | KJ461620 | [ | |
| Pseudaesopia |
| 16,789 | KJ433482 | [ | |
| Solea |
| 16,782 | KF142459 | [ | |
|
| 16,659 | AB270760 | [ | ||
| Zebrias |
| 16,734 | KJ433564 | [ | |
|
| 17,045 | NC023225 | [ | ||
|
| 16,758 | JQ700100 | [ | ||
|
| 16,762 | KC491209 | Unpublished | ||
| Paralichthyidae | Paralichthys |
| 17,090 | NC002386 | Unpublished |
| Psettodidae | Psettodes |
| 17,315 | FJ606835 | Unpublished |
Figure 1Graphical genome map of the mitogenome of C. monopus. The genes outside the outermost circle are transcribed clockwise, whereas the genes inside the outermost circle are transcribed counterclockwise. The inside circle shows the GC content and GC skews.
Features of the C. monopus mitochondrial genome.
| Gene | Strand | Position | Size (bp) | Amino Acids (aa) | Codon | Anti-Codon | Intergenic Nucleotides (bp) | ||
|---|---|---|---|---|---|---|---|---|---|
| From | To | Start | Stop | ||||||
| tRNA- | H | 1 | 68 | 68 | GAA | 0 | |||
| 12S rRNA | H | 69 | 1014 | 946 | 0 | ||||
| tRNA- | H | 1015 | 1087 | 73 | TAC | 0 | |||
| 16S rRNA | H | 1088 | 2789 | 1702 | 0 | ||||
| tRNA- | H | 2790 | 2859 | 70 | TAA | 0 | |||
|
| H | 2860 | 3834 | 975 | 324 | ATG | TAA | 0 | |
| Control region | H | 3835 | 4570 | 736 | 0 | ||||
| tRNA- | H | 4571 | 4643 | 73 | GAT | 0 | |||
| tRNA- | H | 4649 | 4717 | 69 | TTG | 5 | |||
| tRNA- | H | 4724 | 4793 | 70 | CAT | 6 | |||
|
| H | 4795 | 5838 | 1044 | 347 | ATG | TAA | 1 | |
| tRNA- | H | 5838 | 5905 | 68 | TCA | −1 | |||
| tRNA- | L | 5908 | 5976 | 69 | TGC | 2 | |||
| tRNA- | L | 5979 | 6051 | 73 | GTT | 2 | |||
| OL | L | 6056 | 6079 | 24 | 4 | ||||
| tRNA- | L | 6086 | 6150 | 65 | GCA | 6 | |||
| tRNA- | L | 6151 | 6218 | 68 | GTA | 0 | |||
|
| H | 6220 | 7770 | 1551 | 516 | GTG | TAA | 1 | |
| tRNA- | L | 7771 | 7841 | 71 | TGA | 0 | |||
| tRNA- | H | 7844 | 7912 | 69 | GTC | 2 | |||
|
| H | 7914 | 8604 | 691 | 230 | ATG | T | 1 | |
| tRNA- | H | 8605 | 8680 | 76 | TTT | 0 | |||
|
| H | 8683 | 8847 | 165 | 54 | ATG | TAA | 2 | |
|
| H | 8838 | 9521 | 684 | 227 | ATG | TAA | −10 | |
|
| H | 9521 | 10,306 | 786 | 261 | ATG | TAA | −1 | |
| tRNA- | H | 10,306 | 10,375 | 70 | TCC | −1 | |||
|
| H | 10,376 | 10,726 | 351 | 116 | ATG | TAA | 0 | |
| tRNA- | H | 10,725 | 10,793 | 69 | TCG | −2 | |||
|
| H | 10,794 | 11,090 | 297 | 98 | ATG | TAA | 0 | |
|
| H | 11,084 | 12,448 | 1365 | 454 | ATG | TAA | −7 | |
| tRNA- | H | 12,456 | 12,524 | 69 | GTG | 9 | |||
| tRNA- | H | 12,525 | 12,592 | 68 | GCT | 0 | |||
| tRNA- | H | 12,595 | 12,667 | 73 | TAG | 2 | |||
|
| H | 12,671 | 14,524 | 1854 | 617 | ATG | TAA | 3 | |
|
| L | 14,530 | 15,051 | 522 | 183 | ATG | TAG | 5 | |
| tRNA- | L | 15,052 | 15,120 | 69 | TTC | 0 | |||
|
| H | 15,123 | 16,259 | 1137 | 378 | ATG | TAG | 2 | |
| tRNA- | H | 16,263 | 16,331 | 69 | TGT | 3 | |||
| tRNA- | L | 16,331 | 16,401 | 71 | TGG | −1 | |||
| Noncoding region | H | 16,402 | 16,425 | 24 | 0 | ||||
Nucleotide composition and skewness levels calculated for sequenced majority strand of the C. monopus.
| Regions | Size (bp) | Nucleotides Composition (%) | AT-Skew | GC-Skew | |||||
|---|---|---|---|---|---|---|---|---|---|
| T | C | A | G | AT | GC | ||||
| Whole genome | 16,425 | 30.39 | 24.04 | 30.80 | 14.77 | 61.19 | 38.81 | 0.01 | −0.24 |
| PCGs | 11,422 | 32.38 | 25.03 | 28.38 | 14.21 | 60.76 | 39.24 | −0.07 | −0.28 |
| 1st codon position | 3808 | 24.91 | 24.52 | 27.10 | 23.46 | 52.02 | 47.98 | 0.04 | −0.02 |
| 2nd codon position | 3807 | 42.10 | 26.48 | 18.15 | 13.27 | 60.25 | 39.75 | −0.40 | −0.33 |
| 3rd codon position | 3807 | 30.94 | 24.06 | 38.92 | 6.08 | 69.86 | 30.14 | 0.11 | −0.60 |
|
| 684 | 30.85 | 31.43 | 27.49 | 10.23 | 58.33 | 41.67 | −0.06 | −0.51 |
|
| 165 | 32.73 | 25.45 | 33.94 | 7.88 | 66.67 | 33.33 | 0.02 | −0.53 |
|
| 1551 | 31.72 | 23.86 | 26.69 | 17.73 | 58.41 | 41.59 | −0.09 | −0.15 |
|
| 691 | 32.27 | 22.43 | 30.39 | 14.91 | 62.66 | 37.34 | −0.03 | −0.20 |
|
| 786 | 30.53 | 27.35 | 25.83 | 16.28 | 56.36 | 43.64 | −0.08 | −0.25 |
|
| 1137 | 34.21 | 25.77 | 26.30 | 13.72 | 60.51 | 39.49 | −0.13 | −0.31 |
|
| 975 | 32.51 | 24.62 | 28.41 | 14.46 | 60.92 | 39.08 | −0.07 | −0.26 |
|
| 1044 | 31.90 | 26.44 | 31.32 | 10.34 | 63.22 | 36.78 | −0.01 | −0.44 |
|
| 351 | 32.76 | 26.21 | 27.35 | 13.68 | 60.11 | 39.89 | −0.09 | −0.31 |
|
| 1365 | 32.60 | 24.62 | 30.26 | 12.53 | 62.86 | 37.14 | −0.04 | −0.33 |
|
| 297 | 29.97 | 29.29 | 24.24 | 16.50 | 54.21 | 45.79 | −0.11 | −0.28 |
|
| 1854 | 31.01 | 25.57 | 30.91 | 12.51 | 61.92 | 38.08 | 0.00 | −0.34 |
|
| 522 | 41.38 | 12.26 | 21.65 | 24.71 | 63.03 | 36.97 | −0.31 | 0.34 |
| tRNAs | 1540 | 30.19 | 17.92 | 30.91 | 20.97 | 61.10 | 38.90 | 0.01 | 0.08 |
| rRNAs | 2648 | 25.11 | 20.85 | 35.31 | 18.73 | 60.42 | 39.58 | 0.17 | −0.05 |
| Control region | 736 | 35.87 | 17.66 | 35.05 | 11.41 | 70.92 | 29.08 | −0.01 | −0.21 |
Figure 2Graphical illustration shown the usage bias of AT and GC (AT-skew and GC-skew values) (A), and the length of protein-coding genes (PCGs), tRNAs, rRNAs, and control regions (B) in the mitogenomes of 19 species in the subfamily Cynoglossinae. Note: Species 1–21: C. abbreviatus, C. bilineatus, C. gracilis, C. interruptus, C. itinus, C. joyneri, C. monopus, C. nanhaiensis, C. puncticeps, C. robustus, C. roulei, C. semilaevis, C. senegalensis, C. sinicus, C. trigrammus, C. zanzibarensis, P. bilineata, P. blochii and P. japonica.
Figure 3Amino acid composition (A) and relative synonymous codon usage (B) in the mitogenome of C. monopus.
Figure 4Inferred gene rearrangement between the gene order of typical vertebrates and the mitogenome of C. monopus. The typical vertebrate gene order (a); inferred intermediate processes of gene rearrangement (b,c); the gene order in the mitogenome of C. monopus (d).
Figure 5Phylogenetic tree of C. monopus was performed using partial genomes of 32 species of Pleuronectiformes and Psettodes erumei and Paralichthys olivaceus were used as outgroups, with Bayesian analyses and Maximum likelihood analyses. Species in red indicates sequence generated in this study. Bootstrap support (right) and Bayesian posterior probability values (left) of each clade are displayed next to the nodes.
Figure 6Chronogram for the 32 species of Pleuronectiformes with P. erumei and P. olivaceus as outgroups based on the concatenated nucleotide sequences of the 12 PCGs (except ND6 and the third codon positions) and 2 rRNA genes using BEAST analysis. Numbers near the nodes indicate the average estimated divergence time estimated in Mya, and the 95% confidence intervals for each node are shown in blue bars.