| Literature DB >> 35379869 |
Anjaly Jose1,2, Sandhya Sukumaran3, Lakshmi P Mukundan3, Neenu Raj3, Sujitha Mary3, K Nisha3, A Gopalakrishnan3.
Abstract
Carangids are abundant and commercially important marine fish that contribute to a significant portion of the fisheries in many parts of the world. In the present study, we characterized the complete mitogenome of the Indian scad, Decapterus russelli and performed a comprehensive comparative mitogenomic analysis of the family Carangidae. The comparative mitogenomics provided valuable insights into the structure, variability, and features of the coding and non-coding regions that evolved across species over millions of years. The structural features of tRNAs revealed changes in the frequency of mismatched and wobble base pairs, which is reflected in the base composition of H and L strands. The highly conserved sequence motif of the mTERF binding site in carangids over the ~ 400 MYA of their divergence demonstrated the functional importance of these sites. The control region of carangids was characterized by the presence of discontinuous repeat units with a high rate of sequence divergence in the form of base substitutions, insertions, and deletions. The maintenance of secondary structures in the control region independent of the rapid evolution of primary structure suggested the effect of selective constraints on their maintenance. Maximum likelihood (ML) and Bayesian inference (BI) phylogeny revealed a similar topology consistent with previous taxonomic studies. The extant carangids diverged through the evolutionary events experienced during the Cretaceous, Paleogene, and Neogene periods.Entities:
Mesh:
Year: 2022 PMID: 35379869 PMCID: PMC8980026 DOI: 10.1038/s41598-022-09636-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Sequence characteristics of Decapterus russelli mitochondrial genome.
| Locus | Strand | Position | Size (bp) | Codon | A% | T% | G% | C% | A + T | G + C | AT skew | GC skew | Anticodon | Intergenic nucleotide | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Start | Stop | ||||||||||||||
| tRNAPhe | H | 1–68 | 68 | 35.3 | 19.1 | 23.5 | 22.1 | 54.4 | 45.6 | 0.297 | 0.030 | GAA | |||
| 12SrRNA | H | 69–1022 | 954 | 30.8 | 21.4 | 22 | 25.8 | 52.2 | 47.8 | 0.180 | − 0.079 | ||||
| tRNA Val | H | 1023–1094 | 72 | 30.6 | 19.4 | 23.6 | 26.4 | 50 | 50 | 0.224 | − 0.056 | TAC | |||
| 16SrRNA | H | 1095–2817 | 1723 | 32.4 | 22 | 20.1 | 25.4 | 54.4 | 45.5 | 0.191 | − 0.106 | ||||
| tRNALeu (TAA) | H | 2818–2891 | 74 | 28.4 | 23 | 23 | 25.7 | 51.4 | 48.7 | 0.105 | − 0.060 | TAA | |||
| ND1 | 2892–3866 | 975 | ATG | TAA | 23.4 | 26.4 | 16 | 34.3 | 49.8 | 50.3 | − 0.060 | − 0.363 | 5 | ||
| tRNA Ile | H | 3872–3941 | 70 | 27.1 | 21.4 | 28.6 | 22.9 | 48.5 | 51.5 | 0.117 | 0.110 | GAT | |||
| tRNAGln | L | 3941–4011 | 71 | 35.2 | 25.4 | 15.5 | 23.9 | 60.6 | 39.9 | 0.161 | − 0.210 | TTG | − 1 | ||
| tRNA Met | H | 4011–4080 | 70 | 25.7 | 25.7 | 22.9 | 25.7 | 51.4 | 48.6 | 0.00 | − 0.057 | CAT | |||
| ND2 | H | 4081–5125 | 1045 | ATG | T– | 27.2 | 24.1 | 12.2 | 36.5 | 51.3 | 48.7 | 0.060 | − 0.500 | ||
| tRNATrp | H | 5126–5196 | 71 | 32.4 | 16.9 | 25.4 | 25.4 | 49.3 | 50.8 | 0.315 | 0.00 | TCA | 1 | ||
| tRNAAla | L | 5198–5266 | 69 | 34.8 | 24.6 | 14.5 | 26.1 | 59.4 | 40.6 | 0.171 | − 0.285 | TGC | 1 | ||
| tRNAAsn | L | 5268–5340 | 73 | 32.9 | 20.5 | 17.8 | 28.8 | 53.4 | 46.6 | 0.232 | − 0.236 | GTT | 38 | ||
| tRNACys | L | 5379–5445 | 67 | 26.9 | 26.9 | 22.4 | 23.9 | 53.8 | 46.3 | 0.00 | − 0.032 | GCA | |||
| tRNA Tyr | L | 5446–5515 | 70 | 30 | 22.9 | 18.6 | 28.6 | 52.9 | 47.2 | 0.134 | − 0.211 | GTA | 1 | ||
| COI | H | 5517–7067 | 1551 | GTG | TAA | 24.5 | 29.7 | 18.6 | 27.2 | 54..2 | 45.8 | − 0.095 | − 0.187 | ||
| tRNASer (TGA) | L | 7068–7138 | 71 | 28.2 | 23.9 | 19.7 | 28.2 | 52.1 | 47.9 | 0.082 | − 0.180 | TGA | 3 | ||
| tRNA Asp | H | 7142–7212 | 71 | 29.6 | 28.2 | 21.1 | 21.2 | 53.8 | 46.3 | 0.026 | − 0.002 | GTC | 7 | ||
| COII | H | 7220–7910 | 691 | ATG | T– | 27.9 | 25.3 | 16.6 | 30.1 | 53.2 | 46.7 | 0.049 | − 0.289 | ||
| tRNA Lys | H | 7911–7984 | 74 | 31.1 | 21.6 | 20.3 | 27 | 52.7 | 47.3 | − 3.614 | − 0.141 | TTT | 1 | ||
| ATP8 | H | 7986–8150 | 165 | ATG | TAG | 24.1 | 26.8 | 13.9 | 35.2 | 50.9 | 49.1 | − 0.053 | − 0.430 | − 7 | |
| ATP6 | H | 8144–8827 | 684 | ATG | TAA | 24.1 | 26.8 | 13.9 | 35.2 | 50.9 | 49.1 | − 0.053 | − 0.430 | − 1 | |
| COIII | H | 8827–9611 | 785 | ATG | TA- | 23.9 | 26.6 | 17.8 | 31.6 | 50.5 | 49.4 | − 0.053 | − 0.280 | ||
| tRNAGly | H | 9612–9681 | 70 | 37.1 | 28.6 | 15.7 | 18.6 | 65.7 | 34.3 | 0.129 | − 0.084 | TCC | |||
| ND3 | H | 9682–10,030 | 349 | ATG | T– | 22.1 | 29.2 | 15.5 | 33.2 | 51.3 | 48.7 | − 0.138 | − 0.363 | ||
| tRNAArg | H | 10,031–10,099 | 69 | 31.9 | 33.3 | 18.8 | 15.9 | 65.2 | 34.7 | − 0.021 | 0.083 | TCG | 1 | ||
| ND4L | H | 10,101–10,397 | 297 | ATG | TAA | 22.2 | 27.9 | 15.5 | 34.3 | 50.1 | 49.8 | − 0.113 | − 0.380 | ||
| ND4 | H | 10,391–11,771 | 1381 | ATG | T– | 25.6 | 25.8 | 14.7 | 34 | 51.4 | 48.7 | − 0.003 | − 0.396 | ||
| tRNA His | H | 11,772–11,843 | 72 | 29.2 | 29.2 | 19.4 | 22.2 | 58.4 | 41.6 | 0.00 | − 0.067 | GTG | |||
| tRNASer(GCT) | H | 11,844–11,911 | 68 | 25 | 17.6 | 25 | 32.4 | 42.6 | 57.4 | 0.173 | − 0.128 | GCT | |||
| tRNALeu(TAG) | H | 11,917–11,989 | 73 | 31.5 | 23.3 | 20.5 | 24.7 | 54.8 | 45.2 | 0.149 | − 0.0930 | TAG | |||
| ND5 | H | 11,990–13,828 | 1839 | ATG | TAG | 26.6 | 26.4 | 14.2 | 32.7 | 53 | 46.9 | 0.003 | − 0.394 | − 4 | |
| ND6 | L | 13,825–14,346 | 522 | ATG | TAG | 13.8 | 36 | 33.3 | 16.9 | 49.8 | 50.2 | − 0.445 | 0.327 | ||
| tRNAGlu | L | 14,347–14,415 | 69 | 33.3 | 21.7 | 17.4 | 27.5 | 55 | 44.9 | 0.210 | − 0.300 | TCC | 4 | ||
| Cytb | H | 14,420–15,560 | 1141 | ATG | T– | 23.7 | 27.1 | 15.8 | 33.5 | 50.8 | 49.3 | − 0.070 | − 0.352 | ||
| tRNAThr | H | 15,561–15,632 | 72 | 22.2 | 23.6 | 26.4 | 27.8 | 45.8 | 54.2 | − 0.030 | − 0.025 | TGT | − 1 | ||
| tRNA Pro | L | 15,632–15,702 | 71 | 33.8 | 25.4 | 12.7 | 28.2 | 59.2 | 40.9 | 0.134 | − 0.378 | TGG | |||
| Dloop | H | 15,703–16,542 | 840 | 33 | 30.6 | 16.3 | 20.1 | 63.6 | 36.4 | 0.037 | − 0.104 | ||||
Figure 1Secondary structure of tRNAs showing structural variations in 37 species. The first structure represents the nucleotide positions common to all tRNAs (1–88) and details of stem- loop. Length variations in stems and loops are indicated by grey squares and green arcs respectively. The secondary structure of tRNAs were predicted by ARWEN software (http://130.235.244.92/ARWEN/) and edited manually in Adobe Photoshop CS6.0.
Size of stems and loops of 22 tRNA genes in the mitogenomes of 37 species of Carangidae.
| tRNA | AA stem (bp) | DHU stem (bp) | DHU loop (nucleotide) | AC stem (bp) | V loop (nucleotide) | TΨU stem (bp) | TΨU loop (nucleotide) | |||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6 | 7 | 8 | 0 | 3 | 4 | 5 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 5 | 6 | 3 | 4 | 5 | 4 | 5 | 6 | 6 | 7 | 8 | 9 | |
| Phe | 35 | 2 | 12 | 25 | 24 | 13 | 37 | 37 | 37 | 37 | ||||||||||||||||||||
| Val | 35 | 2 | 33 | 4 | 4 | 33 | 37 | 37 | 37 | 37 | ||||||||||||||||||||
| Leu(TAA) | 37 | 37 | 21 | 16 | 37 | 37 | 37 | 37 | ||||||||||||||||||||||
| Ile | 18 | 19 | 37 | 1 | 36 | 37 | 37 | 37 | 26 | 11 | ||||||||||||||||||||
| Met | 2 | 35 | 8 | 29 | 7 | 18 | 12 | 37 | 34 | 3 | 1 | 36 | 36 | 1 | ||||||||||||||||
| Trp | 37 | 37 | 36 | 1 | 37 | 37 | 37 | 37 | ||||||||||||||||||||||
| Asp | 36 | 1 | 37 | 3 | 34 | 37 | 37 | 37 | 37 | |||||||||||||||||||||
| Lys | 33 | 4 | 36 | 1 | 1 | 6 | 29 | 1 | 37 | 37 | 37 | 37 | ||||||||||||||||||
| Gly | 37 | 37 | 28 | 6 | 3 | 37 | 37 | 37 | 37 | |||||||||||||||||||||
| Arg | 31 | 6 | 37 | 37 | 37 | 37 | 37 | 37 | ||||||||||||||||||||||
| His | 37 | 37 | 3 | 2 | 32 | 37 | 11 | 26 | 37 | 37 | ||||||||||||||||||||
| Ser(GCT) | 3 | 34 | 34 | 6 | 30 | 1 | 7 | 30 | 31 | 6 | 37 | 37 | ||||||||||||||||||
| Leu(TAG) | 37 | 37 | 37 | 37 | 37 | 37 | 37 | |||||||||||||||||||||||
| Thr | 37 | 37 | 3 | 32 | 2 | 37 | 37 | 37 | 37 | |||||||||||||||||||||
| Gln | 32 | 5 | 37 | 37 | 37 | 37 | 37 | 37 | ||||||||||||||||||||||
| Ala | 20 | 17 | 37 | 37 | 37 | 37 | 37 | 37 | ||||||||||||||||||||||
| Asn | 37 | 27 | 10 | 14 | 23 | 37 | 37 | 37 | 37 | |||||||||||||||||||||
| Cys | 37 | 5 | 32 | 23 | 14 | 37 | 37 | 37 | 2 | 35 | ||||||||||||||||||||
| Tyr | 37 | 37 | 34 | 3 | 37 | 37 | 37 | 37 | ||||||||||||||||||||||
| Ser(TGA) | 37 | 37 | 37 | 37 | 37 | 37 | 37 | |||||||||||||||||||||||
| Glu | 32 | 5 | 37 | 37 | 37 | 37 | 37 | 37 | ||||||||||||||||||||||
| Pro | 32 | 5 | 37 | 36 | 1 | 37 | 34 | 37 | 37 | |||||||||||||||||||||
Occurrence of wobble pairings in the stem regions of 22 tRNA genes in the mitogenome of 37 species of Carangidae.
| tRNA | AA stem | DHU stem | AC stem | TΨU stem | Total | Percentage | |
|---|---|---|---|---|---|---|---|
| Total base pairs | 5871 | 3066 | 4066 | 3999 | 17,002 | ||
| Wobble bp | Wobble bp | Wobble bp | Wobble bp | ||||
| H strand coded | Phe | 10 | 6 | 0 | 1 | 17 | 2.5 |
| Val | 2 | 2 | 0 | 3 | 7 | 0.93 | |
| Leu(TAA) | 0 | 37 | 0 | 3 | 40 | 4.6 | |
| Ile | 6 | 0 | 0 | 0 | 6 | 0.72 | |
| Met | 33 | 0 | 0 | 11 | 44 | 5.50 | |
| Trp | 2 | 35 | 0 | 0 | 37 | 4.80 | |
| Asp | 0 | 0 | 17 | 19 | 2.50 | ||
| Lys | 0 | 0 | 0 | 0 | 0 | 0 | |
| Gly | 0 | 65 | 0 | 0 | 65 | 8.40 | |
| Arg | 8 | 6 | 2 | 18 | 34 | 4.34 | |
| His | 2 | 7 | 0 | 23 | 32 | 4.20 | |
| Ser(GCT) | 0 | 5 | 7 | 12 | 1.64 | ||
| Leu(TAG) | 0 | 0 | 0 | 0 | 0 | 0 | |
| Thr | 0 | 0 | 6 | 0 | 6 | 0.80 | |
| Average (%) | 1.72 | 8.27 | 0.50 | 3.29 | 3.00 | ||
| L strand coded | Gln | 37 | 37 | 0 | 0 | 74 | 9.46 |
| Ala | 42 | 30 | 41 | 34 | 150 | 18.82 | |
| Asn | 24 | 37 | 0 | 18 | 79 | 10.03 | |
| Cys | 52 | 34 | 1 | 1 | 88 | 10.81 | |
| Tyr | 37 | 0 | 37 | 0 | 74 | 9.52 | |
| Ser(TGA) | 3 | 6 | 29 | 34 | 66 | 9.40 | |
| Glu | 68 | 0 | 71 | 9 | 148 | 19.00 | |
| Pro | 35 | 74 | 32 | 1 | 142 | 18.13 | |
| Average | 14.13 | 18.84 | 14.25 | 6.75 | 13.18 | ||
| Coefficient of varitaion | 1.30 | 1.29 | 1.84 | 1.32 | 0.88 | ||
| Grand average | 6.18 | 12.26 | 5.50 | 4.57 | 6.70 |
Figure 2Sequence and structure of the mitochondrial DNA control region of carangids (a) Features of the control region. CSB-F,-D,-E: central conserved sequence blocks; CSB-1,-2,-3: conserved sequence blocks. Palindromic sequence motifs 'TACTA' and 'ATGTA' are framed. Grey colored bar represents the characteristic 3' terminal sequence. (b) The secondary structure identified in CSB-F, CSB-D, and CSB -E. (c) Secondary structure identified in CSB-1, CSB-2, and CSB-3. The secondary structures of conserved sequence motifs were predicted by the online Mfold web server (http://www.unafold.org/).
The comparison of Central conserved blocks, Conserved sequence blocks and 3’ terminal region in 37 species of Carangidae.
| Species | Central conserved blocks | ||
|---|---|---|---|
| CSB- F | CSB- E | CSB- D | |
| Reference sequence | ATGTAGTAAGAACCTACCA | AGGGACAACTATTGTGGGGGT | TATTCCTGGCATTTGGTTCCTA |
| ******************* | ************C******** | ************C********* | |
| ***********G******* | *******GAA*********** | ********************** | |
| ******************* | ********************G | ********************** | |
| ******************* | ********TA*********** | ********************** | |
| ******************* | ********AA*********** | ********************** | |
| ******************* | *******GAA*********** | ********************** | |
| ******************* | ********AA*********** | ********************** | |
| ******************* | *******GAA*********** | ********************** | |
| ******************* | ********AA*CC******** | ********************** | |
| ******************* | *A******AA******A**** | ********************** | |
| GCCC*A************* | *******GAA*********** | ********************** | |
| **A****G*AGG*TC**** | ***A*****C***ACATTC*C | ********************** | |
| *C******G**C*****GG | *A******AA**C***A**** | ****************C***** | |
| *C******G**C*****GG | *A******AA******A**** | ****************C***** | |
| ******************* | ********AA*********** | ************C********* | |
| **************G**** | ********AA*********** | ********************** | |
| ******************* | ********************* | ************C********* | |
| ******************* | ********AA**C******** | ********************** | |
| ***C*************** | *******GTA*********** | ********************** | |
| ***********G******* | *******GAA*********** | ****************C***** | |
| *C*C**********C**** | *A*****GAA******A**** | ****************C***** | |
| ******************* | ********AA*********** | ********************** | |
| ******************* | ********AA*****A***** | ********************** | |
| ***********G******* | ********AA**C******** | ********************** | |
| ***********G******* | ********AA**C******** | ********************** | |
| ******************* | G*A****GAA**C******** | ********************** | |
| *CTC*A********G**** | *******GAA**C******** | ********************** | |
| *CTC*A********G**** | *******GTA*********** | ********************** | |
| *CCC*A********G**** | ********TA**C******** | ********************** | |
| ******************* | *A*****GAA*********** | ********************** | |
| ******************* | *******GAA*********** | ********************** | |
| *CCC*A********G**** | ********TA**C******** | ********************** | |
| ***********G******* | ********AA**C******** | ********************** | |
| GCAC*******GA*C**** | ********A*C********** | ********************** | |
| ***********G******* | ********TA*********** | ********************** | |
| ******************* | *******GAA*********** | ********************** | |
| ******************* | ******AGAA*********** | ********************** | |
Dr = D. russelli, Ac = A. ciliaris, Ai = A. indica, Ad = A. djedaba, Ak = A. kleinii, Am = A. mate, Ca = C. armatus, Cb = C. bajad, Ce = C. equula, Cm = C. malabaricus, Cp = C. plagiotaenia, Ci = C. ignobilis, Ct = C. tille, Cmel = C. melampygus, Dmac = D. macerellus, Dm = D. macrosoma, Dmar = D. maruadsi, Dt = D. tabl, Eb = E. bipinnulata, Gs = G. speciosus, Mc = M. cordyla, Pn = P. niger, Pd = P. dentex, Tt = T. trachurus, Tj = T. japonicas, Sc = S. crumenophthalmus, Sd = S. dumerili, Sl = S. lalandi, Sq = S. quinqueradiata, Sle = S. leptolepis, Sr = S. rivoliana, Sn = S. nigrofasciata, Tb = T. blochii, To = T. ovatus, Tc = T. carolinus, Uh = U. helvola, Us = U. secunda.
Asterisks indicate the same nucleotide as the reference sequence. Insertions are represented as bold and deletions as underlined. Other nucleotides denote base substitutions.
Figure 3Phylogenetic tree inferred from the nucleotide sequences of 13 concatenated PCGs of 37 species of Carangidae. Bayesian posterior probabilities (left) and bootstrap support values (right) are superimposed with each node. The phylogenetic tree was generated by using MrBayes 3.2.7a[93].
Figure 4Timetree chronogram of 37 species of Carangidae for the 13 PCGs inferred from Bayesian topology. Divergence times were estimated using 32 calibrations. The red text indicates the species sequenced in this study. The Timetree was generated by using MEGA X[96].