Literature DB >> 31536521

The complete mitochondrial genome of Calyptogena marissinica (Heterodonta: Veneroida: Vesicomyidae): Insight into the deep-sea adaptive evolution of vesicomyids.

Mei Yang1,2,3,4, Lin Gong1,2,3,4, Jixing Sui1,2,3,4, Xinzheng Li1,2,3,4.   

Abstract

The deep-sea chemosynthetic environment is one of the most extreme environments on the Earth, with low oxygen, high hydrostatic pressure and high levels of toxic substances. Species of the family Vesicomyidae are among the dominant chemosymbiotic bivalves found in this harsh habitat. Mitochondria play a vital role in oxygen usage and energy metabolism; thus, they may be under selection during the adaptive evolution of deep-sea vesicomyids. In this study, the mitochondrial genome (mitogenome) of the vesicomyid bivalve Calyptogena marissinica was sequenced with Illumina sequencing. The mitogenome of C. marissinica is 17,374 bp in length and contains 13 protein-coding genes, 2 ribosomal RNA genes (rrnS and rrnL) and 22 transfer RNA genes. All of these genes are encoded on the heavy strand. Some special elements, such as tandem repeat sequences, "G(A)nT" motifs and AT-rich sequences, were observed in the control region of the C. marissinica mitogenome, which is involved in the regulation of replication and transcription of the mitogenome and may be helpful in adjusting the mitochondrial energy metabolism of organisms to adapt to the deep-sea chemosynthetic environment. The gene arrangement of protein-coding genes was identical to that of other sequenced vesicomyids. Phylogenetic analyses clustered C. marissinica with previously reported vesicomyid bivalves with high support values. Positive selection analysis revealed evidence of adaptive change in the mitogenome of Vesicomyidae. Ten potentially important adaptive residues were identified, which were located in cox1, cox3, cob, nad2, nad4 and nad5. Overall, this study sheds light on the mitogenomic adaptation of vesicomyid bivalves that inhabit the deep-sea chemosynthetic environment.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31536521      PMCID: PMC6752807          DOI: 10.1371/journal.pone.0217952

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Mitochondria, which descended from Proteobacteria via endosymbiosis, are important organelles in eukaryotic cells and are involved in various processes, such as ATP generation, signaling, cell differentiation, growth and apoptosis [1]. Moreover, mitochondria have their own genetic information system. In general, the metazoan mitogenome is a closed, circular DNA molecule, ranging from 12 to 20 kb in length and usually containing 37 genes: 13 protein-coding genes (PCGs) (atp6, atp8, cox1-3, cob, nad1-6 and nad4l) of the respiratory chain, 2 ribosomal RNA (rRNA) genes (rrnS and rrnL) and 22 transfer RNA (tRNA) genes [2]. In addition, there are several noncoding regions in the mitogenome, and the longest noncoding “AT-rich” region is normally interpreted as the control region (CR), which includes elements controlling the initiation and regulation of transcription and replication [3]. Owing to variable gene order, a low frequency of gene recombination and different genes having different evolutionary rates, mitochondrial sequences are widely used for species identification, genetic diversity assessment and phylogenetics at various taxonomic levels [4-7]. Since the discovery of cold seeps and hydrothermal vents in the deep-sea, the unique biological communities that depend on chemosynthetic primary production have attracted the attention of researchers [8-11]. These deep-sea chemosynthetic environments lack sunlight and exhibit high pressure, low oxygen and high levels of chemical toxicity due to various heavy metals, and the organisms that live there show a series of adaptations (more complex innate immune system, expanded gene families for stabilizing protein structures and removing toxic substances, a set of positively selected genes related to cytoskeleton structures, DNA repair and genetic information processing) compared with marine species in coastal environments [12-15]. Mitochondria are the energy metabolism centers of eukaryotic cells, which can generate more than 95% of cellular energy through oxidative phosphorylation (OXPHOS) [3]. Therefore, mitochondrial PCGs may undergo evolutionary selection in response to metabolic requirements in extremely harsh environments [16-18]. Numerous studies have found clear and compelling evidence of adaptive evolution in the mitogenome of organisms from extreme habitats, including Tibetan humans [19], Chinese snub-nosed monkeys [20], Tibetan horses [21-22], Tibetan wild yaks [23], galliform birds [24], and Tibetan loaches [25]. The family Vesicomyidae (Dall & Simpson, 1901) is widely distributed worldwide from shelf to hadal depths and comprises specialized bivalves occurring in reducing environments such as hydrothermal vents located in mid-ocean ridges and back-arc basins, cold seeps at continental margins and whale falls [26-29]. Studies have shown that vesicomyid bivalves rely upon the symbiotic chemoautotrophic bacteria in their gills for all or part of nutrition [30-31]. Based on the shells and soft body, the Vesicomyidae is divided into two subfamilies: Vesicomyinae and Pliocardiinae. The Vesicomyinae includes only one genus, Vesicomya, while Pliocardiinae currently contains 20 genera. Among the 20 genera, Calyptogena is the most diverse group of deep-sea vesicomyid bivalves in the western Pacific region and its marginal seas [32]. As some of the dominant species in the deep-sea, vesicomyids are an interesting taxon with which to study the mechanisms of adaptation to diverse stressors in deep-sea habitats. Considering that the mitogenome has highly compact DNA and is easily accessible, several complete/nearly complete mitogenomes of vesicomyids have been sequenced [33-36] in recent years; however, limited information is available about the mechanism of adaptation to deep-sea habitats in vesicomyids at the mitogenome level. In the present study, we obtained the mitogenome of Calyptogena marissinica, a new species of the family Vesicomyidae from the Haima cold seep of the South China Sea. First, the mitogenome organization, codon usage, and gene order information were obtained, and we compared the composition of this mitogenome with that of other available vesicomyid bivalve mitogenomes. Second, based on mitochondrial PCGs and 2 rRNA genes, the phylogenetic relationships between C. marissinica and other species from subclass Heterodonta were examined. Finally, to understand the adaptive evolution of deep-sea organisms, we conducted positive selection analysis of vesicomyid bivalve mitochondrial PCGs.

Materials and methods

Sampling, identification and DNA extraction

Specimen of C. marissinica was sampled from the “Haima” methane seep in the northern sector of the South China Sea, 16°43.80′N, 110°28.50′E, 1,390 m, using a remotely operated vehicle (ROV) in May 2018. Species-level morphological identification was performed according to the main points of Chen et al. (2018) [32]. Specimen was dissected and different tissues were preserved separately at -80°C until DNA extraction. Total genomic DNA was extracted from the adductor muscle tissue using a DNeasy tissue kit (Qiagen, Beijing, China) following the manufacturer’s protocols.

Illumina sequencing, mitogenome assembly and annotation

After DNA isolation, 1 μg of purified DNA was fragmented, used to construct short-insert libraries (insert size of 430 bp) according to the manufacturer’s instructions (TruSeq™ Nano DNA Sample Prep Kit, Illumina), and then sequenced on an Illumina HiSeq 4000 instrument (San Diego, USA). Prior to assembly, raw reads were filtered by Trimmomatic 0.35. This filtering step was performed to remove the reads with adaptors, the reads showing a quality score below 20 (Q<20), the reads containing a percentage of uncalled bases (“N” characters) equal to or greater than 10% and the duplicated sequences. The mitochondrial genome was reconstructed using a combination of de novo and reference-guided assemblies, and the following three steps were used to assemble the mitogenome. First, the filtered reads were assembled into contigs using SPAdes 3.10.1. Second, contigs were aligned to the reference mitogenomes from species of the family Vesicomyidae using BLAST (https://blast.ncbi.nlm.nih.gov/Blast.cgi), and aligned contigs (≥80% similarity and query coverage) were ordered according to the reference mitogenomes. Third, clean reads were mapped to the assembled draft mitogenome using GapCloser 1.12 with default parameters, and the majority of gaps were filled via local assembly. The mitochondrial genes were annotated using homology alignments and de novo prediction, and EVidenceModeler [37] was used to integrate the gene set. rRNA genes and tRNA genes were predicted by rRNAmmer [38] and tRNAscan-SE [39]. A whole-mitochondrial genome BLAST search (E-value ≤ 1e-5, minimal alignment length percentage ≥ 40%) was performed against 5 databases, namely, the KEGG (Kyoto Encyclopedia of Genes and Genomes), COG (Clusters of Orthologous Groups), NR (Non-Redundant Protein), Swiss-Prot and GO (Gene Ontology) databases. Organellar Genome DRAW [40] was used for circular display of the C. marissinica mitogenome.

Sequence analysis

The nucleotide composition and codon usage were computed using DnaSP 5.1 [41]. The AT and GC skews were calculated with the following formulas: AT skew = (A − T) / (A + T) and GC skew = (G − C) / (G + C) [42], where A, T, G and C are the occurrences of the four nucleotides. Tandem Repeats Finder 4.0 [43] was used to search for the tandem repeat sequences. The online version of Mfold [44] was applied to infer potential secondary structure, and if more than one secondary structure appeared, the one with the lowest free energy score was used.

Phylogenetic analysis

The phylogeny of the subclass Heterodonta was reconstructed using mitogenome data from 41 species, including 2 Lucinida species, 2 Myoida species, and 37 Veneroida species, and Chlamys farreri and Mimachlamys nobilis from the subclass Pteriomorphia served as outgroups (S1 Table). Our data set was based on nucleotide and amino acid sequences from 9 mitochondrial PCGs (cox1, cox2, cox3, cob, atp6, nad1, nad4, nad5, and nad6) and 2 rRNA genes. The atp8, nad2, nad4l and nad6 sequences were excluded due to several species with incomplete mitogenomes. Multiple alignments of the whole 11 genes were conducted using MUSCLE 3.8.31. In aligned sequences, ambiguously aligned regions and gaps were removed using Gblocks server 0.91b [45] with the default setting. ModelTest 2.1.10 [46] and ProtTest 3.4 [47] were used to select the best-fit evolutionary models GTR + I + G and LG + I + G + F for the nucleotide dataset and amino acid dataset, respectively. Maximum likelihood (ML) analysis was performed using RAxML 8.2.8 [48]. Topological robustness for the ML analysis was evaluated with 100 replicates of bootstrap support. Bayesian inference (BI) was conducted using MrBayes 3.2.6 [49], and four Markov chain Monte Carlo (MCMC) chains were run for 106 generations, with sampling every 100 generations and a 25% relative burn-in. All phylogenetic trees were graphically edited with the iTOL 3.4.3 (https://itol.embl.de/itol.cgi).

Positive selection analysis

Comparing the nonsynonymous/synonymous substitution ratios (ω = dN/dS) has become a useful approach for quantifying the impact of natural selection on molecular evolution [50]. ω >1, = 1 and <1 indicate positive selection, neutrality and purifying selection, respectively [51]. The codon-based maximum likelihood (CodeML) method implemented in the PAML package [52] was applied to estimate the dN/dS ratio ω. The combined database of 13 mitochondrial PCGs was used for the selection pressure analyses (S2 Table). Both the ML and Bayesian phylogenetic trees were separately used as the working topology in all CodeML analyses. To evaluate positive selection in the vesicomyid bivalves, we used branch models in the present study. First, the one-ratio model (M0) estimate the distribution of ɷ values as a benchmark under an assumption of no adaptive evolution in the gene sequences, which assumes a single ω ratio for all branches in the phylogeny [53]. Then, the two-ratio model (M2), which allows the background lineages and foreground lineages to have different ω ratio values, was used to detect positive selection acting on branches of interest [54-55]. Last, a free-ratio model (M1), which allows ω ratio variation among branches, was used to estimate ω values on each branch [55]. Here, the one-ratio model (M0) and the free-ratio model (M1) were compared to confirm whether different clades in Heterodonta had different ω values. Additionally, we compared the one-ratio model (M0) and the two-ratio model (M2) to investigate whether deep-sea vesicomyid clades are subjected to more selection pressure than other Heterodonta species in coastal waters. ω0 and ω1 represent the values for the other Heterodonta clades in the phylogeny and the vesicomyid clades, respectively. Pairwise models were compared with critical values of the Chi square (χ2) distribution using likelihood ratio tests (LRTs), in which the test statistic was estimated as twice the difference in log likelihood (2ΔL) and the degrees of freedom were estimated as the difference in the number of parameters for each model. Furthermore, we fit branch-site models to examine positive selection on some sites among the vesicomyid clades. Branch-site models allow ω to vary both among sites in the protein and across branches on the tree. Branch-site model A (positive selection model) was used to identify the positively selected sites among the lineages of vesicomyids (marked as foreground lineages). The presence of sites with ω > 1 suggests that model A fits the data significantly better than the corresponding null model. Bayes Empirical Bayes analysis was used to calculate posterior probabilities in order to identify sites under positive selection on the foreground lineages if the LRTs was significant [56].

Results and discussion

C. marissinica mitogenome organization

The Illumina HiSeq runs resulted in 20,359,890 paired-end reads from the C. marissinica library with an insert size of approximately 450 bp. Selective-assembly analysis showed that 312,068 reads were assembled into a 17,374 bp circular molecule, which represented the complete mitogenome of C. marissinica (Fig 1 and Table 1). This length is shorter than that of the complete mitogenome of other vesicomyid bivalves, which ranges from 19,738 bp in Calyptogena magnifica [33] to 19,424 bp in Abyssogena phaseoliformis [35]. The genome encodes 37 genes, including 13 PCGs, 2 rRNA genes, and 22 tRNA genes (duplication of tRNA and tRNA). All of the genes are encoded on the heavy (H) strand, as consistently reported for other bivalves [35-36,56], and transcribed in the same direction. A total of 2,287 bp of noncoding nucleotides are scattered among 23 intergenic regions varying from 1 to 1,676 bp in length (Table 1). The largest noncoding region (1,676 bp) is located between tRNA and nad6 and is identified as the putative control region (CR) due to its location and high A+T content (73.3%). Furthermore, there are four overlaps between adjacent genes in the C. marissinica mitogenome with a size range of 1 to 5 bp (tRNA / tRNA, tRNA / nad1, rrnS / tRNA, and cox3 / tRNA).
Fig 1

Complete mitogenome map of C. marissinica.

All 37 genes are encoded on the heavy (H) strand. Genes for proteins and rRNAs are shown with standard abbreviations. Genes for tRNAs are displayed by a single letter for the corresponding amino acid, with two leucine tRNAs and two serine tRNAs differentiated by numerals.

Table 1

Mitogenome organization of C. marissinica.

NameStrandRangeSizeCodonIntergenic nucleotides
NucleotidesAmino acidsStartStopAnticodon
cox1+1–18331833610ATGTAA-
tRNA-Pro+1854–191764TGG20
cox2+1918–29341017338ATGTAA0
tRNA-Arg+2941–300565TCG6
cob+3010–41431134377ATGTAA4
rrnL+4305–53401036161
atp8+5377–549311738TAG36
nad4+5506–68521347448ATGTAA12
tRNA-His+6873–693361GTG20
tRNA-Glu+6934–699966TTC0
tRNA-SerUCA+6996–705964TGA-4
atp6+7060–7773714237ATGTAA0
nad3+7872–8192321106ATTTAA98
tRNA-Gln+8203–826967TTG10
tRNA-Ile+8272–833867GAT2
tRNA-Lys+8339–840567TTT0
tRNA-LeuUUA+8407–846963TAA1
nad1+8467–9381915304ATATAG-3
tRNA-Val+9399–946062TAC17
tRNA-Asn+9461–952262GTT0
nad5+9550–112231674557ATATAA27
tRNA-LeuCUA+11236–1129762TAG12
tRNA-Trp+11298–1136265TCA0
contral region+11363–1303816760
nad6+13039–13554516171ATTTAA0
nad4l+13595–1384324982ATTTAA40
tRNA-Gly+13844–1390764TCC0
nad2+13925–150101086361ATTTAG17
tRNA-Asp+15020–1508162GTC9
tRNA-Thr+15082–1514261TGT0
rrnS+15160–1602786817
tRNA-Met+16023–1608967CAT-5
tRNA-Cys+16092–1615362GCA2
tRNA-Tyr+16159–1622062GTA5
tRNA-SerAGA+16228–1629669TCT7
cox3+16297–17148852283ATGTAG0
tRNA-Phe+17148–1721063GAA-1
tRNA-Ala+17228–1729366TGC17

Complete mitogenome map of C. marissinica.

All 37 genes are encoded on the heavy (H) strand. Genes for proteins and rRNAs are shown with standard abbreviations. Genes for tRNAs are displayed by a single letter for the corresponding amino acid, with two leucine tRNAs and two serine tRNAs differentiated by numerals. The C. marissinica mitogenome has a nucleotide composition of 25.9% A, 10.8% C, 23.8% G, and 39.5% T and an overall AT content of 65.4%. The AT skew and GC skew are well conserved among vesicomyids, which vary from -0.165 to -0.230 and 0.343 to 0.440, respectively (Table 2). For the C. marissinica mitogenome, the AT skew is -0.209, and the GC skew is 0.375, which indicates bias toward T and G similar to that in other vesicomyids. The complete mitochondrial DNA sequence has been deposited in GenBank under accession number MK948426.
Table 2

Mitogenomes of Vesicomyidae species sequenced to date and their genomic features.

SpeciesGenusAccession numberLength(bp)GenomeProtein-coding genesrrnLrrnStRNAsControl region
AT%ATskewGCskewLength(aa)AT%(all)AT%(3rd)Length(bp)AT%Length(bp)AT%Number/Length(bp)AT%Length(bp)AT%
Abyssogena mariana1AbyssogenaLC12631115,927*69.8-0.2100.408388469.073.9119671.286270.823/127971.7--
Abyssogena phaseoliformisAbyssogenaAP01455719,42470.4-0.1990.440388168.373.7119671.286270.924/128270.43,43874.4
Akebiconcha kawamurai2AkebiconchaAP01455112,946*65.2-0.2220.371324962.568.7119469.720567.717/109069.0--
Archivesica gigas1ArchivesicaMF95962315,674*65.0-0.2280.389387863.670.6122368.987967.921/121268.7--
Archivesica pacifica1ArchivesicaMF95962417,782*68.6-0.2140.429400267.179.5122671.388569.922/128269.6--
Archivesica sp.1ArchivesicaMF95962215,650*64.8-0.2280.386388963.770.8122169.087967.820/121468.7--
Calyptogena fausta2CalyptogenaAP01454913,509*66.0-0.2180.39434100.6464.770.7118970.320567.817/109270.0--
Calyptogena laubirei2CalyptogenaAP01455312,968*64.3-0.2260.361325961.667.1119169.020467.217/109067.8--
Calyptogena magnificaCalyptogenaNC_02872419,73868.4-0.1950.390392865.575.6121970.593570.022/134770.23,91075.2
Calyptogena marissinicaCalyptogenaMK94842617,37465.4-0.2090.375391263.269.71,03667.286867.722/1,41168.71,67673.3
Calyptogena nautilei2CalyptogenaAP01455413,281*69.4-0.1960.353329868.074.4118272.120472.117/108871.4--
Calyptogena pacifica2CalyptogenaAP01455613,454*67.6-0.2220.420339066.672.6119570.820469.617/108869.5--
Isorropodon fossajaponicum1IsorropodonAP01455019,556*68.2-0.1650.343389466.670.6119970.386168.324/129070.3--
Phreagena kilmeri2PhreagenaAP01455212,944*64.9-0.2230.365324963.468.0119169.520468.317/108968.7--
Phreagena okutanii1PhreagenaAP01455516,336*65.6-0.2300.405383364.068.1119169.786167.723/127768.9--
Phreagena soyoae2PhreagenaAP01455812,941*64.9-0.2230.365324963.467.9119069.520468.117/108968.7--
Pliocardia stearnsii2PliocardiaAP01455913,012*67.7-0.2300.402326566.672.7120171.620369.517/109669.1--

Note:

*indicates incomplete mitogenomes.

1 Incomplete mitogenomes for which the control region was not sequenced.

2 Incomplete mitogenomes for which nad2, nad4l, nad6, a few tRNA genes and the control region were not sequenced.

Note: *indicates incomplete mitogenomes. 1 Incomplete mitogenomes for which the control region was not sequenced. 2 Incomplete mitogenomes for which nad2, nad4l, nad6, a few tRNA genes and the control region were not sequenced.

Protein-coding genes

The total length of all 13 PCGs of C. marissinica is 11,775 bp, accounting for 67.8% of the complete length of the mitogenome, and the PCGs encode 3,912 amino acids (Table 2). In the mitogenome of metazoans, most PCGs start with the standard ATN codon [2,57-58]. In the present study, with the exception of the atp8 gene, which had the alternate initiation codon GTG, all the PCGs were initiated by typical ATN codons: 6 genes (atp6, cob, cox1, cox2, cox3, and nad4) were initiated by ATG, 4 genes (nad2, nad3, nad4l, and nad6) were initiated by ATT, and 2 genes (nad1 and nad5) were initiated by ATA. Notably, genes are commonly initiated by GTG in vesicomyid bivalves [34], and the amino acid encoded by GTG is valine, which belongs to the nonpolar amino acids, such as methionine and isoleucine encoded by ATN. Moreover, in eight other vesicomyid bivalves (Archivesica sp., Archivesica gigas, Archivesica pacifica, C. magnifica, Abyssogena mariana, Ab. phaseoliformis, Isorropodon fossajaponicum, and Phreagena okutanii), cox3 had a truncated termination codon, TA [34]. Previous studies have shown that the truncated stop codon is common in the metazoan mitogenome and might be corrected by posttranscriptional polyadenylation [59-60]. However, in the mitogenome of C. marissinica, all of the PCGs were ended by a complete TAA (atp6, cob, cox1, cox2, nad3, nad4, nad4l, nad5, and nad6) or TAG (atp8, cox3, nad1, and nad2) termination codon. Numerous studies have indicated that metazoan mitogenomes usually have a bias toward a higher representation of nucleotides A and T, leading to a subsequent bias in the corresponding encoded amino acids [58,61-63]. In the mitogenome of C. marissinica, the A+T contents of PCGs and third codon positions are 63.2% and 69.7%, respectively, which are similar to the values observed in other vesicomyids (Table 2). The amino acid usage and relative synonymous codon usage (RSCU) values in the PCGs of C. marissinica are summarized in Fig 2. The mitogenome encodes a total of 3,912 amino acids, among which leucine (13.6%) and glutamine (1.4%) are the most and the least frequently used, respectively. As mentioned earlier, the amino acids encoded by A+T-rich codon families (Asn, Ile, Lys, Met, Phe and Tyr) have a higher frequency of use than those encoded by G+C-rich codon families (Ala, Arg, Gly and Pro). The RSCU values indicate that the six most commonly used codons are TTA (Leu), ACT (Thr), GGG (Gly), TCT (Ser), GCT (Ala), and CCT (Pro) (Fig 2), which show A+T bias at their third codon position. In addition, the codons with A and T in the third position are used more frequently than other synonymous codons. These features reflect codon usage with A and T biases at the third codon position, which are similar to the biases that exist in many metazoans [16,64-66].
Fig 2

Codon usage (A) and the relative synonymous codon usage (RSCU) (B) of the C. marissinica mitogenome.

Numbers to the left refer to the total number of codons (A) and the RSCU values (B). Codon families are provided on the X axis.

Codon usage (A) and the relative synonymous codon usage (RSCU) (B) of the C. marissinica mitogenome.

Numbers to the left refer to the total number of codons (A) and the RSCU values (B). Codon families are provided on the X axis.

Ribosomal RNA and transfer RNA genes

The rrnL and rrnS genes of C. marissinica are 1,036 bp (AT% = 67.2) and 868 bp (AT% = 67.7) in length, respectively. As in other vesicomyid bivalves, rrnL is located between the cob and atp8 genes, and rrnS is located between tRNA and tRNA. The largest known rrnL and rrnS genes are 1,226 bp in Ar. pacifica and 935 bp in C. magnifica, respectively [33-35]. Twenty-two tRNA genes were identified in the mitogenome of C. marissinica, which is typical for metazoans. However, the number of tRNA genes varies among other vesicomyid bivalves (Table 2). The length of tRNA genes in C. marissinica ranges from 61 (tRNA and tRNA) to 69 (tRNA) bp (Table 1), and the AT content of the tRNA genes is 68.7%. The secondary structures of tRNA genes are schematized in S1 Fig. Generally, a typical tRNA clover-leaf structure includes a 7–8 bp aminoacyl acceptor stem, a 3–5 bp TψC stem, a 5 bp anticodon stem and a 4 bp DHU stem. In the present study, most of the tRNA genes had the typical secondary structure, except for tRNA, tRNA, tRNA, tRNA and tRNA. In tRNA, tRNA and tRNA, the TψC loops are incomplete, which is not observed in other vesicomyid bivalves [34-36], and this feature might be a specific character in the C. marissinica mitogenome. In tRNA and tRNA, the DHU stems are reduced to a simple loop, as in many other bivalve mitogenomes [34,67]. Many studies have shown that the incomplete clover-leaf secondary structure of tRNA genes is common in metazoan mitogenomes and that aberrant tRNA genes can still function normally through posttranscriptional RNA editing and/or coevolved interacting factors [68-70]. Additionally, several mismatch pairs were detected within amino acid acceptors and anticodon stems in tRNA genes of C. marissinica. Such mismatches seem to be ubiquitous phenomena in the mitogenomes of many organisms and can also be corrected by posttranscriptional RNA editing [58,66,71-72].

Noncoding regions and gene arrangement

A total of 23 noncoding regions (totaling 2,216 bp) are distributed in the C. marissinica mitogenome. The longest noncoding region (1,676 bp), located between tRNA and nad6, corresponds to the control region identified in most other vesicomyids. The nucleotide content of the 1,676 bp control region is 34.25% A, 39.02% T, 16.29% G, and 10.44% C. The A + T content (73.27%) of this region is higher than that of other regions in the C. marissinica mitogenome (Table 2). In general, the mitochondrial control region is subjected to less evolutionary pressure than PCGs and thus has the highest variation in the whole mitogenome [73-74]. Additionally, in the mitochondrial control region of C. marissinica, we found a tandemly arranged repeated sequence, which was 354 bp in length (positions 12,675–13,028), including three identical tandem repeat units of 118 bp (Fig 3). The tandem repeat sequence could be folded into stem-loop secondary structures with minimized free energy (Fig 3), which is a common phenomenon in invertebrates [16,63,66,75]. The control region in the mitogenome is essential for transcription and replication in animals [76-77]. Therefore, the stem-loop structures mentioned above may play an important role in gene replication and regulation. In addition, some other peculiar patterns, such as special “G(A)nT” motifs and AT-rich sequences, were observed in the control region of the C. marissinica mitogenome (Fig 3). Furthermore, similar characteristics (e.g., repetitive elements, G(A)nT motifs and AT-rich sequences) were also observed in the deep-sea anemone Bolocera sp., alvinocaridid shrimp Shinkaicaris leurokolos and spongicolid shrimp Spongiocaris panglao [16,63,66]. In view of the particularity of the deep-sea chemosynthetic environment, we speculate that these special control region elements are involved in the regulation of replication and transcription of the mitogenome and may play some role in helping organisms adapt to extreme deep-sea habitats.
Fig 3

Nucleotide sequences and stem-loop structures of the tandem repeat motifs in the control region (CR) of the C. marissinica mitogenome.

The CR is flanked by sequences encoding tRNA and nad6. The CR consists of certain patterns, such as special G(A)nT motifs (marked with a box), AT-rich regions and tandem repeat motifs.

Nucleotide sequences and stem-loop structures of the tandem repeat motifs in the control region (CR) of the C. marissinica mitogenome.

The CR is flanked by sequences encoding tRNA and nad6. The CR consists of certain patterns, such as special G(A)nT motifs (marked with a box), AT-rich regions and tandem repeat motifs. In contrast to other metazoans, the Mollusca showed frequent and extensive variation in gene arrangement, and among them, bivalves showed more gene order variation in their mitogenomes [78-80]. Here, a comparison of the C. marissinica mitogenome with the other twelve Heterodonta mitogenomes is shown in Fig 4. All thirteen Heterodonta mitogenomes come from two orders (five families): Myoida (family Hiatellidae) and Veneroida (family Tellinidae, family Mactridae, family Veneridae and family Vesicomyidae). Among the Heterodonta mitogenomes analyzed in the present study, the gene arrangement has a distinct difference between the family Vesicomyidae and other species (Fig 4). In the family Vesicomyidae, we found that if the tRNA genes are not considered, the nine vesicomyid bivalves have a completely identical gene arrangement of PCGs. When compared to the “standard” mitogenome of Ar. pacifica, C. magnifica and C. marissinica, several additional tRNA genes were identified in Ab. mariana (tRNA), Ab. phaseoliformis (tRNA and tRNA), I. fossajaponicum (tRNA and tRNA) and Ph. okutanii (tRNA) (Fig 4). As a general rule, additional gene copies usually obtained by gene replication and different gene copies would share some sequence identity with each other. However, analysis showed that the aforementioned additional tRNA genes have low similarity to other tRNA genes that encode the same tRNAs [34]. The remolding of tRNA genes, DNA shuffling and the point mutations in the anticodons may all provide chances for tRNA gene rearrangement within mitogenomes [3,81-82]. Furthermore, gene rearrangements usually occurred around the control regions, which are considered the replication origins. Perhaps gene replication events occur frequently in this region, and consequently, more novel gene arrangements will be found in this region. To date, there are four known mechanisms of gene rearrangements in mitogenomes: inversion, transposition, reverse transposition and tandem duplication-random losses (TDRLs) [83-84]. However, the specific mechanism of significant differences in mitochondrial gene arrangements in mollusks has not been completely clarified. With the determination of mitogenomes in more species of this phylum, the mechanism of large-scale rearrangement of mitochondrial genes in mollusks will be identified by further comparing and summarizing the rules of gene arrangement among different species.
Fig 4

Mitochondrial gene arrangement of 13 species in the subclass Heterodonta (Panopea generosa, Moerella iridescens, Coelomactra antiquata, Meretrix meretrix and 9 vesicomyid bivalves).

CR indicates the control region. Genes for tRNAs are displayed by a single letter for the corresponding amino acid, with two leucine tRNAs and two serine tRNAs differentiated by numerals. Uniquely derived gene positions of individual species are depicted in red. Sequence segments are not drawn to scale.

Mitochondrial gene arrangement of 13 species in the subclass Heterodonta (Panopea generosa, Moerella iridescens, Coelomactra antiquata, Meretrix meretrix and 9 vesicomyid bivalves).

CR indicates the control region. Genes for tRNAs are displayed by a single letter for the corresponding amino acid, with two leucine tRNAs and two serine tRNAs differentiated by numerals. Uniquely derived gene positions of individual species are depicted in red. Sequence segments are not drawn to scale.

Phylogenetic relationships

Since several vesicomyid bivalves have incomplete mitogenomes at present, phylogenetic analyses were performed based on nucleotide and amino acid sequences of 9 mitochondrial PCGs (atp6, cox1, cox2, cox3, cob, nad1, nad3, nad4, and nad5) and 2 rRNA genes by maximum likelihood (ML) and Bayesian inference (BI) methods (Fig 5, S2–S4 Figs). The tree topologies resulting from the BI and ML analyses were not the same. There are two potential reasons for this discrepancy: one is that the presence of noncoding rRNA genes made the databases of nucleotides and amino acids different, and the other is the fact that several clades are represented by only one or two species each. The phylogenetic analyses clustered C. marissinica with the previously reported vesicomyid bivalves with high support values (Fig 5). In all phylogenetic trees, the family Vesicomyidae first clustered well with Veneridae and then united with Mactridae, which corroborates earlier studies of phylogenetic relationships based on the concatenated 12 PCGs and 2 rRNA genes [34-36]. Calyptogena (sensu lato) is the most diverse group of deep-sea vesicomyid bivalves in the western Pacific region and its marginal seas. Until now, the composition, evolutionary position and level of the genus Calyptogena have been the subject of discussion [85-87]. Phylogenetic reconstruction using the cytochrome oxidase c subunit I (cox1) gene showed that C. marissinica was clearly nested within a fully supported monophyletic clade corresponding to Calyptogena sensu lato and consisting of all included Calyptogena (sensu lato) species [32]. Notably, in our studies, C. marissinica showed a close genetic relationship with the Akebiconcha species (Fig 5). Therefore, additional mitogenomes of a greater number of vesicomyid bivalves, combined with morphological characters, are necessary to determine the phylogenetic relationships among members of this family.
Fig 5

Phylogenetic tree derived from Bayesian analyses based on concatenated nucleotide sequences of 9 mitochondrial PCGs (cox1, cox2, cox3, cob, atp6, nad1, nad4, nad5, and nad6) and 2 ribosomal RNA genes (rrnS and rrnL).

Numbers on branches are Bayesian posterior probabilities (percent). Two Pectinidae species belonging to the subclass Pteriomorphia were used as outgroups.

Phylogenetic tree derived from Bayesian analyses based on concatenated nucleotide sequences of 9 mitochondrial PCGs (cox1, cox2, cox3, cob, atp6, nad1, nad4, nad5, and nad6) and 2 ribosomal RNA genes (rrnS and rrnL).

Numbers on branches are Bayesian posterior probabilities (percent). Two Pectinidae species belonging to the subclass Pteriomorphia were used as outgroups. Purifying selection is the predominant force in the evolution of mitogenomes, but because mitochondria are the main sites of aerobic respiration and are essential for energy metabolism, weak and/or episodic positive selection may occur against this background of strong purifying selection under reduced oxygen availability or greater energy requirements [88-89]. As proven by many studies, mitochondrial PCGs underwent positive selection in animals that survived in hypoxic environments or had higher energy demands for locomotion, such as Tibetan humans, Ordovician bivalves, diving cetaceans and flying insects [19, 90–92]. Considering that the deep-sea chemosynthetic ecosystemsmay impact the function of mitochondrial genes, we examined potential positive selection in the Vesicomyidae lineage using CodeML from the PAML package. Although different tree-building methods were used, the results of positive selection analyses were generally consistent (Table 3). In the analysis of branch models, the ω (dN/dS) ratio calculated under the one-ratio model (M0) was 0.02272 for the 13 mitochondrial PCGs of sampled Heterodonta bivalves, suggesting that these genes have experienced constrained selection pressure to maintain function. Then, in the comparison of the one-ratio model (M0) and the free-ratio model (M1), the LRTs indicated that the free-ratio model fit the data better than the one-ratio model (Table 3), which means that the ω ratios are indeed different among lineages. Furthermore, the two-ratio model (M2) was also found to fit the data better than the one-ratio model (Table 3) when the family Vesicomyidae was set as a foreground branch. The ω ratio of the Vesicomyidae branch was almost treble that of other branches (ω1 = 0.06398 and ω0 = 0.02278), indicating divergence in selection pressure between vesicomyid bivalves and other shallow-sea Heterodonta species. However, the ω ratio of the family Vesicomyidae (ω1 = 0.06398) was still significantly less than 1. This result is consistent with the known functional significance of mitochondria as a respiration chain necessary for electron transport and OXPHOS [93].
Table 3

CODEML analyses of selection pressure on mitochondrial genes in the Vesicomyidae lineage.

TreesModelslnLParameter estimatesModels compared2ΔLp-value
Branch models
Bayesian treeM0-203672.1001ω = 0.02272
Two-ratio-203666.6298ω0 = 0.02278 ω1 = 0.06398Two-ratio vs. M010.940720.00094
Free-ratio-202667.8186Free-ratio vs. M02008.562990.00000
ML treeM0-203672.1001ω = 0.02272
Two-ratio-203666.6298ω0 = 0.02278 ω1 = 0.06398Two-ratio vs. M010.940720.00094
Free-ratio-202747.0065Free-ratio vs. M01850.187290.00000
Branch-site models
Bayesian treeNull model-202664.0952P0 = 0.77667 P1 = 0.02562 P2a = 0.19139 P2b = 0.00631
ω0 = 0.02110 ω1 = 1.00000 ω2a = 1.00000 ω2b = 1.00000
Model A-202663.7531P0 = 0.78464 P1 = 0.02590 P2a = 0.18340 P2b = 0.00605Model A vs. Null model0.684220.40814
ω0 = 0.02115 ω1 = 1.00000 ω2a = 1.21998 ω2b = 1.21998
ML treeNull model-202664.0952P0 = 0.77667 P1 = 0.02562 P2a = 0.19140 P2b = 0.00631
ω0 = 0.02110 ω1 = 1.00000 ω2a = 1.00000 ω2b = 1.00000
Model A-202663.7531P0 = 0.77667 P1 = 0.02562 P2a = 0.19139 P2b = 0.00631Model A vs. Null model0.684220.40814
ω0 = 0.02115 ω1 = 1.00000 ω2a = 1.22003 ω2b = 1.22003
Moreover, many studies have shown that positive selection often occurs over a short period of evolutionary time and acts on only a few sites; thus, the signal for positive selection is usually swamped by those for continuous purifying selection that occur on most sites in a gene sequence [88,94]. In the present study, branch-site models were used to detect possible positively selected sites in the vesicomyid bivalves (Table 4). Ten residues, which were located in cox1, cox3, cob, nad2, nad4 and nad5, were identified as positively selected sites with high BEB values (> 95%).
Table 4

Possible sites under positive selection in the Vesicomyidae lineage.

Bayesian treeML tree
GeneCodonAmino acidBEB valuesGeneCodonAmino acidBEB values
cox1529W0.967cox1529W0.967
cox3998G0.956cox3998G0.956
1018W0.9621018W0.962
1021T0.9541021T0.954
cob1131K0.982cob1131K0.982
1432A0.9591432A0.959
nad22043K0.960nad22043K0.960
nad42388E0.951nad42388E0.951
nad52734P0.975nad52734P0.975
2773S0.9512773S0.951
It is well known that mitochondrial PCGs play a key role in the oxidative phosphorylation pathway; the above ten amino acid mutation sites are components of the respiratory chain and therefore may have important functions. As the first and the largest enzyme complex in the respiratory chain, the NADH dehydrogenase complex exercises the functions of proton pumps, and variation in loci may interfere with the efficiency of the proton-pumping process, which then affect metabolic performance [91]. In this work, there were four positively selected sites located in the nad2, nad4 and nad5 genes (subunits 2, 4 and 5 of NADH deshydrogenase). Under the deep-sea chemosynthetic enviroment, survival may require a modified and adapted energy metabolism, and evidences of adaptive evolution in the NADH dehydrogenase complex have been reported in mitogenome of Tibetan horses, Chinese snub-nosed monkeys and Tibetan loaches [20,22,25]. Two residues in the cob gene were identified to be under positive selection. As a relatively conserved gene, cob plays a fundamental role in energy production in mitochondria. It catalyzes reversible electron transfer from ubiquinol to cytochrome c coupled to proton translocation [95]. Wide variation in the properties of amino acids was observed in functionally important regions of cob in species with more specialized metabolic requirements, such as adaptation to a low-energy diet or large body size and adaptation to unusual oxygen requirements or low-temperature environments [91,96]. Cytochrome c oxidase, which catalyzes the terminal reduction of oxygen and whose catalytic core is encoded by three mitochondrial protein-coding genes (cox1, cox2 and cox3), has been proven to be a particularly important target of positive selection during hypoxia adaptation [97-98]. Studies have shown that when cells of the organisms are exposed to anoxia, the cytochrome c oxidase are required for the produced reactive oxygen species (ROS), and the increase in ROS concentration serves to stabilize Hif-1α, which then leads to the induction of Hif-1-dependent nuclear hypoxic genes [99-100]. Four positively selected residues were detected in the cox1 and cox3 genes. For C. marissinica, functional modification mediated by positively selected mutations may increase the affinity between the enzyme and oxygen, thus allowing the efficient utilization of oxygen under hypoxia and maintaining essential metabolic levels. The environment of deep-sea hydrothermal vents and cold seeps is characterized by darkness, a lack of photosynthesis-derived nutrients, high hydrostatic pressure, variable temperatures, low dissolved oxygen, and high concentrations of hydrogen sulfide (H2S), methane (CH4) and heavy metals, such as iron, copper and zinc [9, 101]. Previous studies have confirmed that all of the above environmental factors influence the process of mitochondrial aerobic respiration; for example, thirty potentially important adaptive residues were identified in the mitogenome of S. leurokolos and revealed the mitochondrial genetic basis of hydrothermal vent adaptation in alvinocaridid shrimp [16]. Similar findings have been reported in other deep-sea macrobenthos, such as the sea anemone Bolocera sp., starfish Freyastera benthophila and sea cucumber Benthodytes marianensis [66,102-103]. In the present study, ten potentially adaptive residues were identified in the cox1, cox3, cob, nad2, nad4 and nad5 genes, supporting the adaptive evolution of the mitogenome of C. marissinica. Our results establishes a necessary foundation to understand how the deep-sea vesicomyid bivalves maintain aerobic respiration for sufficient energy in harsh deep-sea chemosynthetic environment at the mitochondrial level.

Conclusion

This study characterized the complete mitogenome of the deep-sea vesicomyid bivalve C. marissinica, which is 17,374 bp in length and encodes 37 typical mitochondrial genes, including 13 PCGs, 2 rRNA genes, and 22 tRNA genes. All of these genes are encoded on the heavy strand. We analyzed the mitogenome organization, codon usage, control region features, gene arrangement, phylogenetic relationships and positive selection of C. marissinica. In the mitogenome of C. marissinica, tandem repeat sequences, “G(A)nT” motifs and AT-rich sequences were detected. In the family Vesicomyidae, we found that if the tRNA genes are not considered, the sequenced vesicomyid bivalves have a completely identical arrangement of PCGs. The phylogenetic analyses clustered C. marissinica with previously reported vesicomyid bivalves with high support values. Ten residues located in cox1, cox3, cob, nad2, nad4 and nad5 were inferred to be positively selected sites along the branches leading to vesicomyid bivalves, which may indicate that the genes were under positive selection pressure. The findings of this study could help deepen our understanding of the molecular mechanisms of adaptive evolution at the mitochondrial level in deep-sea organisms.

Species used for phylogenetic reconstructions.

(DOCX) Click here for additional data file.

Species used for CodeML analyses of selective pressure on mitochondrial genes.

(DOCX) Click here for additional data file.

Putative secondary structures for the 22 transfer RNAs of the C. marissinica mitogenome.

(TIF) Click here for additional data file.

Phylogenetic trees derived from ML analyses based on nucleotide sequences of 9 mitochondrial protein-coding genes and 2 ribosomal RNA genes.

(TIF) Click here for additional data file.

Phylogenetic trees derived from Bayesian analyses based on amino acid sequences of 9 mitochondrial protein-coding genes.

(TIF) Click here for additional data file.

Phylogenetic trees derived from ML analyses based on amino acid sequences of 9 mitochondrial protein-coding genes.

(TIF) Click here for additional data file.
  89 in total

1.  Codon-substitution models for heterogeneous selection pressure at amino acid sites.

Authors:  Z Yang; R Nielsen; N Goldman; A M Pedersen
Journal:  Genetics       Date:  2000-05       Impact factor: 4.562

2.  A novel type of RNA editing occurs in the mitochondrial tRNAs of the centipede Lithobius forficatus.

Authors:  D V Lavrov; W M Brown; J L Boore
Journal:  Proc Natl Acad Sci U S A       Date:  2000-12-05       Impact factor: 11.205

Review 3.  Evolution and biogeography of deep-sea vent and seep invertebrates.

Authors:  C L Van Dover; C R German; K G Speer; L M Parson; R C Vrijenhoek
Journal:  Science       Date:  2002-02-15       Impact factor: 47.728

4.  The minimal tRNA: unique structure of Ascaris suum mitochondrial tRNA(Ser)(UCU) having a short T arm and lacking the entire D arm.

Authors:  Takashi Ohtsuki; Gota Kawai; Kimitsuna Watanabe
Journal:  FEBS Lett       Date:  2002-03-06       Impact factor: 4.124

Review 5.  Animal mitochondrial genomes.

Authors:  J L Boore
Journal:  Nucleic Acids Res       Date:  1999-04-15       Impact factor: 16.971

6.  Biological identifications through DNA barcodes.

Authors:  Paul D N Hebert; Alina Cywinska; Shelley L Ball; Jeremy R deWaard
Journal:  Proc Biol Sci       Date:  2003-02-07       Impact factor: 5.349

7.  Mfold web server for nucleic acid folding and hybridization prediction.

Authors:  Michael Zuker
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

8.  Complete mtDNA sequence of the North American freshwater mussel, Lampsilis ornata (Unionidae): an examination of the evolution and phylogenetic utility of mitochondrial genome organization in Bivalvia (Mollusca).

Authors:  Jeanne M Serb; Charles Lydeard
Journal:  Mol Biol Evol       Date:  2003-08-29       Impact factor: 16.240

9.  Exposure of yeast cells to anoxia induces transient oxidative stress. Implications for the induction of hypoxic genes.

Authors:  Reinhard Dirmeier; Kristin M O'Brien; Marcella Engle; Athena Dodd; Erick Spears; Robert O Poyton
Journal:  J Biol Chem       Date:  2002-06-27       Impact factor: 5.157

Review 10.  Replication and transcription of mammalian mitochondrial DNA.

Authors:  Patricio Fernández-Silva; José A Enriquez; Julio Montoya
Journal:  Exp Physiol       Date:  2003-01       Impact factor: 2.969

View more
  7 in total

1.  Divergent paths in the evolutionary history of maternally transmitted clam symbionts.

Authors:  Maëva Perez; Corinna Breusing; Bernard Angers; Roxanne A Beinart; Yong-Jin Won; C Robert Young
Journal:  Proc Biol Sci       Date:  2022-03-09       Impact factor: 5.349

2.  The complete mitochondrial genome of a cold seep gastropod Phymorhynchus buccinoides (Neogastropoda: Conoidea: Raphitomidae).

Authors:  Lvpei Du; Shanya Cai; Jun Liu; Ruoyu Liu; Haibin Zhang
Journal:  PLoS One       Date:  2020-11-30       Impact factor: 3.240

3.  Phylogenetic Relationships and Adaptation in Deep-Sea Mussels: Insights from Mitochondrial Genomes.

Authors:  Kai Zhang; Jin Sun; Ting Xu; Jian-Wen Qiu; Pei-Yuan Qian
Journal:  Int J Mol Sci       Date:  2021-02-14       Impact factor: 5.923

4.  Comparative Analysis of the Mitochondrial Genome of Galatheanthemum sp. MT-2020 (Actiniaria Galatheanthemidae) From a Depth of 9,462 m at the Mariana Trench.

Authors:  Mengke Shi; Li Qi; Li-Sheng He
Journal:  Front Genet       Date:  2022-06-08       Impact factor: 4.772

5.  The First Complete Mitochondrial Genome of Eucrate crenata (Decapoda: Brachyura: Goneplacidae) and Phylogenetic Relationships within Infraorder Brachyura.

Authors:  Xiaoke Pang; Chenglong Han; Biao Guo; Kefeng Liu; Xiaolong Lin; Xueqiang Lu
Journal:  Genes (Basel)       Date:  2022-06-23       Impact factor: 4.141

6.  The complete mitogenome of Phymorhynchus sp. (Neogastropoda, Conoidea, Raphitomidae) provides insights into the deep-sea adaptive evolution of Conoidea.

Authors:  Mei Yang; Dong Dong; Xinzheng Li
Journal:  Ecol Evol       Date:  2021-05-02       Impact factor: 2.912

Review 7.  Mitochondrial Short-Term Plastic Responses and Long-Term Evolutionary Dynamics in Animal Species.

Authors:  Sophie Breton; Fabrizio Ghiselli; Liliana Milani
Journal:  Genome Biol Evol       Date:  2021-07-06       Impact factor: 3.416

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.