Literature DB >> 30463257

Comparative Characterization of the Complete Mitochondrial Genomes of the Three Apple Snails (Gastropoda: Ampullariidae) and the Phylogenetic Analyses.

Huirong Yang1,2, Jia-En Zhang3, Jun Xia4,5, Jinzeng Yang6, Jing Guo7,8, Zhixin Deng9,10, Mingzhu Luo11,12.   

Abstract

The apple snails Pomacea canaliculata, Pomacea diffusa and Pomacea maculate (Gastropoda: Caenogastropoda: Ampullariidae) are invasive pests causing massive economic losses and ecological damage. We sequenced and characterized the complete mitochondrial genomes of these snails to conduct phylogenetic analyses based on comparisons with the mitochondrial protein coding sequences of 47 Caenogastropoda species. The gene arrangements, distribution and content were canonically identical and consistent with typical Mollusca except for the tRNA-Gln absent in P. diffusa. An identifiable control region (d-loop) was absent. Bayesian phylogenetic analysis indicated that all the Ampullariidae species clustered on the same branch. The genus Pomacea clustered together and then with the genus Marisa. The orders Architaenioglossa and Sorbeoconcha clustered together and then with the order Hypsogastropoda. Furthermore, the intergenic and interspecific taxonomic positions were defined. Unexpectedly, Ceraesignum maximum, Dendropoma gregarium, Eualetes tulipa and Thylacodes squamigerus, traditionally classified in order Hypsogastropoda, were isolated from the order Hypsogastropoda in the most external branch of the Bayesian inference tree. The divergence times of the Caenogastropoda indicated that their evolutionary process covered four geological epochs that included the Quaternary, Neogene, Paleogene and Cretaceous periods. This study will facilitate further investigation of species identification to aid in the implementation of effective management and control strategies of these invasive species.

Entities:  

Keywords:  Ampullariidae; Pomacea canaliculata; Pomacea diffusa; Pomacea maculate; comparative characterization; mitochondrial genome; phylogenetic analysis

Mesh:

Substances:

Year:  2018        PMID: 30463257      PMCID: PMC6274680          DOI: 10.3390/ijms19113646

Source DB:  PubMed          Journal:  Int J Mol Sci        ISSN: 1422-0067            Impact factor:   5.923


1. Introduction

Apple snails (Ampullariidae) are freshwater snails distributed naturally throughout the humid tropics and subtropics and Pomacea is the largest of nine genera in the family. Pomacea spp. are indigenous to South America and were introduced from Argentina to Taiwan for commercial purposes. They have subsequently appeared in numerous countries throughout southern and eastern Asia including China. However, these animals have become pests of wetland rice and other crops causing massive economic losses. They have spread into nonagricultural wetlands and their ecological impacts in these habitats are more difficult to estimate [1,2]. P. canaliculata is listed among the world’s 100 worst invasive species by the United States Aquatic Nuisance Species Task Force (USANSTF) [3]. However, the identification and taxonomy of the genus Pomacea are difficult and confusing due to the highly conserved external morphology across the genus despite considerable intraspecific shell variation. This has obscured the true number of species and their identities [4,5,6]. Animal mitochondrial DNAs (mtDNA) are 16–19 kb circular molecules comprised of 37 genes encoding 22 transfer RNAs (tRNA), 13 proteins, two ribosomal RNAs and a control region (CR) that possesses cis regulatory elements [7,8]. The compactness, maternal inheritance, rapid evolutionary rate and short coalescence time compared to nuclear DNA have resulted in the use of mtDNA sequence information for phylogenic studies, taxonomic resolution [9] and population genetics [10]. Currently, P. canaliculata, Pomacea maculate (formerly Pomacea insularum) and Pomacea diffusa are the most widespread species in China. Most studies of the genus Pomacea actually refer to P. canaliculata. This species has been defined in terms of its life history, reproduction and as an intermediate host for the rat lungworm Angiostrongylus cantonensis. Control and management strategies have been largely based on this species [11,12,13,14]. In contrast, there are few studies of P. maculata and P. diffusa [15,16,17,18,19,20]. The Caenogastropoda is the most diverse clade of the Gastropoda and their evolution and population biology are under active investigation. However, phylogenetic analyses based on mtDNA genome have not been completely investigated and only preliminary and limited mtDNA information is available for the apple snails [20,21,22,23,24]. The complete mtDNA genome sequences of the family Ampullariidae are currently under investigation [20,23,25]. We sequenced complete mt genomes of the three apple snails P. canaliculata, P. maculata and P. diffusa using next-generation sequencing (NGS) technology to analyze their phylogeny and evolutionary histories. The identification of new DNA markers and taxonomic viewpoints may facilitate further species identification and assist the development of more effective management and control strategies of these invasive species.

2. Results and Discussion

2.1. Genome Organization and Composition

The complete mt genomes of P. canaliculata, P. diffusa and P. maculate were 16,965, 16,640 and 15,516 bp in length. These contained the canonical 13 PCGs (Protein Coding Genes), two rRNA genes (rrnl and rrns) and 22 tRNA genes with the exception of tRNA-Gln that was absent in P. diffusa. Most of the 37 genes are encoded on the heavy strand (H-strand) as in most Mollusca. However, seven tRNA genes (Met, Tyr, Cys, Trp, Gln, Gly and Glu) from all three species as well as tRNA-Val of P. diffusa and tRNA-Thr of P. diffusa and P. maculate are present on the light strand (L-strand). The G + C content varied from 14.71% to 41.18% (Table 1). The large and small subunit rRNA genes rrnl and rrns were located between tRNA-Leu (TAG) and tRNA-Glu (TTG) genes and separated by the tRNA-Val (TCG) gene (Figure 1). The absence of a specific and identifiable control region is consistent with other Gastropoda [26,27,28,29].
Table 1

Summary of gene/element feature of P. canaliculata, P. diffusa and P. maculate.

NO.Gene/ElementStrandSize (bp)GC-Content (%)Amino Acids (aa)Inferred Initiation CodonInferred Termination CodonOne Letter CodeAnti-CodonIntergenic Nucleotide*(bp)
1 COXI H153633.27–36.07511ATGTAG/TAA 27–32
2 COXII H68731.3–34.79228ATGTAG/TAA 17–21
3 tRNA-Asp H6814.71–17.65 DGTC0
4 ATP8 H15921.38–26.4252ATGTAA 12–29
5 ATP6 H699–71426.90–29.41232–237ATGTAG/TAA 15–28
6 tRNA-Met L64–6627.69–34.38 MCAT23–35
7 tRNA-Tyr L65–6831.34–41.18 YGTA8–25
8 tRNA-Cys L62–6521.54–29.03 CGCA5–19
9 tRNA-Trp L64–6827.94–35.38 WTCA11–119
10 tRNA-Gln L62–6434.38–35.48 QTTG9–10
11 tRNA-Gly L66–6719.40–20.9 GTCC24–31
12 tRNA-Glu L65–6720.90–24.62 ETTC0
13 rrns H929–95028.09–29.68 0–2
14 tRNA-Val H/L67–6920.90–29.41 VTAC0–1
15 rrnl H1331–134525.47–27.14 0
16 tRNA-Leu H64–6623.44–27.27 LTAG(−1)–0
17 tRNA-Leu H68–6923.53–26.09 LTAA0
18 ND1 H945–96028.02–31.96314–319ATGTAG/TAA (−12)–9
19 tRNA-Pro H66–6829.85–30.88 PTGG0
20 ND6 H49523.64–26.87164ATGTAA 10–19
21 Cytb H114028.51–32.02379ATGTAG/TAA 9–17
22 tRNA-Ser H6633.33–34.85 STGA20–30
23 tRNA-Thr H/L68–7130.99–35.29 TTGT9–14
24 ND4L H297–31226.92–27.9598–103 ATA, ATGTAG/TAA (−7)–20
25 ND4 H1341–136225.73–29.30446–453ATA, ATGTAA 3–9
26 tRNA-His H64–6516.92–21.88 HGTG0
27 ND5 H1710–172828.13–31.42569–575ATGTAA (−20)–0
28 tRNA-Phe H68–6930.88–33.33 FGAA141–1617
29 COXIII H78033.72–37.69259ATA, ATGTAA 47–67
30 tRNA-Lys H67–6829.85–32.35 KTTT10–18
31 tRNA-Ala H68–7123.19–28.17 ATGC29–46
32 tRNA-Arg H67–6929.85–33.33 RTCG2–5
33 tRNA-Asn H69–7121.13–24.64 NGTT19–30
34 tRNA-Ile H70–7230.56–34.29 IGAT0–6
35 ND3 H35429.66–31.92117ATGTAA 20–37
36 tRNA-Ser H6830.88–32.35 SGCT0
37 ND2 H1062–107127.12–30.91353–356ATGTAG/TAA 4–6

Intergenic Nucleotide*(bp): positive values indicated the interval sequence of adjacent genes, and negative values indicated the overlapping of adjacent genes.

Figure 1

Gene map of the complete mt genomes for P. canaliculata (GenBank accession No. KU052865), P. diffusa (KY008698) and P. maculate (KY008699). The larger ring indicates gene arrangement and distribution. The smaller ring indicates the GC content. ND1-6: NADH dehydrogenase subunits 1–6; COXI-III: cytochrome c oxidase subunits 1–3; ATP6 and ATP8: ATPase subunits 6 and 8; Cytb: cytochrome b; rrn: ribosomal RNA genes; tRNA: transfer RNA gene.

Gene arrangement and distribution of the three apple snails were canonically identical and consistent with typical Mollusca [30,31,32,33,34,35] but obviously different from vertebrates. We identified the presence of 24, 26 and 24 intergenic spacers (IGS) for P. canaliculata; P. diffusa and P. maculate totaling 2020, 1684 and 547 bp, respectively. The largest IGS was located between tRNA-Phe―COXIII IGS on the H strand and was 1617 bp in P. canaliculata, 1073 bp in P. diffusa and 141 bp in P. maculate. These regions accounted for the shorter mitogenome in P. maculate (Table 2). MtDNA-IGS are regions of non-coding DNA between genes as well as between the tandem rRNA gene copies. The mtDNA genes in animals generally have very short spacers and are important for evolutionary studies because they change more rapidly than gene sequences. IGS regions are thought to have sequence-independent functions [36]. Additionally, a 119 bp IGS between tRNA-Trp and tRNA-Gly in P. diffusa accounted for the absence of the tRNA-Gln.
Table 2

Summary of the base composition of the mitogenomes for eight species of the orders Sorbeoconcha and Architaenioglossa in the Gastropoda.

Speics Pomacea canaliculata Pomacea diffusa Pomacea maculata Marisa cornuarietis Cipangopaludina cathayensis Turritella bacillum Tylomelania sarasinorum Semisulcospira libertina
Accession numberKU052865KY008698KY008699NC_025334.1NC_025577.1NC_029717.1NC_030263.1NC_023364.1
TaxonomyCaenogastropoda, ArchitaenioglossaCaenogastropoda, ArchitaenioglossaCaenogastropoda, ArchitaenioglossaCaenogastropoda, ArchitaenioglossaCaenogastropoda, ArchitaenioglossaCaenogastropoda, SorbeoconchaCaenogastropoda, SorbeoconchaCaenogastropoda, Sorbeoconcha
Length (bp)16,96516,64015,51615,92317,15715,86816,63215,432
A (%)32.6130.1330.8128.9426.7428.8529.4931.4
T (%)39.8539.6241.1341.0444.5135.8835.6534.76
C (%)12.0314.2412.8113.348.2816.2116.5617.78
G (%)15.5116.0215.2516.6820.4819.0618.316.06
AT (%)72.4669.7571.9469.9871.2564.7365.1466.16
AT-skew−0.0998−0.1361−0.1433−0.1729−0.2493−0.109−0.0946−0.0508
GC-skew0.12630.0590.08680.11130.42430.08090.0499−0.0507
Length (aa)37243737373337123727373737553750
AT (%) (all)70.7167.987168.9870.2963.8964.2965.66
AT (%) (3rd)72.3876.0285.6182.3479.3868.7568.5869.35
AT-skew−0.2099−0.2064−0.2022−0.2312−0.3243−0.119−0.1033−0.0642
GC-skew0.08440.04660.06970.07640.38430.08620.0696−0.0397
Length (bp)13341345133113591387135713821347
AT (%)73.5472.8674.5372.7772.1768.5366.9368.08
AT-skew0.00510.01630.006−0.0253−0.1109−0.073−0.107−0.06
GC-skew0.15010.13970.16220.23240.4456−0.002−0.1116−0.0791
Length (bp)929950934977895949899951
AT (%)71.9170.3270.9969.8166.0367.7664.2966.56
AT-skew0.0120.0210.0377−0.0264−0.0592−0.154−0.1142−0.0142
GC-skew0.2490.17730.19560.25420.4474−0.033−0.1153−0.1509
Length (bp)14731484147915001471148614821489
AT (%)71.8971.5672.0868.0772.3364.864.365.14
AT-skew−0.0293−0.0226−0.0263−0.0245−0.0602−0.09−0.0766−0.0124
GC-skew0.08210.06160.08470.08140.27760.05540.0057−0.0405
Mitochondrial genes overlap and serve to compact the mitogenomes. Each of the snails possessed five overlap sites totaling 11 bp in P. canaliculata, 27 bp in P. diffusa and 13 bp in P. maculate similar to other mitogenomes [37,38,39,40,41,42,43,44,45]. The gene overlaps we identified were present in ND1-tRNA-Pro of P. canaliculata, ND4L-ND4 and ND5-tRNA-Phe of P. diffusa, tRNA-Leu-tRNA-Leu and ND1-tRNA-Pro of P. maculate. (Table 2). Generally, we allowed gene overlaps between adjacent genes but not between protein coding genes and tRNAs. When a full termination codon (UAA or UAG) caused an overlap between a protein-coding gene and a tRNA, we preferred to annotating this gene with incomplete termination codon U/UA rather than an overlap. The incomplete termination codons were not adopted for the three apple snails. The annotation principles were determined by the annotated species available on the website data, which had close relationships with them.

2.2. Nucleotide Composition

The overall nucleotide compositions of the H-strand of the three apple snails in descending order were 40.20% T, 31.18% A, 15.59% G and 13.03% C, with an average G + C content of 28.62% indicating significant strand asymmetry or strand-specific bias commonly observed in the Mollusca [31,35,38,40]. Using the GC-/AT skew analysis, we found that the H-strand showed an over representation of T and A as compared to the L-strand. Strand asymmetry in the nucleotide composition was also reflected in the codon usage of genes oriented in opposite directions. The genes encoded on the H-strand showed clear preference for T or A to end codons, whereas G or C ending codons were over represented on the L-strand. The underlying mechanism responsible for the strand bias has been generally interpreted as evidence of an asymmetrical directional mutation pressure. This is associated with replication processes when one strand remains transiently in a single-stranded state, making it more vulnerable to DNA damage [46,47]. The highest A + T content of the genomes was detected in tRNA-Asp and was 85.29% in P. canaliculata, 82.35% in P. diffusa, and 83.82% in P. maculate. The highest G + C contents were 41.18% in the tRNA-Tyr of P. diffusa, 36.15% in the COXIII of P. canaliculata, and 35.29% in the tRNA-Thr of P. maculate. These phenomena were consistent with the findings of previous reports on other Mollusca [30,32,39,48].

2.3. Protein Coding Genes (PCG)

The 13 PCGs were represented in the apple snail mitogenomes and were H-strand encoded. These included three subunits of cytochrome c oxidase (COXI-III), seven subunits of NADH ubiquinone oxidoreductase complex (ND1-6, ND4L), a cytochrome b oxidoreductase complex (Cyt b) and two subunits of ATP synthases (ATP6 and ATP8). The latter ranged in size from 159 bp (ATP8) to 1728 bp (ND5) comprising 11,220–11,268 bp total accounting for 66.14–72.62% of the entire mitogenomes. Each PCG length was identical between the three snails with the exception of ND4L, ATP6, ND1, ND2, ND4 and ND5 that differed by 6–21 bp. The AT-skew of the 13 PCGs among the three species were all negative and the majority of the GC skew values were positive. The AT skews of COXIII and ND3 were the lowest and the GC skews of ND2 and ND3 were the highest (Figure 2).
Figure 2

Graphical illustration showing the AT- and GC-skew in the PCGs of the mitochondrial genome of P. canaliculata (PC), P. diffusa (PD) and P. maculate (PM).

We identified initiation and termination codons based on alignments with the corresponding genes of other snails. The mitogenomes of the apple snails exhibited a canonical genetic code shared by all Mollusca. An orthodox initiation codon AUG was used for most of the protein-coding genes. In contrast, AUA was used in ND4L, ND4 and COXIII of P. maculate and in ND4 and COXIII of P. canaliculata. These were also the accepted canonical mitochondrial initiation codons for Mollusca mt genomes (Table 2). Termination codons were represented by only UAA and UAG for all three apple snails and UAA was the most abundant. We did not identify any incomplete termination codons although they are common in other Gastropoda [49,50]. The rates of nonsynonymous amino acid substitutions over synonymous silent substitution (Ka/Ks) for the 13 protein-coding genes were 0.0837. These varied from 0.0065 in COXI to 0.1761 in ND2. This indicated that the functional genes evolved under strong purifying selection, which meant natural selection against deleterious mutations with negative selective coefficients [51] (Figure 3). The selection pressures differed among the genes and they likely evolved in different ways [52]. Interestingly, the ND2 genes possessed the highest ratios indicating that the selection pressures were strand independent.
Figure 3

Evolutionary rates of the mitochondrial genome of P. canaliculata (PC), P. diffusa (PD) and P. maculate (PM). The rates of non-synonymous substitutions to the rate of synonymous substitutions (Ka/Ks) for each PCG are indicated.

The conservation of mtDNA genes was evaluated based on the overall p-genetic distance among P. canaliculata (PC), P. diffusa (PD) and P. maculate (PM) (Figure 4). Comparisons of full-length PCGs indicated that COXI and COXII had the lowest overall p-genetic distances (0.122, 0.125) and ATP8 and ND6 had the highest (0.260, 0.222). Furthermore, the values based on the first and second nucleotides of codons were also consistent with those based on data of the full-length codons. This indicated that ATP8 and ND6 likely had the highest evolutionary rates while COXI and COXII had the lowest. The p-genetic distances between PD-PC and PD-PM was much more higher than that for PC-PM. This meant that the PCG sequences of P. canaliculata and P. maculate were the most similar among the three apple snails.
Figure 4

Overall mean p-genetic distances between P. canaliculata, P. diffusa and P. maculate for each of 13 PCG. Values were calculated using the first and second nucleotide positions, the total codon and over the full sequence.

2.4. Mitochondrial Gene Codon Usage

The amino acids Ser and Leu were utilized by eight and six different codons, respectively, while all other amino acids were encoded by either two or four. The frequencies in the PCGs of Phe (UUU), Leu (UUA), Ile (AUU), Met (AUA), and Ala (GCU) were the most frequently used and Stop (UAG) and Arg (CGC) were least frequently used. However, the relative synonymous codon usage (RSCU) analysis indicated that Leu (UUA), Ser (UCU) and Ala (GCU) were the most frequent while Leu (CUC, CUG), Pro (CCC, CCG), Arg (CGC) and Ala (GCG) were rare. Moreover, the GC content and the AT and GC skew of the PCGs were reflected in the codon usage and were closely related and consistent (Table 1; Figure 2) [53,54,55]. The RSCU values indicated that codons with A or U in the third position were more frequent than C or G. Therefore, codons NNA and NNU were in the majority while the synonymous codons NNC and NNG were in the minority (Figure 5).
Figure 5

Codon Frequency (A) and RSCU (B) in the mitochondrial genomes of P. canaliculata (PC), P. diffusa (PD) and P. maculate (PM). PC-A indicated the Codon Frequency of P. canaliculata; PD-A indicated the Codon Frequency of P. diffusa; PM-A indicated the Codon Frequency of P. maculate. PC-B indicated the RSCU of P. canaliculata; PD-B indicated the RSCU of P. diffusa; PM-B indicated the RSCU of P. maculate.

2.5. Transfer and Ribosomal RNA Genes

The mitogenomes of the apple snails each contained 22 tRNA genes varying from 62 bp (tRNA-Gln and tRNA-Cys) to 72 bp (tRNA-Ile) and were typical of metazoan mitogenomes. In general, there was a one-to-one correspondence of codon and anticodon. However, serine was determined by two types of anticodons (UGA, GCU) and leucine was determined by UAG and UAA in the apple snails, identical to other Mollusca [30,34]. All 22 tRNA genes (tRNA-Gln absent in P. diffusa) were predicted to be folded into canonical cloverleaf secondary structures as predicted using the tRNA Scan-SE software (http://lowelab.ucsc.edu/tRNAscan-SE/) [56], although numerous non-complementary and U-G base pairs existed in the stem regions. Stem mismatches are common for mitochondrial tRNA genes and are repaired via post-transcriptional editing [57]. tRNA genes were predicted using a Cove score cutoff of 0.1 by tRNA scan-SE v.2.0, combined with ARWEN software (www.acgt.se/online.html) and then confirmed by MitoFish (http://mitofish.aori.u-tokyo.ac.jp/) to confirm the annotation. We found high similarity in the corresponding tRNA-Gln sequences among the three snails. In addition, the secondary structure of tRNA-Gln in P. diffusa could not be predicted, but the tRNA-Gln in P. canaliculata and P. maculate could be done successfully. We are unsure whether this gene segment variation led to the unsuccessful prediction of secondary structure or was deficiency true deficiency of tRNA-Gln in P. diffusa. The mitochondrial genes of other snails have not been completely investigated and only preliminary and limited mtDNA information is available. Further research is needed to clarify this situation. We also identified H-strand large (rrnl) and small subunit (rrns) rRNA genes ranging in size from 929 to 950 bp and 1331 and 1345 bp, respectively. These were positioned between tRNA-Glu (UAG) and tRNA-Leu (UUC) tRNA-Val (UAC) and consistent with most Mollusca [31,33,48]. The AT content of rrns ranged from 70.32 and 71.91% with an AT-skew of 0.012–0.0377 and a GC-skew from 0.1773 to 0.249. The AT content of the rrnl gene ranged from 72.86 to 74.53% with an AT-skew of 0.0051–0.0163 and a GC-skew from 0.1397 to 0.1622. This nucleotide distribution differed when compared with the entire mitogenome; the protein-coding genes and tRNAs all had a T and G bias.

2.6. Mitogenome Comparisions

We compared the mitochondrial genomes of P. canaliculata with other available mitogenomes in the orders Sorbeoconcha and Architaenioglossa of the Caenogastropoda. Both the nucleotide (Figure 6) and CDS (coding DNA sequence) (Figure 7) regions were highly similar. ATP8 was the least conserved over in all species with a 59.34% similarity that was appropriate for the phylogenetic and evolutional analyses within the family Ampullariidae or genus Pomacea. On the contrary, COXI was the most conservative with the highest similarity (95.41%) and appropriate for the analyses above the family Ampullariidae (Figure 7). The average similarities for the 13 CDS genes were 93.96% (P. maculate), 82.41% (P. diffusa), 77.62% (M. cornuarietis), 68.92% (Semisulcospira libertine), 68.24% (Turritella bacillum) and 67.39% (Tylomelania sarasinorum). P. canaliculata was most closely related to P. maculate and then with P. diffusa and M. cornuarietis (Figure 8). Since nucleotide variation and diversity is greater than that for CDS regions, the mitogenome nucleotides showed less identity. This difference was also consistent when the wobble nucleotides are considered, so it was a more effective tool for phylogenetics (compare Figure 6 and Figure 7).
Figure 6

Graphical map of the BLAST results showing nucleotide identity between the complete mitochondrial genomes of P. canaliculata and species from the orders Sorbeoconcha and Architaenioglossa. BLAST results are presented in order with the sequence most similar to the reference (P. canaliculata) placed closer to the outer edge of the map. The gene regions, Blast identities and AT skew are depicted from outside to inside on the circle. The species from outside to inside are, respectively, P. canaliculata (KU052865), P. maculata (KY008699), P. diffusa (KY008698), M. cornuarietis (NC_025334), T. bacillum (NC_029717), S. libertine (NC_023364), T. sarasinorum (NC_030263) and C. cathayensis (NC_025577).

Figure 7

Graphical map of the BLAST results of mitochondrial CDS identities between P. canaliculata (NC_024586) and Sorbeoconcha and Architaenioglossa species. The sequences most similar to the reference (P. canaliculata) are placed closer to the outer edge of the map. COG (Clusters of Orthologous Groups of proteins), gene region, Blast identity and AT skew are shown from outside to inside. The species from outside to inside are named as in Figure 6.

Figure 8

Bayesian inference tree inferred from the amino acid sequences of 13 PCGs of 47 species of Caenogastropoda. Helix aspersa, NC_021747, was used as the outgroup. The numbers at the nodes represent the Bayesian posterior probabilities. Species in red indicate sequences generated in this study.

2.7. Phylogenetic Analyses

Phylogenetic analyses were carried out based on the concatenated alignment of amino acid sequences of 13 PCGs covering three orders, 18 families, 32 genera and 47 species of Caenogastropoda. We selected Helix aspersa (NC_021747) (Gastropoda, Heterobranchia) as the out-group (Table S1). Bayesian and Maximum Likelihood (ML) analyses produced almost identical topologies with similar branch lengths and strong bootstraps (ML) and posterior probabilities (Bayesian) values (Figure 8). We preferred the phylogenetic analyses based on the PCG information because the 47 Caenogastropoda species belonged to different orders i.e., Architaenioglossa, Hypsogastropoda and Sorbeoconcha and possessed wide-ranging taxonomies. Multiple amino acid sequence alignments indicated that nucleotide sequences were the more conservative and the most suitable taxonomic method [58,59,60]. In summary, whether the 13 amino acid sequences or nucleotides of the 13 PCGs was appropriate for phylogenetics was species dependent. Our phylogenetic tree analysis indicated that all of our tested Ampullariidae species clustered on the same branch and all posterior probabilities were 100. These Pomacea species were clustered together and then with the genus Marisa. The species of the orders Architaenioglossa and Sorbeoconcha clustered together and then with Hypsogastropoda. This indicated that orders Architaenioglossa and Sorbeoconcha have a closer genetic relationship to the Pomaceas. Unexpectedly, Ceraesignum maximum, Dendropoma gregarium, Eualetes tulipa and Thylacodes squamigerus traditionally classified in the order Hypsogastropoda were isolated from this order and occupied the most external branch of the Bayesian tree.

2.8. Divergence Times of the Caenogastropoda

We estimated divergence times of the Caenogastropoda based on phylogenetic trees constructed using the Bayesian method and species differentiation time using the ML method [61]. The model MtMam + I + G + F was optimal with an AIC (Akaike Information Criterion) value of 621,746.50. The model JTT + I + G + F was suboptimal with an AIC value of 614,814.31, similar to the one for model MtMam + I + G + F (Figure 9). The JTT + I + G + F model was used for the molecular clock.
Figure 9

Chronogram for the 47 species of Caenogastropoda with a single out-group (Helix aspersa, NC_021747) based on the Bayesian topology calculated from analysis of the 13 PCGs. Divergence times were estimated using three calibrations. Numbers near the nodes indicated the average estimated divergence time estimated in Mya.

Pomacea taxonomy has been based on shell, egg mass and soft tissue morphology [5]. However, there are no clear criteria to distinguish between species because of intraspecific variation, most likely the result of environmental influences [62]. These unclear morphological criteria have confused the taxonomy and identification within the genus, especially between P. canaliculata and P. maculate. Most invasive apple snail populations have been identified as P. canaliculata, but many may belong to other species in a closely related complex. This difficulty has resulted in their informal naming as the “P. canaliculata group” [4]. Preliminary attempts at classifying these species using mitochondrial 12S, 16S, and COXI sequence have been successful [49]. In addition, Pomacea species in Japan have been classified using COXI sequence alignments [63]. However, these studies did not clearly distinguish between these organisms. In our study, the divergence times indicated that P. canaliculata and P. maculate began to differentiate from other species about 17.38 million years ago (Mya) within the Neogene and then subsequently diverged with P. diffusa 37.11 Mya within the Paleogene (Figure 9). This suggests that the longer the divergence time, the more likely differences in external morphology would be apparent. The divergence time of the orders Architaenioglossa and Sorbeoconcha was 106.47 Mya within the middle Cretaceous. The order Hypsogastropoda terminated their differentiation after three important differentiation nodes at 87.49, 91.97 and 99.99 Mya within the late Cretaceous, and converged with the Architaenioglossa-Sorbeoconcha cluster 114.16 Mya within the middle Cretaceous. Interestingly, the cluster gathered by C. maximum, D. gregarium, E. tulipa and T. squamigerus is traditionally classified in the order Hypsogastropoda and converged with the main Architaenioglossa–Sorbeoconcha-Hypsogastropoda cluster 137.37 Mya within the early Cretaceous. Early phyletic evolution is primarily based on morphological characteristics. The development of modern molecular methods and techniques has provided additional information but also some new problems and challenges. These include how to comprehensively consider and balance position and function between morphology and modern molecular biology.

3. Materials and Methods

3.1. Sampling and Materials

Wild specimens of the three apple snails (P. canaliculata, P. diffusa and P. maculate) were collected in Ningxi Teaching and Research Farm of South China Agricultural University in Guangzhou (E 113°29′, N 23°5′) China on 2–25 March 2015. Voucher specimens were deposited in the Guangdong Engineering Research Center for Modern Eco-agriculture and Circular Agriculture, Guangzhou, China (Accession numbers: 004022015, 0050315 and 006012015). The dorsal muscle was preserved in 95% ethanol and stored at −70 °C until they were used for DNA extraction. Genomic DNA was isolated from muscle tissue as previously described [64]. Total DNA was eluted in sterile deionized water and was deposited at −20 °C. All animal experiments were conducted based on the guidelines and approval of the Animal Research and Ethics Committees of South China Agricultural University (AREC2003003, 15 March 2003).

3.2. Library Preparation for Sequencing

DNA sample quality and quantity were characterized by gel electrophoresis and UV spectroscopy using the Nano-Drop 2000 instrument (Thermo Scientific, Waltham, MA, USA). We isolated the high-quality genomic DNA to construct the DNA libraries containing insert sizes of 500 bp for paired-end sequencing. Paired-end reads of 100 bp were generated on an Illumina HiSeq2500 instrument (Illumina, Inc., San Diego, CA, USA) using sequencing protocols provided by the manufacturer.

3.3. Sequence Analysis and Annotation

Illumina paired-end sequencing reads were filtered on quality values and the low quality bases (quality < 20, p-error > 0.01) of 50 upstream and 30 downstream were trimmed. Using the Ampullariidae mitochondrial genomes from the NCBI database using bowtie2 (http://Bowtie-Bio.Sourceforge.Net/Bowtie2/Index.Shtml), the mitochondrial genome sequencing reads were captured. De novo assembly with paired-end sequencing reads was determined using SOAPdenovo2 (http://soap.genomics.org.cn/ soapdenovo.html). The protein-coding regions and ribosomal genes were identified using the Basic Local Alignment Search Tool (https://blast.ncbi.nlm.nih.gov/Blast.cgi) [65]. Species maps were drawn using OGDRAW (http://ogdraw.mpimp-golm.mpg.de/). Transfer RNA genes were predicted using a Cove score cutoff of 0.1 by tRNA scan-SE v.2.0 (http://lowelab.ucsc.edu/tRNAscan-SE/), combined with ARWEN software (http://mbio-serv2.mbioekol.lu.se/ARWEN/) [66,67] and then confirmed by MitoFish (http://mitofish.aori.u-tokyo.ac.jp/) to confirm the annotation. Strand asymmetry was calculated using the following formulas: AT skew = [A − T]/[A + T] and GC skew = [G − C]/[G + C] [68] to describe base composition. Repeat sequences were identified using Spectral Repeat Finder v1.1 [69]. Codon usage was determined for all PCGs. Long repeat analysis was used in the web-based REPuter (http://bibiserv.techfak.uni-bielefeld.de/reputer/) and included forward, reverse, and tandem repeats with minimal lengths of 30 bp and edit distances of <3 bp [70]. RSCU values [71] were obtained using MEGA 7 software (https://www.megasoftware.net/) [61]. Statistical analyses of the distributions and visualization of codon usage in the form of heat maps were conducted using R language with RSCU values, a measure of non-uniform usage of synonymous codons in a coding sequence [71]. The RSCU value was the number of times. A particular codon was observed relative to the number of times that the codon would be observed for a uniform synonymous codon usage i.e., all codons for a given amino acid exhibiting similar probabilities. The RSCU value in the absence of any codon usage bias was 1.00. A codon used less frequently than expected would have RSCU values <1.00, whereas codons used more frequently than expected would have RSCU values >1.00.

3.4. Phylogenetic Analyses

Phylogenetic analyses were conducted based on 47 mitochondrial genomes sequences of 47 species of Caenogastropoda (Table S1), using Helix aspersa (NC_021747) (Heterobranchia, Helicidae) as the out-group. All sequences were deposited in GenBank. We aligned amino acid sequences of PCGs in the 48 species using the MUSCLE program in MEGA 7. Our alignments of individual genes excluded the stop codon and the third codon. We performed phylogenetic analysis by the Maximum Likelihood (ML) and Bayesian Inference (BI) methods. The best model MtMam + I + G + F based on the amino acid sequences in this study was selected using AIC (Akaike Information Criteria). Minimum and best values were fitted with ProtTest 3 (https://omictools.com/prottest-tool) with optimized parameters [72]. ML analysis was carried out using the RAxML 8.1.5 with 1000 bootstrap replicates based on the best-scoring protein substitution model determined automatically of the software [73]. Bayesian phylogenetic analysis with 10,000,000 generations was conducted with MrBayes 3.2.6 software (http://mrbayes.sourceforge.net/index.php) [74]. Four independent Markov chains were used at the same time with sampling every 100 generations. The BI Tree was reliable since the standard deviation of split frequencies was <0.01. The phylogenetic trees were generated using Evolview (http://www.evolgenius.info/evolview/) [75,76]. The NCBI taxonomy server (http://www.ncbi.nlm.nih.gov/taxonomy/) and DeepFin (http://deepfin.org/) were adopted for additional analyses. The divergence times of the Caenogastropoda were estimated using MEGA 7.0 [77] with the RelTime-ML method and JTT + F + I + G modeling. Divergence times were presented in the Time Tree database (http://www.timetree.org/) [61]. Bayesian phylogenetic data were input to MEGA 7.0 to generate divergence times that ensured consistency between the phylogenetic tree and divergence times. Sequence differences between three apple snails and other Sorbeoconcha and Architaenioglossa species were analyzed, and P. canaliculata (NC_024586) was used as the reference sequence. Sequences were aligned by BLAST and annular genetic similarity mapping was visualized using the CGView Comparison Tool (http://stothard.afns.ualberta.ca/downloads/CCT).

4. Conclusions

The gene characterization, arrangement and distribution of the three apple snails were canonically identical and consistent with the typical Mollusca except for the absence of tRNA-Gln in P. diffusa. The absence of an identifiable control region or d-loop was consistent with other Gastropoda. Bayesian phylogenetic analysis indicated that all Ampullariidae species including the apple snails clustered on the same branch with 100 posterior probabilities. The members of the genus Pomacea clustered together first and then with genus Marisa. The orders Architaenioglossa and Sorbeoconcha clustered together and then with the order Hypsogastropoda. Furthermore, the intergenic and interspecific taxonomic positions were explicit and clear. Unexpectedly, C. maximum, D. gregarium, E. tulipa and T. squamigerus traditionally classified in order Hypsogastropoda were isolated from it and located in the most external branch of the Bayesian inference topology tree. Divergence times of the Caenogastropoda indicated their evolutionary process covered four geological epochs: Quaternary, Neogene, Paleogene and Cretaceous. Early phyletic evolution is mainly based on morphological characteristics. This study presents a characterization of mitochondrial genomes and insight into the phylogenetics of Caenogastropoda that may facilitate further investigation of species identification. This is essential for the implementation of effective management and control strategies.
  64 in total

1.  Complete mitochondrial genome of a freshwater snail, Semisulcospira libertina (Cerithioidea: Semisulcospiridae).

Authors:  Tai Zeng; Wei Yin; Rong Xia; Cuizhang Fu; Binsong Jin
Journal:  Mitochondrial DNA       Date:  2014-01-10

2.  Molecular phylogeny of the Achatinoidea (Mollusca: Gastropoda).

Authors:  Ian Kendrich Fontanilla; Fred Naggs; Christopher Mark Wade
Journal:  Mol Phylogenet Evol       Date:  2017-06-21       Impact factor: 4.286

3.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications.

Authors:  P M Sharp; W H Li
Journal:  Nucleic Acids Res       Date:  1987-02-11       Impact factor: 16.971

4.  Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes.

Authors:  N T Perna; T D Kocher
Journal:  J Mol Evol       Date:  1995-09       Impact factor: 2.395

5.  Molecular identification of Pomacea canaliculata and P. insularum from rice paddy in different origins in China using mitochondrial adenosine triphosphate subunit 6 gene.

Authors:  Qing-Qing Bian; Xiao-Yan Li; Yan-Qin Fang; Yan-Qing Jia; Xi-Dong Mu
Journal:  Mitochondrial DNA       Date:  2013-10-03

6.  Use of a saponin based molluscicide to control Pomacea canaliculata snails in Southern Brazil.

Authors:  R San Martíns; Claudio Gelmi; Jaime Vargas de Oliveira; José Luis Galo; Honorio Pranto
Journal:  Nat Prod Commun       Date:  2009-10       Impact factor: 0.986

7.  A comprehensive description and evolutionary analysis of 22 grouper (perciformes, epinephelidae) mitochondrial genomes with emphasis on two novel genome organizations.

Authors:  Xuan Zhuang; Meng Qu; Xiang Zhang; Shaoxiong Ding
Journal:  PLoS One       Date:  2013-08-09       Impact factor: 3.240

8.  The Mitochondrial Genomes of the Nudibranch Mollusks, Melibe leonina and Tritonia diomedea, and Their Impact on Gastropod Phylogeny.

Authors:  Joseph L Sevigny; Lauren E Kirouac; William Kelley Thomas; Jordan S Ramsdell; Kayla E Lawlor; Osman Sharifi; Simarvir Grewal; Christopher Baysdorfer; Kenneth Curr; Amanda A Naimie; Kazufusa Okamoto; James A Murray; James M Newcomb
Journal:  PLoS One       Date:  2015-05-21       Impact factor: 3.240

9.  tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes.

Authors:  Todd M Lowe; Patricia P Chan
Journal:  Nucleic Acids Res       Date:  2016-05-12       Impact factor: 16.971

10.  The complete mitochondrial genome of Galba pervia (Gastropoda: Mollusca), an intermediate host snail of Fasciola spp.

Authors:  Guo-Hua Liu; Shu-Yan Wang; Wei-Yi Huang; Guang-Hui Zhao; Shu-Jun Wei; Hui-Qun Song; Min-Jun Xu; Rui-Qing Lin; Dong-Hui Zhou; Xing-Quan Zhu
Journal:  PLoS One       Date:  2012-07-26       Impact factor: 3.240

View more
  4 in total

1.  Characterization of four mitochondrial genomes of family Neritidae (Gastropoda: Neritimorpha) and insight into its phylogenetic relationships.

Authors:  Jian-Tong Feng; Li-Ping Xia; Cheng-Rui Yan; Jing Miao; Ying-Ying Ye; Ji-Ji Li; Bao-Ying Guo; Zhen-Ming Lü
Journal:  Sci Rep       Date:  2021-06-03       Impact factor: 4.379

2.  Complete Mitogenomes of Three Carangidae (Perciformes) Fishes: Genome Description and Phylogenetic Considerations.

Authors:  Zhenhai Li; Min Li; Shannan Xu; Li Liu; Zuozhi Chen; Keshu Zou
Journal:  Int J Mol Sci       Date:  2020-06-30       Impact factor: 5.923

3.  Sequence characterization and phylogenetic analysis of mitogenome of the Acanthorhodeus chankaensis Dybowsky from Cao'e River.

Authors:  Haifang Xu; Pengjie Yang; Wenjie Dai; Shoubao Yang
Journal:  Mitochondrial DNA B Resour       Date:  2020-01-14       Impact factor: 0.658

4.  Predicting Current Potential Distribution and the Range Dynamics of Pomacea canaliculata in China under Global Climate Change.

Authors:  Yingxuan Yin; Qing He; Xiaowen Pan; Qiyong Liu; Yinjuan Wu; Xuerong Li
Journal:  Biology (Basel)       Date:  2022-01-10
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.