Literature DB >> 22844544

The complete mitochondrial genome of Galba pervia (Gastropoda: Mollusca), an intermediate host snail of Fasciola spp.

Guo-Hua Liu1, Shu-Yan Wang, Wei-Yi Huang, Guang-Hui Zhao, Shu-Jun Wei, Hui-Qun Song, Min-Jun Xu, Rui-Qing Lin, Dong-Hui Zhou, Xing-Quan Zhu.   

Abstract

Complete mitochondrial (mt) genomes and the gene rearrangements are increasingly used as molecular markers for investigating phylogenetic relationships. Contributing to the complete mt genomes of Gastropoda, especially Pulmonata, we determined the mt genome of the freshwater snail Galba pervia, which is an important intermediate host for Fasciola spp. in China. The complete mt genome of G. pervia is 13,768 bp in length. Its genome is circular, and consists of 37 genes, including 13 genes for proteins, 2 genes for rRNA, 22 genes for tRNA. The mt gene order of G. pervia showed novel arrangement (tRNA-His, tRNA-Gly and tRNA-Tyr change positions and directions) when compared with mt genomes of Pulmonata species sequenced to date, indicating divergence among different species within the Pulmonata. A total of 3655 amino acids were deduced to encode 13 protein genes. The most frequently used amino acid is Leu (15.05%), followed by Phe (11.24%), Ser (10.76%) and IIe (8.346%). Phylogenetic analyses using the concatenated amino acid sequences of the 13 protein-coding genes, with three different computational algorithms (maximum parsimony, maximum likelihood and bayesian analysis), all revealed that the families Lymnaeidae and Planorbidae are closely related two snail families, consistent with previous classifications based on morphological and molecular studies. The complete mt genome sequence of G. pervia showed a novel gene arrangement and it represents the first sequenced high quality mt genome of the family Lymnaeidae. These novel mtDNA data provide additional genetic markers for studying the epidemiology, population genetics and phylogeographics of freshwater snails, as well as for understanding interplay between the intermediate snail hosts and the intra-mollusca stages of Fasciola spp..

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22844544      PMCID: PMC3406003          DOI: 10.1371/journal.pone.0042172

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Many snails within the families Lymnaeidae and Planorbidae act as intermediate hosts of medically and veterinary important digenean trematodes that infect humans and domestic animals (especially sheep and cattle) [1], [2]. Galba pervia (Pulmonata: Lymnaeidae) is widely distributed and is the dominant host snail for transmission of Fasciola spp. in China [3]. Fascioliasis caused by Fasciola spp. is a signifancant disease of livestock animals causing substantial economic impact [4]–[7]. More importantly, millions of humans have been infected by Fasciola spp. in a number of countries [8]. The metazoan mitochondrial (mt) genome, ranging in length from 14 to 18 kb, is typically circular and usually contains 36–37 genes, including 12–13 protein-coding genes, 2 ribosomal RNA (rRNA) genes and 22 transfer RNA (tRNA) genes [9]. In addition, metazoan mt genome usually contains at least one lengthy noncoding region which is essential regulatory element for the initiation of transcription and replication [9]. Mitochondrial DNA (mtDNA) has long been extensively used as genetic markers to resolve evolutionary relationships among animal species due to their maternal inheritance, higher mutation rates than nuclear genes, and relatively conserved genome structures compared to ribosomal DNA [10]–[15]. In coelomate animals, mt gene arrangements are usually relatively stable within each phylum [16]. However, Mollusca, the second largest animal phylum, exhibit high diversity in their mt genome structures. For example, the mt genomes of Mytilus edulis, Argopecten irradians and Chlamys farreri contain supernumerary or lost tRNA genes [17], [18]. The Gastropoda are the largest class of the Mollusca and their mt gene arrangements also exhibit high levels of varibility [16]. There have been considerable controversies regarding the phylogenetic relationships of Gastropoda. Gastropoda were traditionally classified into three main subclasses based on their morphological characters: Prosobranchia, Opisthobranchia and Pulmonata. In modern taxonomies, Opisthobranchia and Pulmonata usually are clustered together in the clade Euthyneura [19], [20]. When mt gene rearrangements occur, they may usually provide very powerful phylogenetic information for resolving phylogenetic relationships among taxa [21]. Compared to other metazoan animals, only 21 complete mt genome sequences of Pulmonata species have been sequenced and deposited in GenBank (Table 1) to date, and only a low quality mt genome has been determined for the family Lymnaeidae [22].
Table 1

Mitochondrial genome sequences of Pulmonata sequenced prior to the present study.

FamilySpeciesLength (bp)Accession numbers
Clausiliidae Albinaria coerulea 14130NC_001761
Volvatellidae Ascobulla fragilis 14745NC_012148
Planorbidae Biomphalaria glabrata 13670NC_005439
Biomphalaria tenagophila 13722NC_010220
Helicidae Cepaea nemoralis 14100NC_001816
Placobranchidae Elysia chlorotica 14132NC_010567
Ellobiidae Myosotella myosotis 14246NC_012434
Auriculinella bidentata 14135NC_016168
Ovatella vulcani 14274NC_016175
Pedipes pedipes 16708NC_016179
Onchidiidae Onchidella celtica 14150NC_012376
Platevindex mortoni 13991NC_013934
Peronia peronii 13968NC_016181
Pyramidellidae Pyramidella dolabrata 13856NC_012435
Siphonariidae Siphonaria pectinata 14065NC_012383
Siphonaria gigas 14518NC_016188
Veronicellidae Rhopalocaulis grandidieri 14523NC_016183
Amphibolidae Salinator rhamphidia 14007NC_016185
Succineidae Succinea putris 14092NC_016190
Lymnaeidae Radix balthica 13993HQ330989
Trimusculidae Trimusculus reticulatus 14044NC_016193
The objectives of the present study were to determine the complete mt sequence of the G. pervia, to compare the mt sequence with those of Radix balthica to infer further insights into the high variability of Gastropoda mitochondrial genomes, and to study phylogenetic relationships of Pulmonata using mt sequence dataset.

Results and Discussion

Genome content and organization

The complete mt genome of G. pervia was 13,768 bp in length (Figure 1), and the mtDNA sequence was deposited in GenBank (accession number JN564796). The G. pervia mt genome contains 13 protein-coding genes (cox1-3, nad1-6, nad4L, atp6, atp8 and cytb), a small subunit ribosomal RNA gene (rrnS), a large subunit ribosomal RNA gene (rrnL), and 22 transfer RNA genes, but without lengthy non-coding regions (Table 2). As found in other Gastropoda species, most of these genes are coded on the heavy strand (H-strand) except for atp6, atp8, nad3, cox3, 8 tRNA genes and rrnS. The details of gene locations were given in Table 2.
Figure 1

The mitochondrial genome of Galba pervia.

Gene scaling is only approximate. All genes have standard nomenclature including the 22 tRNA genes, which are designated by the one-letter code for the corresponding amino acid, with numerals differentiating each of the two leucine- and serine-specifying tRNA (L1 and L2 for codon families CUN and UUR, respectively; S1 and S2 for codon families AGN and UCN, respectively).

Table 2

Positions and nucleotide lengths of the mitochondrial genome of Galba pervia.

GenePositionStrandCodonAnticodonIntergenic nucleotidesa
From-To (length)Start/Stop
nad51–1653 (1653)H ATA/TAG +15
nad11669–2529 (861)H ATG/TAA −1
nad4L2529–2993 (465)H ATG/TAG −158
cytb2836–3933 (1098)H ATA/TAG −29
tRNA-Asp (D)3905–3971 (67)H GTC −9
tRNA-Phe (F)3963–4025 (63)H GAA +27
cox24053–4685 (633)H ATA/TAA −19
tRNA-Tyr (Y)4667–4726 (60)H TGT −6
tRNA-Trp (W)4721–4787 (67)H TCA −8
tRNA-Cys (C)4780–4842 (63)H GCA +1
tRNA-Gly (G)4844–4918 (75)H TCC −19
tRNA-His (H)4900–4959 (60)H GTG +6
tRNA-Gln (Q)4966–5029 (64)L TTG −10
tRNA-Leu (UUR) (L2)5020–5082(63)L TAA −32
atp85051–5197 (147)L ATA/TAA +30
tRNA-Asn (N)5228–5292 (65)L GTT −51
atp65242–5928 (687)L ATA/TAA +3
tRNA-Arg (R)5932–5996 (65)L TCG −9
tRNA-Glu (E)5988–6054 (67)L TTC 0
rrnS6055–6767 (713)L+1
tRNA-Met (M)6769–6835 (67)L CAT −6
nad36830–7168 (339)L ATA/TAA +16
tRNA-Ser UCN (S2)7185–7247(63)L TGA −7
tRNA-Ser AGN (S1)7241–7294 (54)H GCT +5
nad47310–8647 (1338)H ATA/TAA −29
tRNA-Thr (T)8619–8684 (66)L TGT −20
cox38665–9462 (798)L ATG/TAA +42
tRNA-Ile (I)9505–9567(63)H GAT +25
nad29593–10501(909)H ATA/TAA −29
tRNA-Lys (K)10473–10541 (69)H TTT −29
cox110513–12057 (1545)H ATA/TAA −2
tRNA-Val (V)12056–12116 (61)H TAC +3
rrnL12120–13128 (1009)H−6
tRNA-Leu CUN (L1)13123–13187 (65)H TAG −5
tRNA-Pro (P)13183–13246 (64)H TGG −1
tRNA-Ala (A)13246–13312 (67)H TGC −41
nad613272–13763 (492)H ATG/TAA +5

indicates gap nucleotides (positive value) or overlapped nucleotides (negative value) between two adjacent genes.

The mitochondrial genome of Galba pervia.

Gene scaling is only approximate. All genes have standard nomenclature including the 22 tRNA genes, which are designated by the one-letter code for the corresponding amino acid, with numerals differentiating each of the two leucine- and serine-specifying tRNA (L1 and L2 for codon families CUN and UUR, respectively; S1 and S2 for codon families AGN and UCN, respectively). indicates gap nucleotides (positive value) or overlapped nucleotides (negative value) between two adjacent genes. The nucleotide compositions of the complete mtDNA sequence of G. pervia are biased toward A and T, with T being the most favored nucleotide and C the least favored, in accordance with the mt genome of R. balthica and Biomphalaria glabrata and B. tenagophila [22], [23]. The content of A+T is 72.67% for G. pervia (32.21% A, 40.46% T, 14.58% G and 12.75% C) (Table 3), 71.29% for R. balthica (31.68% A, 39.61% T, 15.39% G and 13.31% C), 75.78% for B. tenagophila (33.66% A, 42.12% T, 13.50% G and 10.72% C), respectively. Strand asymmetry (strand compositional bias) is usually reflected by skewness [24], which is calculated as (A%−T%)/(A%+T%) and (G%−C%)/(C%+G%), respectively. AT-skews and GC-skews of the whole mt genome were calculated for Pulmonata species to date (Table 3). This composition of full mtDNA sequence of G. pervia is strongly skewed away from A in favor of T (AT skew = −0.114), and GC skew = 0.067. The pattern of skew values of G. pervia is highly congruent with those observed in the mtDNA sequences of other pulmonate animals (Table 3). Previous studies suggested that GC skew is the best indicators of strand asymmetry [25]. Hence, all Pulmonata species in the present study show strand asymmetry (GC skew between −0.071and 0.210) (Table 3). Interestingly, in all the mt genome sequences of Pulmonata species reported to date, only the GC skew of Rhopalocaulis grandidieri has negative value due to that the C content of R. grandidieri mt genome is relatively higher than its G content. In mammals, these asymmetrical and biased base composition of mt genomes may be due to the spontaneous deamination process of C and A in the H-strand during replication [26], [27].
Table 3

Nucleotide composition of the mitochondrial genomes of Pulmonata species.

SpeciesNucleotide frequency (%)Whole genome sequence
ATGCA+T%AT skewGC skew
Albinaria coerulea 32.7537.9015.5413.8170.65−0.0730.059
Ascobulla fragilis 30.1236.9317.8215.1367.05−0.1020.082
Biomphalaria glabrata 33.0541.5814.0811.2974.63−0.1140.110
Biomphalaria tenagophila 33.6642.1213.5010.7275.78−0.1120.115
Cepaea nemoralis 26.1633.6321.2618.9459.79−0.1250.058
Elysia chlorotica 26.9236.9919.2316.8663.91−0.1580.066
Myosotella myosotis 23.6731.3523.6321.3555.02−0.1400.051
Auriculinella bidentata 25.8330.8822.6520.4156.71−0.0890.052
Ovatella vulcani 25.0529.7023.6921.5554.76−0.0850.047
Pedipes pedipes 28.5933.7319.2718.4262.31−0.0820.023
Onchidella celtica 25.2634.0621.7718.9259.31−0.1480.070
Peronia peronii 27.0637.2820.2915.3764.34−0.1590.138
Platevindex mortoni 27.2735.7220.2316.7862.99−0.1340.093
Pyramidella dolabrata 27.4435.9719.6016.9763.41−0.1350.072
Siphonaria pectinata 29.7637.0618.2614.9266.82−0.1090.101
Siphonaria gigas 24.3237.2423.3515.0861.56−0.2100.215
Rhopalocaulis grandidieri 29.2733.8817.1119.7363.15−0.073−0.071
Salinator rhamphidia 26.6635.5820.8316.9362.23−0.1430.103
Succinea putris 33.8042.8912.0810.8276.69−0.1190.055
Trimusculus reticulatus 26.4034.7220.6418.2461.12−0.1360.062
Radix balthica 31.6839.6115.3913.3171.29−0.1110.072
Galba pervia 32.2140.4614.5812.7572.67−0.1140.067
The gene arrangement differs among the mt genome sequences of 21 pulmonate animals, including that of R. balthica and B. glabrata and B. tenagophila (not shown). The gene arrangement in protein-coding genes seems to be stable in this group with a few exceptions. However, tRNA gene arrangement among these pulmonate animals is highly diversified, which provides further support for considerable variation in mt gene arrangement among pulmonate animals. Compared to the mt gene arrangement of R. balthica, the mt genome of G. pervia shows a novel gene arrangement (Figure 2). All rearranged genes are tRNA genes, in which 3 tRNA genes (tRNA-His, tRNA-Gly and tRNA-Tyr) changed their positions or directions (Figure 2). The tRNA-His arrangements represent translocation, tRNA-Gly arrangements represent shuffling, and tRNA-Tyr arrangements represent remote inversion translocations. Generally, tRNA gene rearrangements can be classified as translocations (across a protein-coding gene), local inversion (inverted but remaining in the position), remote inverted (translocated and inverted), and shuffling (on the same mt strand but in a different position) [28]. Within the Gastropoda, gene rearrangements have been particularly prevalent, and the mt gene rearrangment events in Gastropoda mt genomes have been discussed [29], [30]. To date, four mechanisms have been proposed to explain mt gene rearrangement in metazoan: (i) the tandem duplication followed by random loss (TDRL) of supernumerary genes owning to selection favoring small genomes [31], [32], (ii) illicit priming of replication by tRNA genes [33], (iii) tandem duplication followed by nonrandom loss of excess genes [34], and (iv) nonhomologous intergenome or intragenome recombination is presumed to be the most possible explanation for local inversion [35]. Considering all the data available so far, these mechanisms still do not explain mt gene rearrangment events in Gastropoda, because these compactly organized mt genomes of Gastropoda (with very few and short noncoding sequences) suggest strong selection against maintaining remnants of duplication events [36]. However, in G. pervia, a 6 bp and in R. balthica a 156 intergenic region were found in the location of tRNA-His, which might support TDRL as a mechanism acting in Pulmonata mt genome rearrangements.
Figure 2

Comparison of the mitochondrial gene arrangement between Galba pervia and Radix balthica.

All genes have standard nomenclature including the 22 tRNA genes, which are designated by the one-letter code for the corresponding amino acid, with numerals differentiating each of the two leucine- and serine-specifying tRNA (L1 and L2 for codon families CUN and UUR, respectively; S1 and S2 for codon families AGN and UCN, respectively). Underlined genes are coded on the light strand.

Comparison of the mitochondrial gene arrangement between Galba pervia and Radix balthica.

All genes have standard nomenclature including the 22 tRNA genes, which are designated by the one-letter code for the corresponding amino acid, with numerals differentiating each of the two leucine- and serine-specifying tRNA (L1 and L2 for codon families CUN and UUR, respectively; S1 and S2 for codon families AGN and UCN, respectively). Underlined genes are coded on the light strand. Overlapping of adjacent genes (protein-encoding genes overlapped) is common in many animal mt genomes, although the extent of overlaps varies [37]–[40]. The mt genes of G. pervia overlap a total of 526 bp in 21 locations which range from 1 to 158 bp, including overlapping between protein-encoding and tRNA genes (Table 2) which has also been found in Gastropoda mt genomes [41], [42]. The G. pervia mt genes are separated by intergenic spacer sequences of a total of 179 bp in length, which are located in 13 regions and range from 1 to 42 bp in size (Table 2). The longest intergenic region (42 bp) is located between cox3 and tRNA-Ile genes.

Protein-coding genes and codon usage patterns

The boundaries between protein-coding genes in the mt genome of G. pervia were determined by aligning their sequences and by identifying translation initiation and termination codons with comparison to those of R. balthica and Biomphalaria spp.. The predicted translation initiation and termination codons for the protein-coding genes of G. pervia mt genome were compared with those of R. balthica and Biomphalaria spp.. As shown in Table 2, the start codons of 13 protein-coding genes are ATN codon, which is typical of most metazoan mt genomes. The start codons inferred in the mt genome of G. pervia are ATA and ATG, and all reading-frames of the G. pervia ended with TAG or TAA as termination codons, whereas no anomalous initiation codons and incomplete stop codons are used, although they frequently occur in protein-coding genes of most Gastropoda mtDNA genomes [29]. The pattern of codon usage in the mtDNA of G. pervia was also studied. Excluding the termination codons, a total of 3655 amino acids are encoded by the G. pervia mt genome. Codons composed of A and T are more frequently used in protein-coding genes, which seems to reflect the high content of A+T in the entire mt genome of G. pervia. For example, the most frequently used amino acids were Leu (15.05%), followed by Phe (11.24%), Ser (10.76%) and IIe (8.34%) (Table 4).
Table 4

Codon usage of Galba pervia mitochondrial protein-coding genes.

Amino acidCodonNumberFrequency (%)Amino acidCodonNumberFrequency (%)
Phe TTT 37410.23Met ATA 1574.29
Phe TTC 271.01Met ATG 451.23
Leu TTA 37710.31Thr ACT 892.43
Leu TTG 481.31Thr ACC 120.32
Ser TCT 1072.92Thr ACA 561.53
Ser TCC 130.35Thr ACG 60.16
Ser TCA 922.51Asn AAT 1183.22
Ser TCG 150.41Asn AAC 260.71
Tyr TAT 1273.47Lys AAA 872.38
Tyr TAC 250.68Lys AAG 100.27
Term TAA 100.27Ser AGT 461.25
Term TAG 30.08Ser AGC 120.32
Cys TGT 451.23Ser AGA 862.35
Cys TGC 100.27Ser AGG 240.65
Trp TGA 651.77Val GTT 982.68
Trp TGG 150.41Val GTC 90.24
Leu CTT 681.86Val GTA 1072.92
Leu CTC 10.02Val GTG 270.73
Leu CTA 521.42Ala GCT 852.32
Leu CTG 50.13Ala GCC 160.43
Pro CCT 701.91Ala GCA 671.83
Pro CCC 60. 16Ala GCG 90.24
Pro CCA 481.31Asp GAT 611.66
Pro CCG 90.24Asp GAC 50.13
His CAT 501.36Glu GAA 661.80
His CAC 180.49Glu GAG 160.43
Gln CAA 541.47Gly GGT 581.58
Gln CAG 70.19Gly GGC 100.27
Arg CGT 200.54Gly GGA 721.96
Arg CGC 10.02Gly GGG 611.66
Arg CGA 350.95IIe ATT 2757.52
Arg CGG 30.05IIe ATC 390.82

Total number of codons is 3655.

Term = Stop codon.

Total number of codons is 3655. Term = Stop codon.

Transfer RNA and ribosomal RNA genes

The 22 tRNA genes in G. pervia mt genome vary in length from 53 to 75 nucleotides with differences in stem and loop sizes of dihydrouridine (D) and TΨC loops. The 22 tRNA genes are located on both strands. Of these, 14 tRNA are encoded on the H-strand and 8 on the light strand (L-strand) (Table 2). The order and orientation of the gene arrangement pattern are identical to that of R. balthica, except for the positions and directions of the tRNA-His, tRNA-Gly and tRNA-Tyr genes. All of the 22 tRNA genes can be folded into normal cloverleaf structure, except for tRNA-Ser(AGN) that lacks DHU arm. Their putative secondary structures are similar to those of R. balthica or B. tenagophila (not shown), indicating their similar functions. In mt genomes of most Gastropoda animals, tRNA-Ser(UCN) generally lacks DHU arm [30], [42], [43], but it has a standard cloverleaf structure in mt genome of G. pervia. It is interesting that the secondary structures of mt tRNA genes are highly variable among Gastropoda animals. Previous studies suggested that the reduction of tRNA stem was caused by a strong pressure for mt genome minimization [44]. The rrnS and rrnL genes of G. pervia were identified by sequence comparison with those of R. balthica. The rrnS is located between tRNA-Glu and tRNA-Met, and the rrnL is located between tRNA-Val and the tRNA-LeuCUN. The lengths of the rrnS and rrnL genes of G. pervia are 713 bp and 1009 bp, respectively. The lengths of the rrnS and rrnL genes of R. balthica are 713 bp and 969 bp, respectively. The A+T contents of the rrnS and rrnL of G. pervia are 72.09% and 74.93%, respectively. Sequence identities in the rrnS and rrnL genes between G. pervia and R. balthica are 83.45% and 81.97%, respectively.

Phylogenetic analyses

The phylogenetic relationships of 20 Pulmonata species based on concatenated amino acid sequence datasets, plus the mt DNA sequence of G. pervia obtained in the present study, using maximum parsimony (MP), maximum likelihood (ML) and Bayesian analyses (Bayes) analyses are shown in Figure 3. The amino acid sequences of R. balthica were not used due to its low quality mt genome. The topologies of the trees from the MP, ML and Bayes analyses were identical or similar. In the tree, two major clades were recovered within Pulmonata: clade I and clade II form monophyletic groups, respectively. Within the clade I, Siphonaria pectinata and S. gigas clustered together with high statistical support, indicating that S. pectinata is sister to S. gigas. Within the clade II, Placobranchidae+Volvatellidae and other families form monophyletic groups, respectively.
Figure 3

Inferred phylogenetic relationship among Pulmonata species based on mitochondrial DNA sequences.

The concatenated amino acid sequences of 13 protein-coding genes were analyzed utilizing maximum parsimony (MP), maximum likelihood (ML) and Bayesian analysis (Bayes), using Aplysia californica and A. dactylomela as outgroup. The numbers along branches indicate bootstrap probability (BP) and posterior probability (PP) resulting from different analyses in the order: MP/ML/Bayes. Alternative topologies were correspondingly shown on the downside of the tree. The colourred species indicated that the phylogenetic status of the family Ellobiidae remains problematic.

Inferred phylogenetic relationship among Pulmonata species based on mitochondrial DNA sequences.

The concatenated amino acid sequences of 13 protein-coding genes were analyzed utilizing maximum parsimony (MP), maximum likelihood (ML) and Bayesian analysis (Bayes), using Aplysia californica and A. dactylomela as outgroup. The numbers along branches indicate bootstrap probability (BP) and posterior probability (PP) resulting from different analyses in the order: MP/ML/Bayes. Alternative topologies were correspondingly shown on the downside of the tree. The colourred species indicated that the phylogenetic status of the family Ellobiidae remains problematic. The results of the present study revealed that G. pervia and B. tenagophila are closely related with high statistical support, indicating that Lymnaeidae and Planorbidae are closely related families, consistent with previous classification based on their morphological features and molecular data [45], [46]. Interestingly, MP and ML analyses revealed that families Amphibolidae and Pyramidellidae are not closely related (with very low bootstrap probability value), whereas Bayes analysis strongly supported that Amphibolidae and Pyramidellidae are closely related with high posterior probability value (Figure 3). Recent molecular phylogenetic studies revealed that the monophy of Pulmonata was clearly rejected by the phylogenetic position of a representative of the Systellommatophora (Onchidella celtica) which is more closely related to Pyramidella dolabrata than it was to any other Pulmonate [47]. This is probably due to the use of a small dataset. Thus, sampling of more taxa is needed to more accurately define phylogenetic positions of these families in further studies. In the present study, the phylogenetic status of the family Ellobiidae remains problematic, consistent with findings of recent molecular phylogenetic studies [48], [49]. Here, we will not further discuss the phylogenetic relationships of these families (eg., Clausiliidae, Aplysiidae, Volvatellidae, Pleurobranchidae, Siphonariidae and Ellobiidae) since their phylogenetic relationships have been discussed in detail by a recent study using nearly 80 Pulmonata species based on analyses of 18S, 16S and CO1 sequences [50]. Some recent studies showed that analyses using whole mt genome sequences (amino acid sequences) yield more accurate results than that using singe or small sets of gene sequences [51], [52]. Thus, more mt genomes of Pulmonata need to be sequenced and evolutionary relationships of Pulmonata should be reexamined. In conclusion, the present study determined the mt genome sequence of G. pervia, which represents the first sequenced high quality mt genome of freshwater snails of the family Lymnaeidae. The G. pervia mt genome exhibits novel mt gene arrangement compared with other Pulmonata species. Phylogenetic analysis based on the mt amino acid sequence dataset revealed that Lymnaeidae and Planorbidae are closely related families, supporting previous classifications based on their morphology and molecular studies. These novel mtDNA data should be useful for further studying the population genetics and phylogeographics of this freshwater snail, which in turn would contribute to the effective control of Fasciola spp. transmitted by it.

Materials and Methods

Sample collection and DNA extraction

An adult freshwater snail representing G. pervia was collected from Guilin City, Guangxi Zhuang Nationality Autonomous Region, China. The specimen was washed in physiological saline, identified morphologically to species according to existing keys and descriptions [53], fixed in 70% (v/v) ethanol and stored at −20°C until use. Total genomic DNA was extracted from its thoracic muscle tissue by treatment with sodium dodecyl sulphate/proteinase K (Merck), followed by purification using Wizard™ DNA Clean-Up System (Promega) and then eluted into 60 µl H2O according to the manufacturer's recommendations. The DNA sample was stored at −20°C until further use.

Amplification and sequencing of partial cox1, nad1, cytb, cox3 and rrnL

Partial fragments of mt cox1, nad1, cytb, cox3 and rrnL genes were amplified using 5 sets of primers (Table 5) designed according to mtDNA sequences of other Pulmonata deposited in GenBank. PCR reactions were carried out in a 25 µl reaction volume consisting of 16.75 µl sterile deionized water, 2.5 µl 10×PCR Buffer (Mg2+ free), 2.0 µl MgCl2 (25 mM), 2.0 µl dNTPs (2.5 mM each), 0.25 µl each primer (50 pmol/µl), 0.25 µl ExTaq DNA polymerase (5 U/µl, Takara) and 1.0 µl DNA template (40 ng/µl) under the following conditions: after an initial denaturation at 94°C for 5 min, then 94°C for 1 min (denaturation), 50–55°C for 30 s (annealing), 72°C for 1 min (extension) for 35 cycles, followed by 72°C for 10 min (final extension). Each amplicon (5 µL) was examined by agarose gel electrophoresis to validate amplification efficiency. Then the cox1, nad1, cytb, cox3 and rrnL amplicons were sent to BGI-Shenzhen Company, China for sequencing from both directions.
Table 5

Primers used to amplify short-PCR fragments from Galba pervia.

Name of primerSequence (5′ to 3′)
GPcox1F TCTTTTRTTCTTCTTTTATGTTC
GPcox1R ATTRAAATTTCGATCTGTTAA
GPrrnLF CGGCCGCCTGTTTATCAAAAACAT
GPrrnLR GGAGCTCCGGTTTGAACTCAGATC
GPnad1F GAACGAAAAATTTTAGGKTATRTTCAAATTCG
GPnad1R ATTAAAACCAGAAACTAAT
GPcytbF TTTCTTTCTATACAYTATACA
GPcytbR TAWGGTTYTTCAATAGGACA
GPcox3F ACTACATCAACRAAATGTCARTATCA
Gpcox3R CCTTTTCAYTTAGTAGAAT

Long-PCR amplification and sequencing

The obtained nucleotide sequences of partial cox1, nad1, cytb, cox3 and rrnL were used to design primer sets (Table 6) for long PCR amplification of the entire G. pervia mt genome. Five overlapping long PCR fragments covering the entire mt genome of G. pervia were obtained. The Long-PCR reaction volume amounted 50 µl containing 27.5 µl sterile deionized water, 5.0 µl 10×LA PCR Buffer (Mg2+ free), 5.0 µl MgCl2 (25 mM), 8.0 µl dNTPs (2.5 mM each), 0.5 µl each primer (50 pmol/µl), 0.5 µl LA Taq DNA polymerase (5 U/µl, Takara) and 3 µl DNA template (40 ng/µl). Long-PCR cycling conditions used were 92°C for 2 min (initial denaturation), then 92°C for 10 s (denaturation), 40°C for 30 s (annealing), and 60°C for 5 min (extension) for 5 cycles, followed by 92°C for 10 s, 40°C for 30 s, and 66°C for 5 min for 20 cycles and a final extension at 66°C for 10 min. All amplifications were done on a T-Gradient thermocycler (Biometra, Germany). The 5 long-PCR fragments were sequenced using a primer-walking strategy.
Table 6

Primers used to amplify Long-PCR fragments from Galba pervia.

Name of primerSequence (5′ to 3′)
GPcox1u1 TCTTTTRTTCTTCTTTTATGTTC
GPrrnLd1 GGAGCTCCGGTTTGAACTCAGATC
GPrrnLu2 CGGCCGCCTGTTTATCAAAAACAT
GPnad1d2 ATTAAAACCAGAAACTAAT
GPnad1u3 GAACGAAAAATTTTAGGKTATRTTCAAATTCG
GPcytbd3 TAWGGTTYTTCAATAGGACA
GPcytbu4 TTTCTTTCTATACAYTATACA
GPcox3d4 CCTTTTCAYTTAGTAGAAT
GPcox3u5 ACTACATCAACRAAATGTCARTATCA
GPcox1d5 ATT RAA ATT TCG ATC TGT TAA

Gene annotation and sequence analysis

Sequences were assembled manually and aligned against the complete mt genome sequence of R. balthica and B. tenagophila using the computer program Clustal X 1.83 [54] to identify gene boundaries. The open-reading frames and codon usage profiles of protein-coding genes were analysed by the Open Reading Frame Finder (http://www.ncbi.nlm.nih.gov/gorf/gorf.html) using the invertebrate mitochondrial code. Translation initiation and translation termination codons were identified based on comparison with the mt genome of R. balthica and B. tenagophila. The amino acid sequences inferred for the mt genes of G. pervia were aligned with those of R. balthica and B. tenagophila by using Clustal X 1.83. Based on pairwise comparison, amino acid identity (%) was calculated for homologous genes. Codon usage was examined based on the relationships between the nucleotide composition of codon families and amino acid occurrence, where the genetic codons are partitioned into AT rich codons, GC-rich codons and unbiased codons. For analyzing ribosomal RNA genes, putative secondary structures of 22 tRNA genes were identified using tRNAscan-SE [55], of the 22 tRNA genes, 5 were identified using tRNAscan-SE, the other 17 tRNA genes were found by eye inspection, and rRNA genes were identified by comparison with the mt genome of R. balthica and B. tenagophila. Phylogenetic relationship among the 20 Pulmonata species (Table 1), plus the mt DNA sequence of G. pervia obtained in the present study was reconstructed based on amino acid sequences of 13 protein-coding genes using the 2 Opisthobranchia species (Aplysia californica, GenBank accession number NC_005827 and A. dactylomela, NC_015088) as the outgroup. Each gene was translated into amino acid sequence using the invertebrate mitochondrial genetic code in MEGA 4 [56], and aligned based on its amino acid sequence using default settings, and ambiguously aligned regions were excluded using Gblocks online server (http://molevol.cmima.csic.es/castresana/Gblocks_server.html) [57] using the options for a less stringent selection. The final amino acid sequences of the 13 protein-coding genes were then concatenated into single alignments for phylogenetic analyses. Three different inference methods, namely MP, ML, and Bayes, were used for phylogenetic analyses. MP analysis was performed using PAUP* 4.0b10 [58], with indels treated as missing character states. A total of 1,000 random addition searches using TBR were performed for each MP analysis. Bootstrap probability (BP) was calculated from 1,000 bootstrap replicates with 10 random additions per replicate in PAUP. ML analyses were performed using PhyML 3.0 [59], and the MtArt+I+G+F model with its parameter for the concatenated dataset was determined for the ML analysis using ProtTest 10.2 based on the Akaike information criterion (AIC) [60]. BP value for ML trees was calculated using 1000 bootstrap replicates. Bayesian analyses were conducted with four independent Markov chains run for 1,000,000 metropolis-coupled MCMC generations, sampling a tree every 100 generations in MrBayes 3.1.1 [61]. The first 2,500 trees were omitted as burn-in and the remaining trees were used to calculate Bayesian posterior probabilities (PP). Phylograms were drawn using the Tree View program version 1.65 [62].
  58 in total

1.  Complete sequence of the mitochondrial DNA of the primitive opisthobranch gastropod Pupa strigosa: systematic implication of the genome organization.

Authors:  A Kurabayashi; R Ueshima
Journal:  Mol Biol Evol       Date:  2000-02       Impact factor: 16.240

2.  A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach.

Authors:  S Whelan; N Goldman
Journal:  Mol Biol Evol       Date:  2001-05       Impact factor: 16.240

3.  Complete sequence and gene organization of the mitochondrial genome for Hubbard's sportive lemur (Lepilemur hubbardorum).

Authors:  Runhua Lei; Gary D Shore; Rick A Brenneman; Shannon E Engberg; Brandon D Sitzmann; Carolyn A Bailey; Lisa M Kimmel; Richard Randriamampionona; Jean Freddy Ranaivoarisoa; Edward E Louis
Journal:  Gene       Date:  2010-06-12       Impact factor: 3.688

4.  Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes.

Authors:  N T Perna; T D Kocher
Journal:  J Mol Evol       Date:  1995-09       Impact factor: 2.395

5.  Phylogenetic relationships and evolution of pulmonate gastropods (Mollusca): new insights from increased taxon sampling.

Authors:  Benoît Dayrat; Michele Conrad; Shaina Balayan; Tracy R White; Christian Albrecht; Rosemary Golding; Suzete R Gomes; M G Harasewych; António Manuel de Frias Martins
Journal:  Mol Phylogenet Evol       Date:  2011-02-23       Impact factor: 4.286

6.  Sequences and gene organization of the mitochondrial genomes of the liver flukes Opisthorchis viverrini and Clonorchis sinensis (Trematoda).

Authors:  X Q Cai; G H Liu; H Q Song; C Y Wu; F C Zou; H K Yan; Z G Yuan; R Q Lin; X Q Zhu
Journal:  Parasitol Res       Date:  2011-05-31       Impact factor: 2.289

7.  Complete DNA sequence of the mitochondrial genome of the sea-slug, Aplysia californica: conservation of the gene order in Euthyneura.

Authors:  Bjarne Knudsen; Andrea B Kohn; Ben Nahir; Catherine S McFadden; Leonid L Moroz
Journal:  Mol Phylogenet Evol       Date:  2005-10-17       Impact factor: 4.286

8.  Duplication and remoulding of tRNA genes during the evolutionary rearrangement of mitochondrial genomes.

Authors:  P Cantatore; M N Gadaleta; M Roberti; C Saccone; A C Wilson
Journal:  Nature       Date:  1987 Oct 29-Nov 4       Impact factor: 49.962

9.  Mitochondrial genome sequence and expression profiling for the legume pod borer Maruca vitrata (Lepidoptera: Crambidae).

Authors:  Venu M Margam; Brad S Coates; Richard L Hellmich; Tolulope Agunbiade; Manfredo J Seufferheld; Weilin Sun; Malick N Ba; Antoine Sanon; Clementine L Binso-Dabire; Ibrahim Baoua; Mohammad F Ishiyaku; Fernando G Covas; Ramasamy Srinivasan; Joel Armstrong; Larry L Murdock; Barry R Pittendrigh
Journal:  PLoS One       Date:  2011-02-02       Impact factor: 3.240

10.  The complete mitochondrial genome sequence of the hydrothermal vent galatheid crab Shinkaia crosnieri (Crustacea: Decapoda: Anomura): a novel arrangement and incomplete tRNA suite.

Authors:  Jin-Shu Yang; Hiromichi Nagasawa; Yoshihiro Fujiwara; Shinji Tsuchida; Wei-Jun Yang
Journal:  BMC Genomics       Date:  2008-05-30       Impact factor: 3.969

View more
  18 in total

1.  Characterization of the complete mitochondrial genome sequence of Homalogaster paloniae (Gastrodiscidae, Trematoda) and comparative analyses with selected digeneans.

Authors:  Xin Yang; Lixia Wang; Hanli Feng; Mingwei Qi; Zongze Zhang; Chong Gao; Chunqun Wang; Min Hu; Rui Fang; Chengye Li
Journal:  Parasitol Res       Date:  2016-06-14       Impact factor: 2.289

2.  Characterization of the complete mitochondrial genome of Spirocerca lupi: sequence, gene organization and phylogenetic implications.

Authors:  Guo-Hua Liu; Yan Wang; Hui-Qun Song; Ming-Wei Li; Lin Ai; Xing-Long Yu; Xing-Quan Zhu
Journal:  Parasit Vectors       Date:  2013-02-22       Impact factor: 3.876

3.  The Mitochondrial Genomes of the Nudibranch Mollusks, Melibe leonina and Tritonia diomedea, and Their Impact on Gastropod Phylogeny.

Authors:  Joseph L Sevigny; Lauren E Kirouac; William Kelley Thomas; Jordan S Ramsdell; Kayla E Lawlor; Osman Sharifi; Simarvir Grewal; Christopher Baysdorfer; Kenneth Curr; Amanda A Naimie; Kazufusa Okamoto; James A Murray; James M Newcomb
Journal:  PLoS One       Date:  2015-05-21       Impact factor: 3.240

4.  Complete mitochondrial genomes of the 'intermediate form' of Fasciola and Fasciola gigantica, and their comparison with F. hepatica.

Authors:  Guo-Hua Liu; Robin B Gasser; Neil D Young; Hui-Qun Song; Lin Ai; Xing-Quan Zhu
Journal:  Parasit Vectors       Date:  2014-03-31       Impact factor: 3.876

5.  The complete mitochondrial genomes of two octopods Cistopus chinensis and Cistopus taiwanicus: revealing the phylogenetic position of the genus Cistopus within the order Octopoda.

Authors:  Rubin Cheng; Xiaodong Zheng; Yuanyuan Ma; Qi Li
Journal:  PLoS One       Date:  2013-12-17       Impact factor: 3.240

6.  The mitochondrial genome of the semi-slug Omalonyx unguis (Gastropoda: Succineidae) and the phylogenetic relationships within Stylommatophora.

Authors:  Leila Belén Guzmán; Roberto Eugenio Vogler; Ariel Aníbal Beltramino
Journal:  PLoS One       Date:  2021-06-25       Impact factor: 3.240

7.  Mitochondrial and nuclear ribosomal DNA evidence supports the existence of a new Trichuris species in the endangered françois' leaf-monkey.

Authors:  Guo-Hua Liu; Robin B Gasser; Peter Nejsum; Yan Wang; Qiang Chen; Hui-Qun Song; Xing-Quan Zhu
Journal:  PLoS One       Date:  2013-06-20       Impact factor: 3.240

8.  Mitochondrial genome of the eyeworm, Thelazia callipaeda (Nematoda: Spirurida), as the first representative from the family Thelaziidae.

Authors:  Guo-Hua Liu; Robin B Gasser; Domenico Otranto; Min-Jun Xu; Ji-Long Shen; Namitha Mohandas; Dong-Hui Zhou; Xing-Quan Zhu
Journal:  PLoS Negl Trop Dis       Date:  2013-01-31

9.  Complete mitochondrial genome sequence data provides genetic evidence that the brown dog tick Rhipicephalus sanguineus (Acari: Ixodidae) represents a species complex.

Authors:  Guo-Hua Liu; Fen Chen; Yi-Zhou Chen; Hui-Qun Song; Rui-Qing Lin; Dong-Hui Zhou; Xing-Quan Zhu
Journal:  Int J Biol Sci       Date:  2013-04-22       Impact factor: 6.580

10.  The complete mitochondrial genome of Meloidogyne graminicola (Tylenchina): a unique gene arrangement and its phylogenetic implications.

Authors:  Longhua Sun; Kan Zhuo; Borong Lin; Honghong Wang; Jinling Liao
Journal:  PLoS One       Date:  2014-06-03       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.