Literature DB >> 29118589

The complete mitochondrial DNA sequence of the pantropical earthworm Pontoscolex corethrurus (Rhinodrilidae, Clitellata): Mitogenome characterization and phylogenetic positioning.

Ana C Conrado1, Hugo Arruda2, David W G Stanton3, Samuel W James4, George Brown5, Elodie Silva5, Lise Dupont6, Shabnam Taheri6, Andrew J Morgan3, Nelson Simões2, Armindo Rodrigues7, Rafael Montiel8, Luis Cunha3,5.   

Abstract

Pontoscolex corethrurus (Müller, 1857) plays an important role in tropical soil ecosystems and has been widely used as an animal model for a large variety of ecological studies in particular due to its common presence and generally high abundance in human-disturbed tropical soils. In this study we describe the complete mitochondrial genome of the peregrine earthworm P. corethrurus. This is the first record of a mitochondrial genome within the Rhinodrilidae family. Its mitochondrial genome is 14 835 bp in length containing 37 genes (13 protein-coding genes (PCG) 2 rRNA genes and 22 tRNA genes). It has the same gene content and structure as in other sequenced earthworms but unusual among invertebrates it hasseveral overlapping open reading frames. All genes are encoded on the same strand. Most of the PCGs use ATG as the start codon except for ND3 which uses GTG as the start codon. The A+T content of the mitochondrial genome is 59.9% (31.8% A 28.1% T 14.6% G and 25.6% for C). The annotated genome sequence has been deposited in GenBank under the accession number KT988053.

Entities:  

Keywords:  Azores; Pontoscolex corethrurus; Rhinodrilidae; earthworm; mitochondria; mitochondrial genome; peregrine species

Year:  2017        PMID: 29118589      PMCID: PMC5672579          DOI: 10.3897/zookeys.688.13721

Source DB:  PubMed          Journal:  Zookeys        ISSN: 1313-2970            Impact factor:   1.546


Introduction

Excluding a few aquatic taxa, earthworms (: ) are mostly terrestrial and include ca. 5,500 species (Blakemore et al. 2006). Believed to have originated in the Guyana Shield (Righi 1984), the earthworm (Müller, 1857) is a globally distributed species found in most tropical regions. It mainly occurs in human-disturbed areas and can be used as an indicator of ecosystem disturbance (Brown et al. 2006), and is commonly used in ecotoxicological studies (e.g. Buch et al. 2011; Buch et al. 2013; Da Silva et al. 2016). The species formerly belonged in the family, but was recently allocated to the family by James (2012), following the phylogeny of James and Davidson (2012). It is also the most well-known earthworm species in the humid tropics, frequently used in ecological and agronomic studies (Bhattacharjee and Chaudhuri 2002; Buch et al. 2013; Chapuis-Lardy et al. 2010; Dupont et al. 2012; Hamoui 1991; Marichal et al. 2010). Being a geophagous endogeic species, shows high plasticity regarding its tolerance to soil physicochemical characteristics, including variable moisture, high temperatures, exceptionally high carbon dioxide and low oxygen levels, and is capable of inhabiting nutrient-poor soils (Cunha et al. 2014; Hamoui 1991; Lavelle et al. 1987), as well as rotten logs (Buch et al. 2011). Molecular data have become increasingly important in recent years. In animals, the mitochondrial DNA (mtDNA) typically contains 37 genes, encoding 13 proteins for the enzymes required for oxidative phosphorylation, the two ribosomal RNA units (rRNA), and 22 transfer RNAs (tRNAs) necessary for the translation of the proteins encoded by mtDNA (Anderson et al. 1981; Boore 1999; Zhao et al. 2015). Remarkable progress has been made over the past several years in the field of the molecular systematics of annelids. Compared with individual genes, the mitochondrial genome is still a promising tool for inferring phylogenetic relationships due to its high content of information, and has been applied in some phylogenetic studies involving earthworms (Zhang et al. 2015; Zhang et al. 2016b). In this study, we sequenced the complete mtDNA sequence of for the first time and analyzed its structure. Additionally, we conducted phylogenetic analyses based on the mitochondrial sequence data available elsewhere with the purpose of investigating the phylogenetic position of within . The information reported in this article will facilitate further investigations of phylogenetic relationships of different species.

Material and methods

Sample collection and DNA extraction

A group of clitellate (adult) was collected in São Miguel Island (Azores, Portugal) inside pineapple greenhouses (Locality: Fajã de Baixo, ) during January 2015. Animals were euthanized in 10% ethanol and preserved in 96% ethanol for later work. A piece of body wall tissue was used for genomic DNA extraction using standard phenol/chloroform (Sambrook and Russell 2001) procedure followed by ethanol precipitation and kept at 4°C for subsequent use.

Mitochondrial DNA amplification

The complete mitogenome was amplified using seven sets of primers (Table 1) designed based on sequences retrieved from a previous study (Cunha et al. 2014).
Table 1.

Details of the primers used to amplify the mitochondrial DNA of .

Primer codeOrientationAnnealing position (bp)Nucleotide sequence (5’-3’)Melting Temperature (°C)
FP_1Forward2154..2175CTCTACTATGTACCCAGGAGTG57.46
RP_1Reverse2758..2775GCGGCCAAGATAAAGCAC57.67
RP_2Reverse3740..3762TAGAGGCGGTAAGGAGAAAGTAT58.61
RP_3Reverse5691..5708CAGAGGCGAGGTAAATTC53.85
RP_4Reverse6356..6373TGTTCAGGGCTAGGATTG54.99
FP_5Forward7983..8004ACTAGTGTCACTTACAACAACC57.16
RP_5Reverse8649..8670TGATAAGGGGGAAAGTCTGATC56.84
FP_6Forward8766..8787AGTAGCCGCTATAATAGTCCTT57.91
RP_6Reverse10328..10349TGATTTGGGGTCAGAGCCGTAG61.59
FP_7Forward10459..10478AAAGCTTGGCGGTGCTTCAC63.23
RP_7Reverse11242..11263CCTAGTGTGTGTCAGGACGCTT64.75
Details of the primers used to amplify the mitochondrial DNA of . Long PCR targets were amplified using different combinations of the primer sets, and initially sequenced with the same forward or reverse primers. Subsequent primer walking method was used to close the sequencing gaps. To ensure the accuracy of the sequence, every two contiguous segments overlapped by at least 80 bp. PCRs were performed using ~40 ng of DNA and 0.4 μM forward and reverse primers, 0.2 mM dNTP mix and 1.25 U Platinum HiFi DNA polymerase buffered with 1X Mg-free buffer (Thermofisher Scientific, UK). PCR amplification buffer was supplemented with MgCl2 to achieve a final concentration of 1.75 mM in a total volume of 25 μl reaction mixture. The reaction was denatured at 95°C for 3 min, cycled 35 times at 95°C for 30 s, 30 s at the required primer annealing temperature and 72°C for 1 min per 1000 bp depending on target fragment length. Negative controls were included in all PCR amplifications to confirm the absence of contaminants. Before sequencing, PCR cleanups were performed using Exo-SAP-IT (Amersham Pharmacia, UK) reagents. Exonuclease 1 (0.25 μl) and Shrimp Alkaline Phosphatase (0.5 μl) were mixed with the PCR product (10 μl) and incubated at 37°C for 45 min followed by 80°C for 15 min. DNA was sequenced using ABI PRISM® BigDye v3.1 Terminator Sequencing technology (Applied Biosystems, USA) on an ABI PRISM® 3100 DNA automated Sequencer.

Sequence editing and analysis

Sequence trace files were corrected and aligned with the MEGA v.7 (Kumar et al. 2016). The sequence overlap and mitogenome assembly were performed using CLC main workbench v.6 (Qiagen). The annotation of the 13 protein-coding genes and two rRNA genes were determined using the MITOS v.2 web server (Bernt et al. 2013) and PageBreakmanually curated using other published annelid mitogenomes as shown in Table 2, whereas the tRNA genes were identified using the program tRNAscan-SE 1.21 (Lowe and Eddy 1997). The annotated genome sequence was deposited in GenBank under accession number KT988053.
Table 2.

Representative species included in this study for comparison.

SpeciesFamilyLength (bp)GenBank accession number
Pontoscolex corethrurusRhinodrilidae14,835Present study
Tonoscolex birmanicusMegascolecidae15,170KF425518
Amynthas gracilisMegascolecidae15,161NC_027258
Duplodicodrilus schmardaeMegascolecidae15,156NC_029867
Metaphire guillemiMegascolecidae15,174NC_029869
Perionyx excavatusMegascolecidae15,083NC_009631
Lumbricus terrestrisLumbricidae14,998NC_001673
Drawida japonicaMoniligastridae14,648NC_028050
Hirudo nipponiaHirudinidae14,414NC_023776

Phylogenetic analyses

To clarify the phylogenetic position of within the , the complete mitogenome sequences of eight representative species (Table 2) were incorporated together with the presently obtained mitogenome sequence for phylogenetic analysis. Phylogenetic analyses were based on 13 protein-coding genes and the two rRNA units, which were aligned separately using MEGA v.7 (Kumar et al. 2016) with minor manual adjustments and then concatenated. The possible bias of substitution saturation at each codon position of protein-coding genes and two rRNA genes was investigated using DAMBE v.4.5.57 (Xia and Xie 2001) and MEGA v. 7 (Kumar et al. 2016). Representative species included in this study for comparison. Two different methods, Bayesian inference (BI) and maximum likelihood (ML) were used to construct the phylogenetic tree. Bayesian analyses were undertaken with MrBayes v.3.1.2 (Ronquist and Huelsenbeck 2003) under the best-fit model of nucleotide evolution selected in MrModeltest v.2.3 (Nylander 2004), using the Assignment Index Criterion (AIC). Analyses were run for 1,000,000 generations, and sampled every 100 generations to assess convergence. Trees that produced non-stationary log-likelihood values were discarded as part of a burn-in procedure and combined the remaining trees that resulted in convergent log-likelihood scores from both independent searches. These trees were used to construct a consensus tree. Maximum likelihood analysis (ML) was performed with MEGA v.7 (Kumar et al. 2016). Initial tree(s) for the heuristic search were obtained automatically by applying Neighbour-Joining and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, alpha parameter = 0.9143)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 17.7596% sites). The analysis involved nine mitogenome sequences (Table 2). All positions containing gaps and missing data were eliminated. The bootstrap consensus tree inferred from 1000 replicates was taken to represent the evolutionary history of the taxa analysed (Felsenstein 1985).

Results and discussion

Mitochondrial genomic structure

The mitochondrial genome of was determined to be 14 835 bp in length, comprising 13 protein-coding genes (PCGs), 22 transfer RNAs (tRNAs), two ribosomal RNAs (rRNAs), and one putative control region with a length of 318 bp (Figure 1).
Figure 1.

The mitochondrial genome of (Müller, 1857). Gene order and positions are shown, including the putative control region. IUPAC single letter codes are used to identify transfer RNA. The L1, L2, S1, and S2 transfer RNAs are differentiated on the basis of their anti-codons TAG, TAA, TCT, and TGA, respectively.

The mitochondrial genome of (Müller, 1857). Gene order and positions are shown, including the putative control region. IUPAC single letter codes are used to identify transfer RNA. The L1, L2, S1, and S2 transfer RNAs are differentiated on the basis of their anti-codons TAG, TAA, TCT, and TGA, respectively. The mitochondrial genome structure is detailed in Table 3. Gene order and orientation are similar to the previous earthworm mitochondrial genomes (Boore and Brown 1995; Zhang et al. 2015) but slightly smaller and more condensed (with several intergenic overlaps, see Table 3). The gene organization is similar to other earthworm species (e.g. : Boore and Brown 1995).
Table 3.

Organisation and structure of the mitochondrial genome.

GeneDirectionFromToSize (bp)StartStopAnti-codonIntergenic bases (bp)
COX1+115401540ATGT--0
tRNA-Asn+1541160262GTT0
COX2+16032289687ATGTAG-1
tRNA-Asp+2289235163GTC2
ATP8+23542513160ATGT--0
tRNA-Tyr+2514257663GTA-1
tRNA-Gly+2576263863TCC3
COX3+26423419778ATGT--0
tRNA-Gln+3420348869TTG0
ND6+34893954466ATGT--0
Cytb+395550941140ATGTAA-2
tRNA-Trp+5092515463TCA1
ATP6+51565851696ATGTAA-2
tRNA-Arg+5850591061TCG0
Putative Control Region+59116228318
tRNA-His+6229628860GTG0
ND5+628980101722ATGTAA3
tRNA-Phe+8014807360GAA4
tRNA-Glu+8078814164TTC0
tRNA-Pro+8142820463TGG4
tRNA-Thr+8209827264TGT0
ND4L+82738569297ATGTAA-7
ND4+856399211359ATGTAA-2
tRNA-Cys+9920998667GCA1
tRNA-Met+99881005063CAT-1
s-rRNA+1005010838789-7
tRNA-Val+108321089463TAC-2
l-rRNA+108931210412120
tRNA-Leu 1+121051216662TAG2
tRNA-Ala+121691223062TGC1
tRNA-Ser 2+122321229362TGA1
tRNA-Leu 2+122951236066TAA0
ND1+1236113290930ATGTAG-1
tRNA-lle+132901335465GAT0
tRNA-Lys+133551341763TTT0
ND3+1341813770353GTGTA--1
tRNA-Ser 1+137701383263TCT0
ND2+13833148351003ATGT--0
Organisation and structure of the mitochondrial genome. The nucleotide composition is asymmetric (31.9% A, 27.9% T, 14.9% G, and 25.3% for C) with an overall A+T content of 59.9%. One remarkable trait of metazoan mitogenomes is the strand-specific bias in nucleotide composition (Hassanin et al. 2005; Reyes et al. 1998). Such bias is measured as G/C-skew (G%-C%)/(G%+C%) and A/T-skew (A%-T%)/(A%+T%), respectively (Perna and Kocher 1995). The overall GC- and AT-skews of the H-strand of mitogenome were -0.258 and 0.066, respectively, indicating a compositional bias associated with an excess of C over G nucleotides and a slight excess of A over T nucleotides on the H-strand.

Protein-coding genes

The genome contained the expected 13 protein-coding genes with a total of 11,131 bp in size, accounting for 75.03% of the whole mitogenome. Most of the PCGs are initiated with ATG codons, except for ND3 gene, which uses GTG as the initiation codon. Six PCGs (COX1, ATP8, COX3, ND6, ND3, and ND2) are terminated with an incomplete codon T or TA, which could be completed to TAA by polyadenylation post-transcriptionally (Ojala et al. 1981). COX2 and ND1 use TAG as a termination codon. Nucleotide composition and codon usage frequencies were calculated from a concatenated sequence of all protein-coding genes on the H-strand. The base composition of protein-coding genes revealed a negative bias for A (14.4%), especially at second codon positions (12.9%, Table 4). For all protein genes, T was the most frequent nucleotide at the first and third positions whereas G was most frequent at the second position.
Table 4.

Base composition for protein-coding, tRNA, and rRNA genes of mitogenome.

Gene/RegionBase composition (%)A+T (%)Size (bp)
TCAG
COX127.426.727.918.155.31,540
COX225.824.334.915.060.7687
ATP824.426.936.911.961.3160
COX327.327.525.719.553.0778
ND628.326.030.315.558.6466
Cytb28.027.129.715.357.61,140
ATP629.629.230.610.660.2696
ND527.726.731.713.959.41,722
ND4L25.928.333.012.858.9297
ND427.527.532.312.759.81,359
ND128.425.829.816.058.2930
ND332.626.127.214.259.8353
ND230.028.230.711.160.71,003
Protein Coding
1st30.225.417.427.047.63,710
2st24.927.612.934.637.83,710
3st36.127.913.822.349.83,710
Total30.427.014.728.045.111,131
tRNA30.517.634.917.065.51,391
rRNA 24.122.037.616.361.72,001
Putative Control Region31.518.936.213.567.6318
Overall28.125.631.814.659.914,835
Base composition for protein-coding, tRNA, and rRNA genes of mitogenome.

Ribosomal and transfer RNA genes

Like other mitochondrial genomes (Inoue et al. 2000; Zardoya et al. 1995), twenty-two tRNA genes were identified (Supplementary figure 1). The tRNA genes were scattered throughout the mitochondrial genome and ranged in size from 60 to 67 bp PageBreak(Table 3). The mitogenome also contained a small subunit of rRNA and a large subunit of rRNA, which were 789 bp and 1212 bp in length, respectively. As in other genomes, these genes were located between the tRNAMet and tRNAVal genes and between tRNAVal and tRNALeu genes, respectively (Zhang et al. 2016b).

Non-coding regions

As shown in Table 3, there are 22 intergenic spacer regions, ranging in size from -7 to 4 bp observed in . As in most , the major non-coding region in mitochondrial genome was located between tRNA-Arg and tRNA-His. It was determined to be 318 bp in length, less than other reported species (Zhang et al. 2016a), and it had a base composition that was rich in A and T (A+T=67.6%).

Phylogenetic analyses within the

The phylogenetic trees (the 50% majority-rule consensus tree is shown in Figure 2) were highly consistent regardless of the analytic method used, and were statistically PageBreaksupported by high posterior probability and intermediate bootstrap values. This phylogenetic analysis represented the first investigation of relationships within the based on the complete mitogenome. As indicated by the tree, different species from the same family clustered together (: , , , and ), and the species from and the formed a monophyletic group. The species belongs to the , the sister group to (earthworms), which explains its phylogenetic position. The are not because they have a single cell layer in the clitellum.
Figure 2.

Phylogenetic relationships among phylum based on the combined 13,416 bp nucleotide positions. Total alignment length is greater than the combined protein coding and rRNA sequence lengths due to overlapping protein coding sequences that are subsequently concatenated, and indel regions in the alignment. The posterior probability value of BI analyses and bootstrap support values of ML analyses (in the order: BI, ML) are indicated near the branches.

Phylogenetic relationships among phylum based on the combined 13,416 bp nucleotide positions. Total alignment length is greater than the combined protein coding and rRNA sequence lengths due to overlapping protein coding sequences that are subsequently concatenated, and indel regions in the alignment. The posterior probability value of BI analyses and bootstrap support values of ML analyses (in the order: BI, ML) are indicated near the branches.

Conclusion

For the first time, the sequencing, annotation and analysis of the mitochondrial genome of a member of was completed. The mitogenome of was found to be 14,835 bp in length and showed a similar composition in size, low GC content and gene order to earthworm mitogenomes already available. The complete mitogenome reported here is expected to allow for further studies of the phylogeny and for analyses on the taxonomic status of the family .

Declaration of interest section

The authors report no conflicts of interest and are responsible for the content and writing of the paper.

Acknowledgments

We thank Jose Talavera for the taxonomic identification of the studied specimen. This study was financially supported by the Portuguese Government (FCT project PTDC/AAC-AMB/ 115713/2009). Luis Cunha was supported by an EU Marie Curie fellowship - MSCA-IF-2014-GF-660378. George Brown, Elodie da Silva acknowledge fellowship support by CNPq. Ana Caroline Conrado was supported by a scholarship funded by CAPES. Sam James was supported by USA NSF award DEB-1136604. Dave Stanton was supported by a NERC grant (NE/M017656/1).
  20 in total

1.  MrBayes 3: Bayesian phylogenetic inference under mixed models.

Authors:  Fredrik Ronquist; John P Huelsenbeck
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

2.  Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of metazoa, and consequences for phylogenetic inferences.

Authors:  Alexandre Hassanin; Nelly Léger; Jean Deutsch
Journal:  Syst Biol       Date:  2005-04       Impact factor: 15.683

3.  Living on a volcano's edge: genetic isolation of an extremophile terrestrial metazoan.

Authors:  L Cunha; R Montiel; M Novo; P Orozco-terWengel; A Rodrigues; A J Morgan; P Kille
Journal:  Heredity (Edinb)       Date:  2013-09-18       Impact factor: 3.821

4.  CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP.

Authors:  Joseph Felsenstein
Journal:  Evolution       Date:  1985-07       Impact factor: 3.694

5.  Complete mitochondrial genome of four pheretimoid earthworms (Clitellata: Oligochaeta) and their phylogenetic reconstruction.

Authors:  Liangliang Zhang; Jibao Jiang; Yan Dong; Jiangping Qiu
Journal:  Gene       Date:  2015-08-17       Impact factor: 3.688

6.  Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes.

Authors:  N T Perna; T D Kocher
Journal:  J Mol Evol       Date:  1995-09       Impact factor: 2.395

7.  Sequence and organization of the human mitochondrial genome.

Authors:  S Anderson; A T Bankier; B G Barrell; M H de Bruijn; A R Coulson; J Drouin; I C Eperon; D P Nierlich; B A Roe; F Sanger; P H Schreier; A J Smith; R Staden; I G Young
Journal:  Nature       Date:  1981-04-09       Impact factor: 49.962

8.  The complete nucleotide sequence of the mitochondrial DNA genome of the rainbow trout, Oncorhynchus mykiss.

Authors:  R Zardoya; A Garrido-Pertierra; J M Bautista
Journal:  J Mol Evol       Date:  1995-12       Impact factor: 2.395

9.  MITOS: improved de novo metazoan mitochondrial genome annotation.

Authors:  Matthias Bernt; Alexander Donath; Frank Jühling; Fabian Externbrink; Catherine Florentz; Guido Fritzsch; Joern Pütz; Martin Middendorf; Peter F Stadler
Journal:  Mol Phylogenet Evol       Date:  2012-09-07       Impact factor: 4.286

10.  Complete sequence of the mitochondrial DNA of the annelid worm Lumbricus terrestris.

Authors:  J L Boore; W M Brown
Journal:  Genetics       Date:  1995-09       Impact factor: 4.562

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.