Literature DB >> 30458858

Mitochondrial genomes of two diplectanids (Platyhelminthes: Monogenea) expose paraphyly of the order Dactylogyridea and extensive tRNA gene rearrangements.

Dong Zhang1,2, Wen X Li1, Hong Zou1, Shan G Wu1, Ming Li1, Ivan Jakovlić3, Jin Zhang3, Rong Chen3, Gui T Wang4.   

Abstract

BACKGROUND: Recent mitochondrial phylogenomics studies have reported a sister-group relationship of the orders Capsalidea and Dactylogyridea, which is inconsistent with previous morphology- and molecular-based phylogenies. As Dactylogyridea mitochondrial genomes (mitogenomes) are currently represented by only one family, to improve the phylogenetic resolution, we sequenced and characterized two dactylogyridean parasites, Lamellodiscus spari and Lepidotrema longipenis, belonging to a non-represented family Diplectanidae.
RESULTS: The L. longipenis mitogenome (15,433 bp) contains the standard 36 flatworm mitochondrial genes (atp8 is absent), whereas we failed to detect trnS1, trnC and trnG in L. spari (14,614 bp). Both mitogenomes exhibit unique gene orders (among the Monogenea), with a number of tRNA rearrangements. Both long non-coding regions contain a number of different (partially overlapping) repeat sequences. Intriguingly, these include putative tRNA pseudogenes in a tandem array (17 trnV pseudogenes in L. longipenis, 13 trnY pseudogenes in L. spari). Combined nucleotide diversity, non-synonymous/synonymous substitutions ratio and average sequence identity analyses consistently showed that nad2, nad5 and nad4 were the most variable PCGs, whereas cox1, cox2 and cytb were the most conserved. Phylogenomic analysis showed that the newly sequenced species of the family Diplectanidae formed a sister-group with the Dactylogyridae + Capsalidae clade. Thus Dactylogyridea (represented by the Diplectanidae and Dactylogyridae) was rendered paraphyletic (with high statistical support) by the nested Capsalidea (represented by the Capsalidae) clade.
CONCLUSIONS: Our results show that nad2, nad5 and nad4 (fast-evolving) would be better candidates than cox1 (slow-evolving) for species identification and population genetics studies in the Diplectanidae. The unique gene order pattern further suggests discontinuous evolution of mitogenomic gene order arrangement in the Class Monogenea. This first report of paraphyly of the Dactylogyridea highlights the need to generate more molecular data for monogenean parasites, in order to be able to clarify their relationships using large datasets, as single-gene markers appear to provide a phylogenetic resolution which is too low for the task.

Entities:  

Keywords:  Gene loss; Gene rearrangement; Molecular markers; Phylogenomics; Pseudo tRNA gene tandem array

Mesh:

Substances:

Year:  2018        PMID: 30458858      PMCID: PMC6245931          DOI: 10.1186/s13071-018-3144-6

Source DB:  PubMed          Journal:  Parasit Vectors        ISSN: 1756-3305            Impact factor:   3.876


Background

Monogeneans of the family Diplectanidae (Dactylogyridea: Dactylogyrinea) are parasites found on the gills of (mostly) marine perciform fishes [1]. The family comprises approximately 250 species and is mainly studied for the adverse health effects these parasites cause to the hosts; the fixation of their opisthaptors on the gills causes haemorrhages and a white mucoid exudate, which often leads to secondary fungal, bacterial and/or viral infections [2, 3]. An example is Diplectanum aequans (Wagener, 1857), which can cause high mortality of juvenile European sea bass in the Mediterranean aquaculture [2]. Traditionally used phylogenetic markers, morphology and single-genes, are often not suitable for resolving evolutionary history with high confidence; morphological traits can be homoplastic, which often causes taxonomic and phylogenetic artifacts [4-6] and, due to the small amount of information (phylogenetic signal) they carry, single-gene molecular markers may have limited resolving power [7]. This is reflected in the unresolved phylogeny of Monogenean parasites [8-11]. Specifically, studies based on spermatozoal ultrastructural characters [12, 13], the 18S rRNA gene [9-11], and a combination of three unlinked nuclear genes [5], supported a phylogenetically closer relationship between monogenean orders Capsalidea and Gyrodactylidea than (either of the two) to the Dactylogyridea. However, 66 homologous series of morphological characters resolved the Gyrodactylidea and Dactylogyridea as sister groups [14]. Therefore, molecular markers carrying more powerful phylogenetic signals are needed to resolve their phylogenetic relationships with high resolution. Mitogenome is a good candidate marker, with an approximately ten times larger nucleotide alignment length than commonly used single-gene molecular markers (ITS, 18S and 28S rRNA). Although their applicability for studies of the Neodermata is still hampered by their relative scarcity, they are increasingly used in population genetics [15], phylogenetics [16, 17] and diagnostics [7, 18] of parasitic flatworms, despite this limitation. Intriguingly, recent researches [4, 19, 20] relying on the mitochondrial (mt) phylogenomics approach consistently resolved the Dactylogyridea and Capsalidea as sister-groups, thereby further complicating phylogenetic hypotheses for the three aforementioned orders. As the resolution power of mitochondrial genomics is still limited by the very low number of sequenced monogenean mitogenomes available, where many taxonomic categories remain poorly represented or unrepresented (only one dactylogyridean family represented), we sequenced and characterized two complete mitochondrial genomes belonging to a non-represented dactylogyridean family, the Diplectanidae: Lamellodiscus spari (Zhukov, 1970) and Lepidotrema longipenis (Yamaguti, 1934), collected from the gills of two marine fish species. Their availability shall enable us to employ mitochondrial phylogenomics to investigate relationships of these three orders with improved resolution.

Methods

Specimen collection and identification

According to the records in Zhang et al. [21], we searched for diplectanid parasites by exploring fish markets in several coastal cities in the southern China. Lepidotrema longipenis was obtained from Terapon jarbua (Forsskål, 1775) (Perciformes: Terapontidae) bought at a local market in Zhanjiang city, Guangdong Province (21°15'5"-21°15'16"N, 110°23'46"-110°24'12"E), on the 18th June 2016. Lamellodiscus spari was obtained from the black sea bream Acanthopagrus schlegelii (Bleeker, 1854) (Perciformes: Sparidae) caught by fishermen in Daya Bay, Guangdong Province (22°42'58"-22°42'56"N, 114°32'16"-114°32'25"E) on the 10th July 2017. Discriminative morphological characteristics for the Diplectanidae are their opisthaptor equipped by three transversal bars connected to two pairs of central hooks, 14 marginal hooks and accessory adhesive organ (lamellodisc or squamodisc) that can be present or absent [22, 23]. Parasites were further morphologically identified to the genus level as described in Domingues & Boeger [23], and to the species level under a light microscope according to the traits described in Ogawa & Egusa [24] for L. spari, and the traits described in Zhang et al. [21] for L. longipenis. Additionally, to confirm the taxonomic identity, the 28S rRNA gene was amplified using universal primers [25] (Additional file 1: Dataset S1); both species share a very high identity of 99.6% with corresponding conspecific homologs available in GenBank: 744/747 identical bp for L. longipenis (EF100563), and (837/840) for L. spari (DQ054823). All sampled and identified parasites were first washed in 0.6% saline and then stored in 100% ethanol at 4 °C.

DNA extraction, amplification and sequencing

Due to the small size of these parasites, we used two kinds of genomic DNA to ensure a sufficient amount for amplification and sequencing, both extracted using a TIANamp MicroDNA Kit (Tiangen Biotech, Beijing, China): mixture DNA (20 parasite specimens) and individual DNA (a single parasite specimen). Mixture DNA was first used to amplify the whole mitogenome. First, we selected 14 monogenean mitogenomes from GenBank, aligned them using ClustalX [26], and designed degenerate primer pairs (Additional file 1: Dataset S1) matching the generally conserved regions of mitochondrial genes (16S, 12S, cox1, cox2, nad1, nad4 and cytb). On the basis of these obtained fragments, specific primers were then designed using Primer Premier 5 [27], and the remaining mitogenome was amplified and sequenced in several PCR steps (Additional file 1: Dataset S1). Both mitogenomes were amplified exactly following the procedures previously described [4, 17, 19, 28]; detailed PCR conditions are provided in Additional file 1: Dataset S1. PCR products were sequenced bi-directionally using both degenerate and specific primers mentioned above on an ABI 3730 automatic sequencer (Sangon, Shanghai, China) using the Sanger method. During the sequencing we paid close attention to chromatograms, carefully examining them for double peaks, or any other sign of the existence of two different sequences. A BLAST [29] check was used to confirm that all amplicons are the actual target sequences. To address the possibility of intraspecific sequence variation present in the mixture DNA, we then used individual DNA and long-range PCR to verify the obtained sequences (primers used are listed in Additional file 1: Dataset S1). If we found two different sequences, we used the DNA extracted from a single individual to infer the mitogenomic sequence, thereby ensuring that each sequence belongs to a single specimen.

Sequence annotation and analyses

Both mitogenomes were assembled and annotated following a previously described procedure [4, 17, 19, 28] using DNAstar v.7.1 software [30], MITOS [31], ARWEN [32] and DOGMA [33] web tools, so detailed methodology is provided in Additional file 1: Dataset S1. Codon usage and relative synonymous codon usage (RSCU) for 12 protein-encoding genes (PCGs) of the two studied diplectanids, two dactylogyrids (Dactylogyrus lamellatus Achmerov, 1952 and Tetrancistrum nebulosi Young, 1967) and three capsalids (Neobenedenia melleni MacCallum, 1927, Benedenia seriolae Yamaguti, 1934 and B. hoshinai Ogawa, 1984) were computed and sorted using MitoTool [34] (an in-house GUI-based software), and finally the RSCU figure drawn using the ggplot2 [35] plugin. Non-synonymous (dN) / synonymous (dS) mutation rate ratios among the 12 PCGs of the two studied diplectanid mitogenomes were calculated with DnaSP v.5 [36]. The same software was also employed to conduct the sliding window analysis: a sliding window of 200 bp and a step size of 20 bp were implemented to estimate the nucleotide divergence Pi between the mitogenomes of L. longipenis and L. spari. Tandem Repeats Finder [37] was employed to find tandem repeats in the long non-coding regions (LNCR), and their secondary structures were predicted by Mfold software [38]. Rearrangement events in the mitogenomes and pairwise comparisons of gene orders of all 20 available monogeneans were calculated with the CREx program [39] using the common intervals measurement. Due to limitations of the CREx algorithm, and to facilitate comparative analyses of gene orders, we provisionally added the gene block trnS1-trnC-trnG to the gene order sequence of L. spari, corresponding to the position where these genes were found in L. longipenis (between nad5 and cox3). Genetic distances (identity) among mitogenomic sequences were calculated with the “DistanceCalculator” function in Biopython using the “identity” model.

Phylogenetic analyses

Phylogenetic analyses were conducted using the two newly sequenced diplectanid mitogenomes and all 18 monogenean mitogenomes available in GenBank (5th May 2018). Two species of the order Tricladida, Crenobia alpina (Dana, 1766) (KP208776) and Obama sp. (NC_026978), were used as outgroups, thus making a total of 22 mitogenomes (Additional file 2: Table S1). Two datasets were used for phylogenetic analysis: amino acid alignment of 12 protein-coding genes (PCGAA) and codon-based alignment of nucleotide sequences of 12 protein-coding genes + secondary structure alignment of 22 tRNAs and 2 rRNAs (PCGRT). As data processing was conducted as previously described [4, 17, 19, 28, 40], using MitoTool, MAFFT [41] and Gblocks [42]; details are given in Additional file 1: Dataset S1. The heterogeneity of sequence divergence within data sets was analyzed using AliGROOVE [43], wherein indels in nucleotide dataset were treated as ambiguity, and a BLOSUM62 matrix was used for amino acids. Best partitioning scheme and evolutionary models were selected using PartitionFinder2 [44], with greedy algorithm and AICc criterion. Phylogenetic analyses were conducted using two different algorithms: maximum likelihood (ML) and Bayesian inference (BI). Based on the Akaike information criterion implemented in ProtTest [45], MTART+I+G+F was chosen as the optimal evolutionary model for the downstream phylogenetic analyses. Under this optimal model and partition model, ML analysis was conducted in RAxML [46] using a ML+rapid bootstrap (BS) algorithm with 1000 replicates. Bayesian inference analyses with the empirical MTART model were conducted using PhyloBayes (PB) MPI 1.5a [47]. For each analysis, two MCMC chains were run after the removal of invariable sites from the alignment, and the analysis was stopped when the conditions considered to indicate a good run (according to the PhyloBayes manual) were reached: maxdiff < 0.1 and minimum effective size > 300. Phylograms and gene orders were visualized and annotated by iTOL [48] with the help of several dataset files generated by MitoTool, as described in our recent papers [4, 17].

Results and discussion

Genome organization and base composition

The full circular mitochondrial genome of L. longipenis (GenBank: MH328203), at 15,433 bp, is the longest among the monopisthocotylids characterized so far (Additional file 2: Table S1). The mitogenome of L. spari is 14,614 bp in size (GenBank: MH328204). The L. longipenis mitogenome contains the standard [49] 36 flatworm mitochondrial genes, including 12 protein-encoding genes (PCGs; atp8 is absent), 22 tRNA genes, and two rRNA genes, whereas trnS1, trnC and trnG genes are missing in L. spari (Table 1 and Fig. 1). The architecture, gene contents and similarity of orthologous sequences for the two studied mitogenomes are summarized in Table 1. Average sequence similarity of PCGs between the two studied mitogenomes ranged from 55.58 (nad5) to 79.45% (cox1) (Table 1). In comparison to PCGs, average sequence similarity values between the rRNAs of the two species were higher: 75.43% for rrnL and 70.87% for rrnS.
Table 1

Comparison of the annotated mitochondrial genomes of Lamellodiscus spari and Lepidotrema longipenis

GenePositionSizeIntergenic nucleotidesCodonStrandIdentity
FromToStartStop
Lepidotrema longipenis/Lamellodiscus spari
cox11/11548/15571548/1557GTG/GTGTAA/TAGH/H79.45
trnT1553/15811609/164157/614/23H/H61.9
rrnL1610/16422582/2593973/952H/H73.19
rrnS2609/25993328/3335720/73726/5H/H69.43
cox23361/33363936/3917576/58232/-ATG/ATGTAG/TAAH/H70.79
trnL13970/39204039/398570/6633/2H/H56.58
trnS24040/39854104/405065/66-/-1H/H70.15
trnE4105/40514171/411867/68H/H77.14
nad64175/41194624/4577450/4593/-ATG/GTGTAA/TAAH/H62.75
trnL24628/45844697/464870/653/6H/H64.29
trnY4700/46624767/472768/662/13H/H72.86
trnR4769/47254836/478968/651/-3H/H72.06
nad54840/47906366/61931527/14043/-ATG/GTGTAA/TAAH/H55.58
trnS16368/-6427/-60/-1/-H/-
trnC6431/-6496/-66/-3/-H/-
trnG6503/-6567/-65/-6/-H/-
 LNCR6568/61948560/79661993/1773H/H60.71
cox38561/79679214/8617654/651ATG/ATGTAA/TAAH/H71.41
trnH9218/86199285/868268/643/1H/H76.47
cytb9286/868310,374/97711089/1089ATG/ATGTAG/TAGH/H74.93
nad4L10,422/976410,670/10,018249/25547/-8ATG/ATGTAG/TAGH/H65.89
nad410,640/999111,857/11,1661218/1176-31/-28ATG/ATGTAA/TAGH/H59.5
trnQ11,874/11,31311,935/11,37562/6316/75H/H85.71
trnF11,936/11,17312,003/11,23768/65-/6H/H77.94
trnM12,000/11,38912,063/11,45364/65-4/13H/H67.69
atp612,064/11,45412,579/11,966516/513ATG/ATGTAA/TAGH/H69.56
nad212,629/11,96213,525/12,810897/84949/-5TTG/ATTTAA/TAAH/H56.16
trnA13,559/12,81713,624/12,87866/6233/6H/H68.18
trnD13,633/12,88013,697/12,94165/628/1H/H83.08
trnV13,698/12,94413,763/13,01066/67-/2H/H86.76
nad113,764/13,01214,660/13,905897/894-/1GTG/ATGTAG/TAAH/H69.56
trnN14,661/13,90914,726/13,97166/63-/3H/H74.63
trnI14,759/14,05614,827/14,12369/6832/11H/H77.14
trnP14,854/13,97814,920/14,04467/6726/6H/H79.41
trnK14,923/14,13014,986/14,19764/682/6H/H69.12
nad315,018/14,20015,386/14,550369/35131/2ATG/ATGTAG/TAGH/H63.44
trnW15,367/14,54915,433/14,61467/66-20/-2H/H66.18

Abbreviation: LNCR, large non-coding region

Fig. 1

Visual representation of the circular mitochondrial genomes of Lepidotrema longipenis and Lamellodiscus spari. Key: Red, protein-coding genes; yellow, tRNAs; green, rRNAs; grey, non-coding regions

Comparison of the annotated mitochondrial genomes of Lamellodiscus spari and Lepidotrema longipenis Abbreviation: LNCR, large non-coding region Visual representation of the circular mitochondrial genomes of Lepidotrema longipenis and Lamellodiscus spari. Key: Red, protein-coding genes; yellow, tRNAs; green, rRNAs; grey, non-coding regions

Protein-coding genes and codon usage

Eleven out of 12 PCGs of the two studied mitogenomes used ATG or GTG as the initial codons. However, it proved difficult to determine the initial codon of the nad2 gene in both species. On the basis of results reported for other related species, as a working hypothesis we proposed TTG and ATT as the initial codon of nad2 for L. longipenis and L. spari, respectively. Similarly, ATT was proposed as the start codon for nad2 in B. hoshinai (Ogawa, 1984) and Aglaiogyrodactylus forficulatus (Kritsky, Vianna & Boeger, 2007) [20, 50], and TTG as the start codon for cox2 in Paragyrodactylus variegatus (You, King, Ye & Cone, 2014) [51] (also see Additional file 3: Table S2). Canonical stop codons for the genetic code 9 (echinoderm and flatworm mitochondrion), TAA and TAG, were found in all 12 PCGs (Table 1 and Additional file 3: Table S2). Codon usage, RSCU, and codon family (corresponding to the amino acids) proportions were investigated among the seven available capsalids (three species) and dactylogyrids (four species) (Additional file 4: Figure S1). The third codon position exhibited the highest A+T bias (Table 2). Amino acids encoded by adenosine and thymine-rich codon families (such as Phe, Leu2 and Ile) were strongly preferred, whereas amino acids encoded by guanine and cytosine-rich codon families (such as Arg, Pro and Ala) appear to be selected against (Additional file 4: Figure S1).
Table 2

Nucleotide composition and skewness comparison of different elements of the mitochondrial genomes of Lamellodiscus spari and Lepidotrema longipenis

RegionsSize (bp)T(U)CAGAT (%)GC (%)GT (%)AT skewGC skew
Lepidotrema longipenis/Lamellodiscus spari
 PCGs9990/978047.3/47.77.9/8.528.5/26.716.3/17.175.8/74.424.2/25.663.6/64.8-0.248/-0.2810.347/0.336
 1st codon position3330/326041.2/40.38.2/9.030.4/29.520.2/21.371.6/69.828.4/30.361.4/61.6-0.151/-0.1540.425/0.407
 2nd codon position3330/326048.9/49.112.2/12.721.1/20.017.9/18.270.0/69.130.1/30.966.8/67.3-0.398/-0.4200.189/0.176
 3rd codon position3330/326051.7/53.73.4/3.834.0/30.710.8/11.885.7/84.414.2/15.662.5/65.5-0.207/-0.2730.523/0.516
atp6516/51349.0/51.98.7/9.225.0/23.617.2/15.474.0/75.525.9/24.666.2/67.3-0.325/-0.3750.328/0.254
cox11548/155744.4/45.610.2/10.626.0/25.319.4/18.570.4/70.929.6/29.163.8/64.1-0.261/-0.2860.310/0.272
cox2576/58242.0/43.59.7/8.929.3/26.118.9/21.571.3/69.628.6/30.460.9/65.0-0.178/-0.2490.321/0.412
cox3654/65152.1/50.46.0/7.226.6/25.315.3/17.178.7/75.721.3/24.367.4/67.5-0.324/-0.3310.439/0.405
cytb1089/108945.7/46.29.6/11.127.4/24.917.4/17.873.1/71.127.0/28.963.1/64.0-0.251/-0.3000.290/0.232
nad1897/89446.9/47.96.4/7.828.2/26.118.5/18.275.1/74.024.9/26.065.4/66.1-0.249/-0.2950.489/0.399
nad2897/84951.3/50.87.4/6.930.2/28.511.1/13.881.5/79.318.5/20.762.4/64.6-0.259/-0.2810.205/0.330
nad3369/35148.5/45.94.6/5.131.2/33.015.7/16.079.7/78.920.3/21.164.2/61.9-0.218/-0.1620.547/0.514
nad41218/117648.1/48.78.9/8.628.7/26.714.4/16.076.8/75.423.3/24.662.5/64.7-0.253/-0.2920.237/0.301
nad4L249/25550.2/47.84.8/7.531.3/25.913.7/18.881.5/73.718.5/26.363.9/66.6-0.232/-0.2980.478/0.433
nad51527/140446.6/47.36.4/7.531.6/28.815.5/16.378.2/76.121.9/23.862.1/63.6-0.192/-0.2420.413/0.367
nad6450/45948.9/49.06.9/5.427.8/29.816.4/15.776.7/78.823.3/21.165.3/64.7-0.275/-0.2430.410/0.485
rrnL973/95241.7/40.18.6/9.534.0/34.315.6/16.175.7/74.424.2/25.657.3/56.2-0.102/-0.0780.288/0.259
rrnS720/73739.7/38.89.6/10.235.3/33.915.4/17.175.0/72.725.0/27.355.1/55.9-0.059/-0.0670.233/0.254
 LNCR1993/177346.3/40.32.6/2.847.5/52.23.6/4.793.8/92.56.2/7.549.9/45.00.013/0.1280.161/0.263
 tRNAs1448/123740.2/40.47.9/8.437.0/34.214.9/17.077.2/74.622.8/25.455.1/57.4-0.041/-0.0830.309/0.338
 rRNAs1693/168940.9/39.69.0/9.834.6/34.215.5/16.575.5/73.824.5/26.356.4/56.1-0.084/-0.0730.264/0.257
 Full genome15,433/14,61445.8/45.37.3/7.932.6/31.514.3/15.478.4/76.821.6/23.360.1/60.7-0.169/-0.1800.326/0.323

Abbreviation: LNCR, large non-coding region

Nucleotide composition and skewness comparison of different elements of the mitochondrial genomes of Lamellodiscus spari and Lepidotrema longipenis Abbreviation: LNCR, large non-coding region

Loss of trnS1, trnC and trnG from the L. spari mitogenome

Both ARWEN and MITOS algorithms failed to detect trnS1, trnC and trnG in the L. spari mitogenome. We made several attempts to corroborate that this is not an artifact. First, we carefully checked (via alignments with monogenean trnS1, trnC and trnG homologs) all intergenic sequences (including the LNCR). As these missing tRNA genes are located between nad5 and LNCR in L. longipenis (Table 1, Fig. 2), and as trnC is located between two rRNA genes in many other monogeneans (Fig. 2), we focused specifically on these two fragments. None of these sequences showed appreciable similarity with the queried homologs. Secondly, we re-sequenced the fragment between nad5 and cox3 using both mixture DNA and individual DNA, and checked the chromatograms carefully [52]. We did not find any evidence for sequence variability. Thirdly, referring to L. longipenis and other closely related monogeneans, as well as cestodes and trematodes in most of which trnG is located between nad5 and cox3 (Fig. 2, Additional file 5: Figure S2), we designed two primer pairs (Additional file 1: Dataset S1) to assess whether trnG is located between these two genes (or in their vicinity) in L. spari: (i) one forward primer (LS-GlyF) matching the most conserved region of trnG and two reverse primers (LSR1-6 and LSR1-8) matching the conserved regions of cox3 to amplify the fragment between trnG and cox3; and (ii) two forward primers (LSF17-0 and LSF17) matching nad5 and one reverse primer (LS-GlyR) matching trnG to amplify the fragment between nad5 and trnG. None of these primer pairs could generate a PCR product, which indicates that trnG is either not in the vicinity of these two genes in L. spari, or that its sequence is highly divergent, or that it is completely missing from the mitogenome. On the basis of these tests and high quality of chromatograms of the fragment between nad5 and cox3 [52], we suspect that all three tRNAs (trnS1-trnC-trnG) might be missing from the mitogenome of L. spari. These results would have to be corroborated either by resequencing of this mitogenome, or by sequencing of other closely related species. Loss of tRNA genes was also reported in many other metazoan taxa [53-57]. Given that the amino acid usage frequency of serine (AGN), glycine and cysteine is analogous between L. spari and other monogeneans (Additional file 4: Figure S1), there are at least three possible explanations for the missing tRNAs: (i) they are imported from the nucleus, as is common in mitochondria [58]; (ii) they are encoded in the mitogenome, but undergo extensive post-transcriptional RNA editing [54], so they could not be identified from their coding sequences; and (iii) they are encoded on a separate minicircle of mtDNA.
Fig. 2

Phylogeny and mitogenomic architecture of the class Monogenea. The phylogram was constructed using MTART model on the basis of concatenated amino acid sequences of 20 monogenean mitogenomes. Crenobia alpina and Obama sp. are outgroups. The scale-bar corresponds to the estimated number of substitutions per site. Statistical support values are shown above the nodes, except for nodes with maximum support. Monogenean families are shown in different colors. Gene order is displayed to the right of the tree

Phylogeny and mitogenomic architecture of the class Monogenea. The phylogram was constructed using MTART model on the basis of concatenated amino acid sequences of 20 monogenean mitogenomes. Crenobia alpina and Obama sp. are outgroups. The scale-bar corresponds to the estimated number of substitutions per site. Statistical support values are shown above the nodes, except for nodes with maximum support. Monogenean families are shown in different colors. Gene order is displayed to the right of the tree

Non-coding regions

A putative control region, or long non-coding region (LNCR), was found between nad5 and cox3 genes in both mitogenomes (disregarding the three missing tRNAs; Table 1 and Fig. 1). The LNCR of L. longipenis (1993 bp) was the longest among the monogeneans characterized so far [4, 7, 16, 19, 20, 50, 51, 59–67]. The A+T content of both LNCRs (L. longipenis = 93.8%, L. spari = 92.5%) was much higher than in other parts of the mitogenomes (Table 2). Both LNCRs contained a highly repetitive region (HRR): the HRR of L. longipenis was composed of 18 tandem repeats (TRs), where repeat units 2–18 were identical (91 bp) and unit 1 was 1 bp longer with a nucleotide insertion at the 16th position; the HRR of L. spari contained 20 TRs with the consensus size of 87 bp, but sizes and sequences of the repeat units were variable, exhibiting nucleotide mutations, deletions and insertions. This is the second report of TRs with high repeat numbers and large size in the subclass Monopisthocotylea; the first report was also in a dactylogyridean species, D. lamellatus [19]. These findings consistently reject the hypothesis that monopisthocotylids possess fewer and smaller (in size) TRs in the LNCR than polyopisthocotylids [61]. As in other monogeneans [19, 64] and cestodes [68, 69], both consensus repeat patterns of the HRRs in L. longipenis and L. spari are capable of forming stem-loop structures (Additional file 6: Figure S3). Since the presence of tandem repeats forming stable secondary structure is often associated with replication origin in mitochondria [64, 70, 71], it appears likely that these repeat regions are embedded within the control region. Aside from these TRs, the LNCR of L. longipenis also harbored 17 identical trnV pseudogenes, whereas the LNCR of L. spari contained 13 trnY pseudogenes (identified using ARWEN and DOGMA algorithms), all of which were located on the minus strand. These pseudogenes and HRR repeat patterns were two separate features, although the pseudogenes partially overlapped with the repeat patterns of HRR. Among the 13 trnY pseudogenes, six were 80 bases-long, and seven were 82 bases-long, with two bases inserted in the TΨC stem. trnV pseudogene had a standard TAC anticodon, whereas trnY contained modified standard anticodon (ATA). The cloverleaf structures of the two pseudogenes and the alignment with the corresponding functional monogenean tRNA homologs is shown in Fig. 3. Average sequence similarity values of the alignment for trnV and trnY were 40.78 ± 3.87% and 39.91 ± 4.13%, respectively. As the amino acid usage frequencies of valine and tyrosine for L. longipenis and L. spari were analogous with other monogeneans (Additional file 4: Figure S1), we hypothesise that the presence of these pseudogenes is non-adaptive, and that they may not be functional. A tRNA pseudogene was also found in the LNCR of Paratetraonchoides inermis (Bychowsky, Gussev & Nagibina, 1965) [4], but such a large accumulation of tRNA pseudogenes in tandem arrays is much more common in plastid genomes [72] and prokaryotes [73] than in mitochondrial genomes of metazoans. It might be of interest to sequence mitogenomes of other closely related species to infer whether they also harbor this feature, and study its evolutionary history and mutational rate.
Fig. 3

Alignment of trnV and trnY pseudogenes with the corresponding functional monogenean tRNA homologs. Cloverleaf structures of the two pseudogenes are shown on the right of alignment. Two red bases in trnY indicate nucleotide insertions in the TΨC stem

Alignment of trnV and trnY pseudogenes with the corresponding functional monogenean tRNA homologs. Cloverleaf structures of the two pseudogenes are shown on the right of alignment. Two red bases in trnY indicate nucleotide insertions in the TΨC stem

Nucleotide diversity and evolutionary rate analysis

The sliding window analysis was conducted using concatenated alignments of 12 PCGs, two rRNAs and 19 coalescent tRNAs of the two diplectanids (trnS1, trnC and trnG were removed due to their absence from L. spari). The plot of sequence variation ratio exhibited highly variable nucleotide diversity between the two diplectanids, with Pi values for the 200 bp windows ranging from 0.201 to 0.411 (Fig. 4a). cox1 (0.201), tRNAs (0.215), rrnS (0.221), rrnL (0.224) and cytb (0.251) exhibited a comparatively low sequence variability, whereas nad2 (0.411), nad5 (0.392), nad4 (0.381) and nad6 (0.354) had a comparatively high sequence variability. This was corroborated by the non-synonymous/synonymous (dN/dS) ratio (omega) analysis, which showed that cox1 (0.163), cytb (0.167), nad4L (0.213) and cox2 (0.229) are evolving comparatively slowly, whereas nad2 (0.69), nad5 (0.49), nad6 (0.462) and nad4 (0.451) are evolving comparatively fast (Fig. 4b). Therefore, these analyses consistently indicate that cox1, which is often used as a universal barcode for species identification [74], as well as population genetics in monogeneans [75-78], is the slowest evolving and least variable gene. As rapidly evolving genes are more suitable for analyzing relationships among closely related species [79], we propose that the fast-evolving nad2, nad5, nad6 and nad4 would be better molecular markers than cox1 for diplectanids.
Fig. 4

Sliding window and selection pressure analyses of the mitogenomes of Lepidotrema longipenis and Lamellodiscus spari. a Sliding window analysis was conducted on concatenated alignments of 12 PCGs, 2 rRNAs and 19 coalescent tRNAs (missing tRNAs, trnS1, trnC and trnG, were removed). The black line represents the value of nucleotide diversity (window size = 200 bp, step size = 20 bp, with the value inserted at its mid-point). Gene names, boundaries/direction (colored arrows) and average nucleotide diversity values are indicated above the graph. b Ratios of non-synonymous (dN) to synonymous (dS) substitution rates calculated for protein-coding genes

Sliding window and selection pressure analyses of the mitogenomes of Lepidotrema longipenis and Lamellodiscus spari. a Sliding window analysis was conducted on concatenated alignments of 12 PCGs, 2 rRNAs and 19 coalescent tRNAs (missing tRNAs, trnS1, trnC and trnG, were removed). The black line represents the value of nucleotide diversity (window size = 200 bp, step size = 20 bp, with the value inserted at its mid-point). Gene names, boundaries/direction (colored arrows) and average nucleotide diversity values are indicated above the graph. b Ratios of non-synonymous (dN) to synonymous (dS) substitution rates calculated for protein-coding genes

Phylogeny

Regardless of the dataset and model used, all analyses produced phylograms with concordant branch topologies (Fig. 2 and Additional file 7: Figure S4 and S5). As the AliGROOVE analysis indicated that the PCGAA dataset exhibits lower heterogeneity than PCGRT (Additional file 7: Figure S6), we displayed only the results of PCGAA in Fig. 2. As expected, the two diplectanids, L. longipenis and L. spari, constituted a monophyletic group with maximum support. In accordance with previous results based on mitochondrial phylogenomics [4, 19, 20], the two dactylogyrids (D. lamellatus and T. nebulosi; Dactylogyridea: Dactylogyridae) formed a sister-group with the three capsalids (N. melleni, B. hoshinai and B. seriolae; Capsalidea: Capsalidae). The two newly-sequenced diplectanids (Diplectanidae) formed a sister-group with this (Dactylogyridae + Capsalidae) clade. As the family Diplectanidae was classified into the order Dactylogyridea in the classifications proposed by Bychowsky [80], Lebedev [81] and Boeger & Kritsky [82], this topology rendered Dactylogyridea paraphyletic by the nested Capsalidea clade. In these taxonomic systems, the Dactylogyridae and Diplectanidae were assigned to the same order Dactylogyridea (or suborder Dactylogyrinea within the order Dactylogyridea in Bychowsky [80]), whereas the Capsalidae was classified into the order Capsalidea (or suborder Monopisthocotylinea within the order Dactylogyridea in Bychowsky [80]). A number of subsequent morphology- and molecular data-based studies further supported the closer phylogenetic relationship between the Dactylogyridae and Diplectanidae than either of the two with Capsalidae: chaetotaxy and ciliated cells of the oncomiracidium [83], spermatozoon ultrastructure [12, 13], comprehensive morphological characters [14, 82], 28S rRNA [8] and 18S rRNA [9-11]. Regarding the phylogenetic position of the Capsalidae, it was either resolved as phylogenetically closely related to the Gyrodactylidea (families Gyrodactylidae and/or Udonellidae) based on the evidence of spermatozoon ultrastructure [12, 13], 28S rRNA gene [8], 18S rRNA gene [9-11], and a combination of three unlinked nuclear genes (28S rRNA, Histone 3 and Elongation Factor 1α) [5], or was resolved as basal to the Gyrodactylidea and Dactylogyridea on the basis of comprehensive morphological characters [14, 82]. However, as argued before, morphological traits are liable to cause taxonomic and phylogenetic artifacts in (parasitic) microscopic animals [4-6] and single gene-based molecular markers may not provide sufficient resolution power to infer the relationships among these species with high precision [7, 84]. These discrepancies could also be a result of discrepant evolutionary rates between mitochondrial and nuclear sequences, which can produce differing evolutionary signals [85, 86]. Although our phylogenetic analysis was based on relatively limited mitogenomic data (four species belonging to two families for Dactylogyridea, and three species/one family for Capsalidea), the paraphyly of the Dactylogyridea with reasonably high support values (BS = 85, PB = 0.99) suggests that their relationships should be further explored using a larger number of monogenean mitogenomes and large nuclear datasets.

Gene order

The order of tRNA genes of the two studied diplectanids exhibits notable rearrangements in comparison to all other sequenced monogenean mitogenomes (Fig. 2, Additional file 8: Table S4). Disregarding the three missing tRNAs, the gene order of the two diplectanids was very similar, with only two transposition events: position interchanges of trnF and trnQ, and trnP and trnI (Fig. 2 and Additional file 8: Table S4). This unique gene order pattern of the two diplectanids is further manifested by low pairwise similarity values in comparison with those observed among other monogeneans: the highest, between L. spari and B. hoshinai, is only 354 over 1254 (Additional file 8: Table S4). The transformational pathway from L. spari to the most similar gene arrangement, belonging to B. hoshinai, required one transposition, one TDRL (tandem-duplication-random-loss) and two coupled transposition events (Additional file 9: Figure S7). The transformational pathway from L. longipenis to the most similar gene arrangement found in T. nebulosi and P. variegatus (similarity value: 322 over 1254; gene orders of T. nebulosi and P. variegatus were identical) required two transpositions, one TDRL and two coupled transposition events (Additional file 9: Figure S7). In addition to the patterns summarized in our recent paper [4], we assigned pattern 1c to the gene order of the two diplectanids, which seems to be synapomorphic to the family Diplectanidae (Additional file 5: Figure S2). Conserved gene arrangement is considered to be a typical feature of mitochondrial genomes [87-89], but our results suggest that extensive gene order rearrangements are not rare events in the class Monogenea. As five out of six proposed main patterns are found in the Monogenea (1a, 1b, 3, 4 and the new pattern 1c; Additional file 5: Figure S2), this further confirms the hypothesis [4] that gene order in monogeneans is evolving at a relatively rapid rate. However, evidence is emerging that the evolution of mitogenomic gene order arrangements is discontinuous in monogeneans, as some taxonomic categories appear to be particularly prone to mitogenomic rearrangements (diplectanids, tetraonchids and A. forficulatus), whereas others exhibit relatively conserved gene orders. Of course, our conclusions should be interpreted within the context of the limited number of monogenean mitogenomes currently available. Discontinuity in mitogenomic architecture evolution was also found in nematodes [28], snails [90], insects [91] and vertebrates [92]. Although gene order is sometimes used as a tool for inferring phylogenetic relationships [93, 94], this discontinuity in gene order rearrangements in monogeneans might produce misleading evolutionary signals, such as disproportionately long branches, which in turn might cause long branch attraction artifacts. Thus, gene order may only be used for phylogenetic analyses in this group of animals with this limitation in mind. The provisional addition of three missing tRNAs certainly affected the similarity values and transformational pathways between L. spari and other monogeneans, but it would not affect the assignment of a new gene order pattern to diplectanids, nor our conclusion that diplectanids are prone to mitogenomic rearrangements.

Conclusions

The present study reports four findings worthy of emphasis. First, on the basis of nucleotide diversity, dN/dS and average sequence identity, we propose that nad2, nad5 and nad4 genes are better-suited as molecular markers for species identification and population genetics studies of diplectanids than the commonly used cox1. Secondly, the long non-coding region of both mitogenomes contains two interesting features: (i) a highly repetitive region, which is often associated with replication origin in mitochondria; and (ii) tRNA pseudogenes in tandem arrays, which is common in plastid genomes and prokaryotes, but rare in metazoan mitogenomes. Thirdly, phylogenetic analysis showed that the two new diplectanids (Dactylogyridea) formed a sister group with a clade comprised of two other dactylogyrids (Dactylogyridea) and three capsalids (Capsalidea). Thus, Dactylogyridea was rendered paraphyletic by the nested Capsalidea clade. Fourthly, due to the extensive tRNA gene rearrangements in the two diplectanids, we assigned them a new gene order pattern, and concluded that the evolution of mitogenomic gene order arrangements is discontinuous in monogeneans. However, our confidence in the unorthodox phylogeny produced here, and understanding of genomic architecture evolution, is curbed by the scarcity of available mitogenomes (only four dactylogyrideans and 20 monogeneans), so we encourage researchers to accumulate more samples and molecular data (especially large molecular data) for monogenean parasites in order to infer their evolutionary history with confidence. Dataset S1. Supplementary methods. (DOCX 41 kb) Table S1. Monogenean species and outgroups used for comparative mitogenomic and phylogenetic analyses. (XLSX 14 kb) Table S2. General statistics for the protein-coding and rRNA genes of the two diplectanids and 18 other available monogeneans. (XLSX 16 kb) Figure S1. Relative synonymous codon usage (RSCU) of six monopisthocotylid mitogenomes. (PDF 57 kb) Figure S2. The 23 unique gene orders in neodermatan mitochondrial genomes filtered from 113 species. Representative species and corresponding taxonomic categories at the class/subclass level are shown on the left; a star symbol denotes that the gene order is shared by Monogenea and Cestoda. Pattern types used here to classify gene orders are shown on the right. In Lamellodiscus spari, the missing tRNAs are represented by the “?” symbol in the positions homologous to the closest available diplectanid relative, Lepidotrema longipenis. (PDF 1961 kb) Figure S3. Stem-loop structures of the consensus repeat pattern in highly repetitive regions of the long non-coding regions of Lepidotrema longipenis and Lamellodiscus spari. dG denotes the structure’s free energy (ΔG in Kcal/mol at 37 °C). (PDF 297 kb) Figure S4. Maximum likelihood tree of nucleotide dataset with partition model. Figure S5. Maximum likelihood tree of amino acid dataset with partition model. Figure S6. AliGROOVE analysis. Table S3. The best partitioning scheme. (DOCX 517 kb) Table S4. Pairwise comparison of mitochondrial DNA gene orders among 13 monogenean species based on the order of all 36 genes. Scores correspond to the similarity between gene orders, where “1254” represents an identical gene order. N is the number identifying the taxon in the pairwise comparisons. (XLSX 11 kb) Figure S7. Transformational pathway from the gene orders of the two diplectanids to the most similar monogenean gene arrangements. (PDF 992 kb)
  75 in total

Review 1.  Phylogenetics of the Monogenea--evidence from a medley of molecules.

Authors:  P D Olson; D T J Littlewood
Journal:  Int J Parasitol       Date:  2002-03       Impact factor: 3.981

2.  Automatic annotation of organellar genomes with DOGMA.

Authors:  Stacia K Wyman; Robert K Jansen; Jeffrey L Boore
Journal:  Bioinformatics       Date:  2004-06-04       Impact factor: 6.937

Review 3.  [The problems of molecular phylogenetics on the example of squamata reptiles. Mitochondrial DNA markers].

Authors:  V V Grechko
Journal:  Mol Biol (Mosk)       Date:  2013 Jan-Feb

4.  Multiple sequence alignment using ClustalW and ClustalX.

Authors:  Julie D Thompson; Toby J Gibson; Des G Higgins
Journal:  Curr Protoc Bioinformatics       Date:  2002-08

5.  PhyloBayes MPI: phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment.

Authors:  Nicolas Lartillot; Nicolas Rodrigue; Daniel Stubbs; Jacques Richer
Journal:  Syst Biol       Date:  2013-04-05       Impact factor: 15.683

6.  An evaluation of genetic distances for use with microsatellite loci.

Authors:  D B Goldstein; A Ruiz Linares; L L Cavalli-Sforza; M W Feldman
Journal:  Genetics       Date:  1995-01       Impact factor: 4.562

7.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

8.  Initial steps of speciation by geographic isolation and host switch in salmonid pathogen Gyrodactylus salaris (Monogenea: Gyrodactylidae).

Authors:  Maria Meinilä; Jussi Kuusela; Marek S Zietara; Jaakko Lumme
Journal:  Int J Parasitol       Date:  2004-03-29       Impact factor: 3.981

9.  The complete mitochondrial DNA sequence of the monogenean Gyrodactylus thymalli (Platyhelminthes: Monogenea), a parasite of grayling (Thymallus thymallus).

Authors:  Laetitia Plaisance; Tine Huyse; D T J Littlewood; Tor A Bakke; Lutz Bachmann
Journal:  Mol Biochem Parasitol       Date:  2007-04-25       Impact factor: 1.759

10.  The complete mitochondrial genome of the common sea slater, Ligia oceanica (Crustacea, Isopoda) bears a novel gene order and unusual control region features.

Authors:  Fabian Kilpert; Lars Podsiadlowski
Journal:  BMC Genomics       Date:  2006-09-20       Impact factor: 3.969

View more
  14 in total

1.  Characterization of the complete mitochondrial genome of the echinostome Echinostoma miyagawai and phylogenetic implications.

Authors:  Yi-Tian Fu; Yuan-Chun Jin; Fen Li; Guo-Hua Liu
Journal:  Parasitol Res       Date:  2019-08-16       Impact factor: 2.289

2.  Assessing the population structure of trematode Metagonimus suifunensis using three mitochondrial markers.

Authors:  Polina G Shumenko; Yulia V Tatonova
Journal:  Parasitol Res       Date:  2022-02-08       Impact factor: 2.289

3.  Describing novel mitochondrial genomes of Antarctic amphipods.

Authors:  Louraine Salabao; Tim Plevoets; Bruno Frédérich; Gilles Lepoint; Marc Kochzius; Isa Schön
Journal:  Mitochondrial DNA B Resour       Date:  2022-05-10       Impact factor: 0.610

4.  Comparison of the complete mitochondrial genome of Phyllophorus liuwutiensis (Echinodermata: Holothuroidea: Phyllophoridae) to that of other sea cucumbers.

Authors:  Fuyuan Yang; Chen Zhou; Ngoc Tuan Tran; Zaiqiao Sun; Jianshao Wu; Hui Ge; Zhen Lu; Chenhui Zhong; Zhihuang Zhu; Qiuhua Yang; Qi Lin
Journal:  FEBS Open Bio       Date:  2020-07-17       Impact factor: 2.693

5.  Molecular phylogenetics and mitogenomics of three avian dicrocoeliids (Digenea: Dicrocoeliidae) and comparison with mammalian dicrocoeliids.

Authors:  Mian Sayed Khan; Vasyl V Tkach; Nehaz Muhammad; Dong Zhang; Xing-Quan Zhu; Jun Ma
Journal:  Parasit Vectors       Date:  2020-02-13       Impact factor: 3.876

6.  Laser capture microdissection in combination with mass spectrometry: Approach to characterization of tissue-specific proteomes of Eudiplozoon nipponicum (Monogenea, Polyopisthocotylea).

Authors:  Pavel Roudnický; David Potěšil; Zbyněk Zdráhal; Milan Gelnar; Martin Kašný
Journal:  PLoS One       Date:  2020-06-17       Impact factor: 3.240

7.  Mitochondrial genomes of two eucotylids as the first representatives from the superfamily Microphalloidea (Trematoda) and phylogenetic implications.

Authors:  Nehaz Muhammad; Mian Sayed Khan; Vasyl V Tkach; Hanif Ullah; Muhammad Ehsan; Jun Ma; Xing-Quan Zhu
Journal:  Parasit Vectors       Date:  2021-01-14       Impact factor: 3.876

8.  The genetic variations in the mitochondrial genomes of three Luciolinae fireflies.

Authors:  Quan Liu; Xinhua Fu
Journal:  Mitochondrial DNA B Resour       Date:  2020-08-26       Impact factor: 0.658

9.  The complete mitochondrial genome of the firefly Curtos fulvocapitalis (Coleoptera: Lampyridae).

Authors:  Wei Li; Quan Liu; Xinhua Fu
Journal:  Mitochondrial DNA B Resour       Date:  2021-12-13       Impact factor: 0.658

10.  Evidence for Adaptive Selection in the Mitogenome of a Mesoparasitic Monogenean Flatworm Enterogyrus malmbergi.

Authors:  Dong Zhang; Hong Zou; Shan G Wu; Ming Li; Ivan Jakovlić; Jin Zhang; Rong Chen; Wen X Li; Gui T Wang
Journal:  Genes (Basel)       Date:  2019-10-30       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.