| Literature DB >> 31259095 |
Thanh Hoa Le1,2, Khue Thi Nguyen1, Nga Thi Bich Nguyen1, Huong Thi Thanh Doan1,2, Takeshi Agatsuma3, David Blair4.
Abstract
We present the complete mitochondrial genome of Paragonimus ohirai Miyazaki, 1939 and compare its features with those of previously reported mitochondrial genomes of the pathogenic lung-fluke, Paragonimus westermani, and other members of the genus. The circular mitochondrial DNA molecule of the single fully sequenced individual of P. ohirai was 14,818 bp in length, containing 12 protein-coding, two ribosomal RNA and 22 transfer RNA genes. As is common among trematodes, an atp8 gene was absent from the mitogenome of P. ohirai and the 5' end of nad4 overlapped with the 3' end of nad4L by 40 bp. Paragonimusohirai and four forms/strains of P. westermani from South Korea and India, exhibited remarkably different base compositions and hence codon usage in protein-coding genes. In the fully sequenced P. ohirai individual, the non-coding region started with two long identical repeats (292 bp each), separated by tRNAGlu . These were followed by an array of six short tandem repeats (STR), 117 bp each. Numbers of the short tandem repeats varied among P. ohirai individuals. A phylogenetic tree inferred from concatenated mitochondrial protein sequences of 50 strains encompassing 42 species of trematodes belonging to 14 families identified a monophyletic Paragonimidae in the class Trematoda. Characterization of additional mitogenomes in the genus Paragonimus will be useful for biomedical studies and development of molecular tools and mitochondrial markers for diagnostic, identification, hybridization and phylogenetic/epidemiological/evolutionary studies.Entities:
Keywords: Mitochondrial genome; Paragonimidae; Paragonimus ohirai; Paragonimus westermani; Phylogenetic analysis; Repeats; Skewness value
Year: 2019 PMID: 31259095 PMCID: PMC6589331 DOI: 10.7717/peerj.7031
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Primers for amplification and sequencing of the mitochondrial genome of Paragonimus ohirai.
| URNLF | AGCCAGGTTGGTTCTTATCTAT | |
| URNSR | TACCWTGTTACGACTTAHCWCA | |
| TRECOBF | CAGATGTCYTATTGGGCTGC | |
| TRECOBR | GAACHRVCCACARYCCCTTAAAC | |
| JB3F | TTTTTTGGGCATCCTGAGGTTTAT | |
| JB4.5R | TAAAGAAAGAACATAATGAAAATG | |
| PO1R | ACATAGCCACTAACACAGCAC | |
| PO2F | TGGTTCGGGTGTTTTCTGCG | |
| PO3R | CCCACTCAAAACGAAACCTAAG | |
| PO4F | CGGTGAGGGTTCCTCTGTTG | |
| PO5F | GTCTCACAATCCCAAATCCTGG | |
| PO6F | GTTTGGGAAGTGTTGTCTGGG | |
| PO7F | TCCTTTTAGGGGAGTAAAGGCC | |
| PO8R | GCACAACCGCAACAAAGACC | |
| PO9R | AGCAGCCCAATCAGAGTTACC | |
| PO10F | CTGCACCAAATCCGAACGCC | |
| PO11F | CTTATGGAACTGCAGCACTTC | |
| PO12R | CGTACCCAACAAACATAAACC | |
| PO13F | AATTTTTCGTGTTAGAGGGAG | |
| PO14R | GAAGATAACGAATTAGCCAAAC | |
| PO15F | GTGGGTACTTGTATTCTGTG | |
| PO16R | GTCAAAACACAAAACAAACAC | |
| PO17R | CGCCGTAATTGAAGAACC | |
| PO18F | GGTGTAGAAGGGTTGCAC | |
| PO19F | GGCACTTTTTAAGGCTGGTGG | |
| PO20R | ACCTCAGATTGGAAGGAAAGCAC | NR |
| PHOR | GTTACCAAAGGATCCGCCTGC | |
| PHOWR | CGGTATGTACCCCAACTAAATC | |
| POHF | ATGATTTGCAGGAGATTTCGGAC | |
| PHOGLUR | CCTCTAACACGAAAAATTAAC | |
| GLYF | AGTATKYYGTCTTTCCAAGTC | |
| GLYR | ACKAGACCHCYGACTTGGAAAGAC |
Notes.
Platyhelminth-universal primers used for inital amplification of the corresponding genes/regions to get sequences to design further primers.
Locations of genes and other features in the mitochondrial genome of Paragonimus ohirai (14,818 bp).
| 1–645 | 645 | 214 | ATG | TAG | +3 | |||
| 649–717 | 69 | GTG | AR, SE, DO | 0 | ||||
| 718–1,836 | 1,119 | 372 | ATG | TAG | +5 | |||
| 1,842–2,105 | 264 | 87 | ATG | TAG | −40 | |||
| 2,066–3,325 | 1,260 | 419 | ATG | TAG | +11 | |||
| 3,337–3,398 | 62 | TTG | AR | +6 | ||||
| 3,405–3,470 | 66 | GAA | AR, SE, DO | 0 | ||||
| 3,471–3,535 | 65 | CAT | AR, SE, DO | 0 | ||||
| 3,536–4,051 | 516 | 171 | ATG | TAG | +2 | |||
| 4,054–4,920 | 867 | 288 | ATG | TAG | +7 | |||
| 4,928–4,991 | 64 | TAC | AR, SE, DO | +2 | ||||
| 4,994–5,059 | 66 | TGC | AR, SE, DO | +6 | ||||
| 5,066–5,132 | 67 | GTC | AR, SE, DO | 0 | ||||
| 5,133–6,038 | 906 | 301 | ATG | TAG | +11 | |||
| 6,049–6,119 | 70 | GTT | AR, SE, DO | −2 | ||||
| 6,118–6,184 | 67 | TGG | +7 | |||||
| 6,192–6,254 | 63 | GAT | +11 | |||||
| 6,266–6,329 | 64 | CTT | AR, SE, DO | 0 | ||||
| 6,330–6,686 | 357 | 118 | ATG | TAG | +7 | |||
| 6,694–6,755 | 62 | GCT | AR | +7 | ||||
| 6,763–6,827 | 65 | TCA | AR, SE, DO | +3 | ||||
| 6,831–8,363 | 1,533 | 510 | ATG | TAG | −1 | |||
| 8,363–8,428 | 66 | TGT | AR, SE, DO | 0 | ||||
| 8,429–9,402 | 974 | 0 | ||||||
| 9,403–9,468 | 66 | GCA | AR, DO | 0 | ||||
| 9,469–10,204 | 736 | 0 | ||||||
| 10,205–10,804 | 600 | 199 | ATG | TAG | −1 | |||
| 10,804–11,256 | 453 | 150 | GTG | TAG | +3 | |||
| 11,260–1,1,320 | 61 | GTA | AR, DO | +11 | ||||
| 11,332–11,396 | 65 | TAG | AR, DO | −3 | ||||
| 11,394–11,457 | 64 | TGA | AR | +18 | ||||
| 11,476–11,538 | 63 | TAA | AR, SE, DO | +11 | ||||
| 11,540–11,618 | 69 | TCG | AR, DO | 0 | ||||
| 11,619–13,202 | 1,584 | 527 | GTG | TAA | +10 | |||
| 13,213–13,284 | 72 | TCC | AR, SE, DO | +28 | ||||
| LR1 | 13,313–13,604 | 292 | +8 | |||||
| 13,613–13,681 | 69 | TTC | SE, DO | −29 | ||||
| LR2 | 13,653–13,944 | 292 | +15 | |||||
| STR1 | 13,960–14,076 | 117 | 0 | |||||
| STR2 | 14,077–14,193 | 117 | 0 | |||||
| STR3 | 14,194–14,310 | 117 | 0 | |||||
| STR4 | 14,311–14,426 | 117 | 0 | |||||
| STR5 | 14,427–14,544 | 117 | 0 | |||||
| STR6 | 14,545–14,661 | 117 | 0 | |||||
| unique seq | 14,662–14,818 | 157 | 0 | |||||
Notes.
basepair
amino acid
intergenic sequence
(+, number of nucleotides before start of the following gene; −, number of nucleotides overlapping with the following gene or other feature); LR, Long repeat; STR, short tandem repeat. Transfer RNAs (tRNA) found by software used in this study (SE] tRNAscan-SE 1.21 (Lowe & Eddy, 1997) AR, ARWEN Alpha version (Laslett & Canback, 2008); DO, DOGMA (Wyman, Jansen & Boore, 2004) with slight amendment own by our own determination.
Indicating tRNAs lacking DHU-arm.
Figure 1A circular map of the mitochondrial genome of the lung fluke Paragonimus ohirai (KX765277).
Protein-coding and ribosomal large and small subunit genes are abbreviated according to with slight modification by Le, Blair & McManus (2002). Genes are transcribed in a clockwise direction from one strand. The transfer RNA genes (tRNA) are designated by the single-letter code for the corresponding amino acid abbreviations, with the exception of those coding for Serine (tRNA and tRNA) (Table 2). The long non-coding region consists of two long repeats (LR1 and LR2) separated by tRNA, and followed by six short tandem repeats (STR). A unique sequence is between the last STR6 and cox3.
Base composition and related statistics (skewness evaluation) for protein-coding genes and ribosomal regions of the mtDNA ofand otherspecies, and means and standard deviations of base composition of all species included in the study.
| Protein-coding | Pohi-Kinosaki-JP | 10,104 | 26.17 | 10.83 | 63.00 | −0.46 | 37.00 | 0.41 | ||
| Pwes(2n)-Haenam-KR | 10,101 | 13.40 | 38.21 | 30.61 | 17.77 | 51.61 | −0.48 | 48.39 | 0.26 | |
| Pwes(3n)-Bogil-KR | 10,101 | 13.33 | 38.25 | 30.61 | 17.81 | 51.58 | −0.48 | 48.42 | 0.26 | |
| Pwes-TypeI-IN | 10,104 | 14.89 | 40.50 | 29.36 | 15.25 | 55.39 | −0.46 | 44.61 | 0.32 | |
| Pwes-AP-IN | 10,140 | 14.72 | 40.40 | 29.58 | 15.30 | 55.12 | −0.46 | 44.88 | 0.32 | |
| Phet_China | 10,101 | 14.93 | 43.91 | 28.18 | 12.98 | 58.84 | −0.49 | 41.16 | 0.37 | |
| Pkell | 10,098 | 15.19 | 44.78 | 27.73 | 12.30 | 59.97 | −0.49 | 40.30 | 0.38 | |
| Mean, all included species | 17.58 | 46.04 | 25.91 | 10.41 | 63.64 | −0.45 | 36.38 | 0.43 | ||
| S.D. all included species | 3.5 | 2.86 | 2.85 | 2.49 | 4.93 | 0.07 | 4.91 | 0.07 | ||
| Ribosomal RNA coding | Pohi-Kinosaki-JP | 1,710 | 22.92 | 37.19 | 26.90 | 12.98 | 60.01 | −0.23 | 39.99 | 0.35 |
| Pwes(2n)-Haenam-KR | 1,732 | 19.11 | 32.62 | 31.12 | 17.15 | 51.73 | −0.26 | 48.27 | 0.29 | |
| Pwes(3n)-Bogil-KR | 1,732 | 18.94 | 32.79 | 31.29 | 16.97 | 51.73 | −0.27 | 48.27 | 0.30 | |
| Pwes-TypeI-IN | 1,729 | 20.24 | 35.11 | 29.84 | 14.81 | 55.35 | −0.27 | 44.65 | 0.34 | |
| Pwes-AP-IN | 1,721 | 20.22 | 34.63 | 30.16 | 14.99 | 54.85 | −0.26 | 45.15 | 0.34 | |
| Phet China | 1,711 | 20.81 | 37.05 | 28.23 | 13.91 | 57.86 | −0.28 | 42.14 | 0.34 | |
| Pkell | 1,711 | 21.33 | 37.52 | 27.47 | 13.68 | 58.85 | −0.28 | 41.15 | 0.33 |
Notes.
Paragonimus ohirai
P. westermani
P. heterotremus
P. kellicotti.
Japan
South Korea
India.
Kinosaki locality, Japan
Arunachal Pradesh locality, India (see Table S1)
High A and T nucleotide use (17.00% and 46.00%) in P. ohirai is bolded. S.D. standard deviation nt nucleotide.
The overlap between nad4L and nad4 is counted twice in this calculation.
Figure 2Drawings of predicted structure models of 22 transfer RNAs in the mitochondrial genome of Paragonimus ohirai, arranged in alphabetical order of the amino acids they specify.
Each tRNA (here abbreviated as trn) gene is named according to the one-letter amino acid abbreviation, with the exception of those specifying Serine, S1 and S2; and Leucine, L1 and L2 (L1, CUN; L2, UUR; S1, AGN; and S2, UCN); DHU arms are missing in tRNA and in tRNA. A: trnA (Alanine); B: trnR (Arginine); C: trnN (Asparagine); D: trnD (Aspartic acid); E: trnC (Cystine); F: trnE (Glutamic acid); G: trnQ (Glutamine); H: trnG (Glycine); I: trnH (Histidine); J: trnI (Isoleucine); K: trnL1(CUN) (Leucine); L: trnL2(UUR) (Leucine); M: trnK (Lysine); N: trnM (Methionine); O: trnF (Phenylalanine); P: trnP (Proline); Q: trnS1(AGN) (Serine); R: trnS2(UCN) (Serine); S: trnT (Threonine); T: trnW (Tryptophan); U: trnY (Tyrosine); V: trnV (Valine). Names of structural components of a tRNA gene are indicated in the tRNA structure.
Figure 3Phylogenetic tree based on concatenated amino-acid sequence data for the 12 mitochondrial proteins from 50 strains of 42 digenean trematode species (Table S1).
Phylogenetic reconstruction was performed in MrBayes 3.2 program (Ronquist et al., 2012) with a final alignment of 2,509 amino-acid residues in length after removing poor-quality regions of the alignment. The tree produced (left-hand side of figure) was a 50% majority-rule tree to which all compatible groupings were added and rooted by outgroup (i.e., members of the Diplostomata –schistosomes and related taxa). Bayesian posterior support values for each node were 100% in every case except one (within the Opisthorchiidae, value of 57% indicated). Accession numbers are given at the end of each sequence label. Paragonimus ohirai in this study is marked by a star symbol. Full names of each species and of families where they belong are provided. The scale bar represents the number of substitutions per site. The tree on the right-hand side of the figure shows the relationships of the same families according to Olson et al. (2003).