| Literature DB >> 29670192 |
James Lee Crainey1, Michel Abanto Marín2,3, Túllio Romão Ribeiro da Silva4, Jansen Fernandes de Medeiros5, Felipe Arley Costa Pessoa4, Yago Vinícius Santos4,6, Ana Carolina Paulo Vicente2, Sérgio Luiz Bessa Luz4.
Abstract
Despite the broad distribution of M. ozzardi in Latin America and the Caribbean, there is still very little DNA sequence data available to study this neglected parasite's epidemiology. Mitochondrial DNA (mtDNA) sequences, especially the cytochrome oxidase (CO1) gene's barcoding region, have been targeted successfully for filarial diagnostics and for epidemiological, ecological and evolutionary studies. MtDNA-based studies can, however, be compromised by unrecognised mitochondrial pseudogenes, such as Numts. Here, we have used shot-gun Illumina-HiSeq sequencing to recover the first complete Mansonella genus mitogenome and to identify several mitochondrial-origin pseudogenes. Mitogenome phylogenetic analysis placed M. ozzardi in the Onchocercidae "ONC5" clade and suggested that Mansonella parasites are more closely related to Wuchereria and Brugia genera parasites than they are to Loa genus parasites. DNA sequence alignments, BLAST searches and conceptual translations have been used to compliment phylogenetic analysis showing that M. ozzardi from the Amazon and Caribbean regions are near-identical and that previously reported Peruvian M. ozzardi CO1 reference sequences are probably of pseudogene origin. In addition to adding a much-needed resource to the Mansonella genus's molecular tool-kit and providing evidence that some M. ozzardi CO1 sequence deposits are pseudogenes, our results suggest that all Neotropical M. ozzardi parasites are closely related.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29670192 PMCID: PMC5906601 DOI: 10.1038/s41598-018-24382-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Mansonella ozzardi mtDNA sequences
| Accession number | Coordinates and length (in brackets) | Description |
|---|---|---|
| MG913165 | 9,611–10, 481(897 nucleotides) | This mtDNA pseudogene sequence shares 84% identity (751/897) with the |
| MG913166 | 6,225–7, 006(790 nucleotides) | This mtDNA pseudogene sequence shares 83% identity (652/788) with the |
| MG913167 | 12,488–13, 078(591 nucleotides) | This mtDNA pseudogene sequence shares 92% identity (545/591) with an internal portion of the NADH dehydrogenase subunit 5. It contains only one indel: a single nucleotide insertion. The single nucleotide insertion causes a frameshift mutation but no stop codons are “encoded” before. |
| MG913168 | 4,566–5,148(585 nucleotides) | This mtDNA pseudogene sequence shares 83% identity (482/578) with the |
| MG913169 | (584 nucleotides)3,292–3, 873 | This mtDNA pseudogene sequence shares 86% (501/583) with the |
| MG913170 | (576 nts)9,937–10, 510 | This mtDNA pseudogene sequence shares 85% (487/576) with |
| MG913171 | (471 nts)13,150–13, 617 | This mtDNA pseudogene sequence shares 86% (403/471) with the coding region of NADH dehydrogenase subunit 5. It also contains seven indels: three single nucleotide deletions, three 3-nucleotide deletions, and a single 2-nucleotide deletion. |
| MG913172 | (464 nts)3,356–3, 819 | This mtDNA pseudogene sequence shares 80% (371/464) with a portion of the cytochrome c oxidase subunit I (CO1). It contains four indels: three single nucleotide deletions, one 8-nucleotide deletion. It also encodes stop codon. |
| MG913173 | (432 nts)3,292–3, 873 | This mtDNA pseudogene sequence shares 86% (501/583) portion of the cytochrome c oxidase subunit I. It contains three indels: one single nucleotide insertion, one single nucleotide deletion and one seven nucleotide deletion. |
| MG913174 | (424 nts)11020–11444 | This mtDNA pseudogene sequence shares 89% (382/427) identity with the |
| MG913175 | 3,123–3, 411(292 nts) | This mtDNA pseudogene sequence shares 89% (261/292) identity with the |
| MG913176 | 3,618–3, 857(240 nts) | This mtDNA pseudogene sequence shares 91% (219/240) identity with the |
| MG913177 | (220 nts)3,655–3, 872 | This mtDNA pseudogene sequence shares 79% (173/220) identity with the |
| MG913178 | (210 nts)3,146–3, 355 | This mtDNA pseudogene sequence shares 84% (176/210) identity with the |
An inventory of 14 mtDNA pseudogene sequences over the length of 200 nucleotides that have been identified from a M. ozzardi illumina HiSeq contig library. As well as providing GenBank sequence accession numbers and the length of each of the mtDNA pseudogene sequences, the table also provides the basis on which these sequences were categorised as pseudogene sequences.
Figure 1The genomic architecture of the M. ozzardi mitogenome shows a graphical representation of the M. ozzardi genome. The mitogenome’s annotated with gene content, gene order and orientation. Inside the circular genome proposed mitogenome Numt sequences (described in detail in Table 1) are mapped to the genome.
Figure 2A phylogenetic tree constructed from filarial parasite mitogenome sequences. A representative maximum likelihood tree constructed from a 3378 amino acid residue alignment prepared by concatenating all 12 mitochondrial protein sequences from the M. ozzardi mitogenome and aligning to their orthologous protein sequences from 21 other whole mitogenome protein sequences from allied parasitic nematodes, (this alignment is available in the supplementary information, Fig. 3). Bootstrap-supported nodes referred to in the text are indicating with percentage values that were determined from 1000 pseudoreplicates. Branch labels provide source species names and GenBank accession numbers from which whole mitogenome sequence can be retrieved.
Figure 3A phylogenetic tree constructed from filarial parasite CO1 sequences. A representative maximum likelihood tree constructed from an alignment of partial mitochondrial and Numt CO1sequence spanning 601 nucleotide positions (available in the supplementary information, Fig. 4). Bootstrap-supported nodes, referred to in the text, are indicated with percentage values that were determined from 1000 pseudoreplicates. Branch labels provide source species names and GenBank accession numbers, which can be used to retrieve source sequences. National flags are used to indicate the geographical origin of where M. ozzardi CO1 sequences derive.
Figure 4Conceptual translations of M. ozzardi CO1 sequence deposits Shows an amino acid level alignment of conceptually translated M. ozzardi CO1 sequences. Variant M. ozzardi CO1 sequences from Peru are labelled with their source nucleotide sequence accession numbers and are shown aligned to a conceptually translated M. perstans CO1 reference sequence and the M. ozzardi CO1 sequence generated for this study. The source nucleotide sequences used to create the amino acid level alignment are shown below it. Three indels discussed in the text of the paper are highlight, as is a break in identity caused by a frame-shift mutation. Yellow highlighting has been used to make the grouping of Peruvian CO1 sequences more obvious.
Figure 5A phylogenetic tree constructed from filarial parasite 12 S sequences. A representative maximum likelihood tree constructed from an alignment of partial mitochondrial 12 S sequences spanning 380 nucleotide positions (available in the supplementary information, Fig. 4). Bootstrap-supported nodes, referred to in the text, are indicated with percentage values that were determined from 1000 pseudoreplicates. Branch labels provide source species names and GenBank accession numbers, which can be used to retrieve source sequences. National flags are used to indicate the geographical origin of where M. ozzardi 12 S sequences derive.
Figure 6A phylogenetic tree constructed from filarial parasite 5 S sequences. A representative maximum likelihood tree constructed from an alignment of partial ribosomal 5 S sequences spanning 380 nucleotide positions (available in the supplementary information, Fig. 5). Bootstrap-supported nodes, referred to in the text, are indicated with percentage values that were determined from 1000 pseudoreplicates. Branch labels provide source species names and GenBank accession numbers, which can be used to retrieve source sequences. National flags are used to indicate the geographical origin of where M. ozzardi 5 S sequences derive.