| Literature DB >> 35741832 |
Jose Carlos Solana1,2,3, Carmen Chicharro2,3, Emilia García2, Begoña Aguado4, Javier Moreno2,3, Jose M Requena1,3.
Abstract
Parasites of medical importance, such as Leishmania and Trypanosoma, are characterized by the presence of thousands of circular DNA molecules forming a structure known as kinetoplast, within the mitochondria. The maxicircles, which are equivalent to the mitochondrial genome in other eukaryotes, have been proposed as a promising phylogenetic marker. Using whole-DNA sequencing data, it is also possible to assemble maxicircle sequences as shown here and in previous works. In this study, based on data available in public databases and using a bioinformatics workflow previously reported by our group, we assembled the complete coding region of the maxicircles for 26 prototypical strains of trypanosomatid species. Phylogenetic analysis based on this dataset resulted in a robust tree showing an accurate taxonomy of kinetoplastids, which was also able to discern between closely related Leishmania species that are usually difficult to discriminate by classical methodologies. In addition, we provide a dataset of the maxicircle sequences of 60 Leishmania infantum field isolates from America, Western Europe, North Africa, and Eastern Europe. In agreement with previous studies, our data indicate that L. infantum parasites from Brazil are highly homogeneous and closely related to European strains, which were transferred there during the discovery of America. However, this study showed the existence of different L. infantum populations/clades within the Mediterranean region. A maxicircle signature for each clade has been established. Interestingly, two L. infantum clades were found coexisting in the same region of Spain, one similar to the American strains, represented by the Spanish JPCM5 reference strain, and the other, named "non-JPC like", may be related to an important leishmaniasis outbreak that occurred in Madrid a few years ago. In conclusion, the maxicircle sequence emerges as a robust molecular marker for phylogenetic analysis and species typing within the kinetoplastids, which also has the potential to discriminate intraspecific variability.Entities:
Keywords: Leishmania; kDNA; maxicircle; mitochondrial DNA; phylogeny; trypanosomatids
Mesh:
Year: 2022 PMID: 35741832 PMCID: PMC9222942 DOI: 10.3390/genes13061070
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Maxicircle sequence datasets from trypanosomatid species used for phylogenetic studies.
| Species | GenBank/ENA * | Reference | Mendeley Data (DOI) a |
|---|---|---|---|
| SRR834802 | This work | 10.17632/8h8tzrbzft.1 | |
| SRR834907 | This work | 10.17632/23t2h58cvr.1 | |
| LR697138 | [ | 10.17632/hdyj8hbt39.1 | |
| LR697137 | [ | 10.17632/nszm7rb8y7.1 | |
| CP022652.1 | Direct submission | 10.17632/jhy362m5ms.1 | |
| ERR2191875 | This work | 10.17632/74d3b3pnvt.1 | |
| SRR8584809 | This work | 10.17632/pfdfzpwgjd.1 | |
| LR697136 | [ | 10.17632/98cn9h5t67.1 | |
|
| M10126.1 | [ | 10.17632/6kwj82nt8s.1 |
| LR697135 | [ | 10.17632/rcjzz74fvj.1 | |
| MK570510.1 | [ | 10.17632/9sxnsjy9xx.1 | |
| LR697134 | [ | 10.17632/bg3 × 4tcr64.1 | |
| ERR3656053 | This work | 10.17632/4wf4hb3k7g.1 | |
| SRR1657912 | This work | 10.17632/k6ffm6j8c3.1 | |
| SRR13558795 | This work | 10.17632/xhj67pv882.1 | |
| SRR13558784 | This work | 10.17632/9dykktyxp5.1 | |
| MK514117.1 | [ | 10.17632/c4zvyzftfg.1 | |
|
| KJ778684.1 | Direct submission | 10.17632/88w6bk3mkr.1 |
| KM386509.1 | [ | 10.17632/484wfgchdr.1 | |
| KM386508.1 | [ | 10.17632/wmkxhtz5rg.1 | |
| M94286.1 | [ | 10.17632/7yrtcn4nkk.1 | |
| MG948557.1 | [ | 10.17632/g44sr7djp4.1 | |
| KR072974.1 | [ | 10.17632/6w4gdb9rg8.1 | |
| KJ803830.1 | Direct submission | 10.17632/gs6sbtbh7z.1 | |
| DQ343646.1 | [ | 10.17632/7hfpy5frmv.1 | |
| DQ343645.1 | [ | 10.17632/7gtbgwjjv8.1 |
* The NCBI GenBank accession numbers are included for published maxicircle sequences. For new assembled maxicircles (this work), the ENA accession numbers corresponding to whole-genome reads are provided. a Accessed on 12 June 2022.
Figure 1Schematic diagram of the coding region of L. tarentolae (A), L. infantum (B), and T. cruzi (C) maxicircles. The nomenclature and abbreviations are those established for L. tarentolae. Blue, green, and red boxes are used to indicate non-edited genes, edited genes, and pan-edited genes, respectively. The drawings are based on the gene coordinates reported elsewhere: L. tarentolae (GenBank ID M10126 and [47], L. infantum [19], and T. cruzi [42].
Figure 2Evolutionary analysis of the Trypanosomatidae family based on the CR of the maxicircle sequence. The evolutionary history was inferred using the maximum likelihood method and the Tamura–Nei model (Tamura and Nei 1993). The tree with the highest log likelihood (−166,927.58) is shown. The percentage of trees in which the associated taxa are clustered together is shown next to the branches. The tree is drawn to scale, with branch lengths representing the number of substitutions per site. This analysis involved 26 nucleotide sequences. Codon positions included were 1st + 2nd + 3rd + Noncoding. There was a total of 16,895 positions in the final dataset. All the evolutionary analyses were conducted in MEGA 11 [45].
Figure 3Phylogenic relationship among Mediterranean and American L. infantum isolates (A,B). The evolutionary history was inferred using the maximum likelihood method and the Tamura–Nei model [43]. The tree with the highest log likelihood (−21,148.45) is shown. The tree is drawn to scale, with branch lengths representing the number of substitutions per site. This analysis involved 63 nucleotide sequences. Codon positions included were 1st + 2nd + 3rd + Noncoding. There was a total of 16,285 positions in the final dataset. Brazilian isolates; Spanish isolates; Tunisian isolates; Turkish isolates; Cypriot isolates; L. donovani (outgroup). All the evolutionary analyses were conducted in MEGA 11 [45].