| Literature DB >> 30486770 |
Sumudu R Samarasinghe1, Nilakshi Samaranayake1, Udeshika L Kariyawasam1, Yamuna D Siriwardana1, Hideo Imamura2, Nadira D Karunaweera3.
Abstract
BACKGROUND: Leishmaniasis is a neglected tropical disease with diverse clinical phenotypes, determined by parasite, host and vector interactions. Despite the advances in molecular biology and the availability of more Leishmania genome references in recent years, the association between parasite species and distinct clinical phenotypes remains poorly understood. We present a genomic comparison of an atypical variant of Leishmania donovani from a South Asian focus, where it mostly causes cutaneous form of leishmaniasis.Entities:
Keywords: Aneuploidy; Copy number variations; Cutaneous leishmaniasis; Drug resistance; High throughput sequencing; Indian subcontinent; SNPs; Tropism; Virulence genes
Mesh:
Substances:
Year: 2018 PMID: 30486770 PMCID: PMC6262978 DOI: 10.1186/s12864-018-5271-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Chromosome somy distribution among Sri Lankan L. donovani isolates. a CL-SL isolates: Most chromosomes are disomic except for monosomic chromosome 2, trisomic chromosomes 22 and 26 and pentasomic chromosome 31 (b) VL-SL isolates: All chromosomes are disomic except for tetrasomic chromosome 31
Fig. 2Heat map showing the comparison of aneuploidy profiles of CL-SL and VL-SL isolates. Chromosome numbers are listed along the right side y-axis and L. donovani CL and VL isolates are listed along the bottom x-axis. The dendrogram on top of heat map indicates the clustering pattern of the 8 isolates based on their aneuploidy similarity. The colour bar below the dendrogram represents the phenotype of isolate (Red-CL-SL and Black-VL-SL). The colour key represents the ploidy level of chromosomes in the heat map: chromosome somy > 4 green, trisomy – yellow, disomy – orange and monosomy – red
Genes with highly variable copy numbers between Sri Lankan L. donovani CL-SL and VL-SL isolates
| Chr | GeneID | VL/CL Ratio | MWU-test p-value | Benjamini-Hochberg p-value | Product Description |
|---|---|---|---|---|---|
| 27 | LdBPK_271940 | 7.96 | 0.2717 | 0.3702 | D-Lactate Dehydrogenase-Like Protein |
| 27 | LdBPK_271950 | 6.84 | 0.0012 |
| Branched-Chain Amino Acid Aminotransferase, Putative (Fragment) |
| 10 | LdBPK_100520 | 4.698 | 0.0000 |
| Gp63, Leishmanolysin |
| 29 | LdBPK_291600 | 4.41 | 0.0000 |
| Nodulin-Like, Putative |
| 23 | LdBPK_230360 | 4.22 | 0.0000 |
| Hypothetical Protein, Conserved |
| 23 | LdBPK_230350 | 3.86 | 0.0022 |
| Hypothetical Protein, Conserved |
| 23 | LdBPK_230330 | 3.85 | 0.0133 |
| Checkpoint Protein Hus1, Putative |
| 25 | LdBPK_210020 | 3.80 | 0.0009 |
| Histone H4 |
| 10 | LdBPK_100920 | 3.55 | 0.8462 | 0.8817 | Histone H3 |
| 23 | LdBPK_230340 | 3.47 | 0.0001 |
| Tryptophanyl-Trna Synthetase, Putative |
| 23 | LdBPK_230320 | 3.06 | 0.2234 | 0.3163 | Zinc Finger, C3Hc4 Type (Ring Finger), Putative |
| 23 | LdBPK_230310 | 3.02 | 0.7210 | 0.7877 | Pteridine Reductase 1 |
| 11 | LdBPK_110640 | 2.95 | 0.0922 | 0.1456 | Metallo-Peptidase, Clan Mf, Family M17 (Fragment) |
| 29 | LdBPK_291590 | 2.90 | 0.3598 | 0.4576 | Asparagine Synthase, Putative |
| 10 | LdBPK_100510 | 2.55 | 0.2285 | 0.3201 | Gp63, Leishmanolysin (Fragment) |
| 23 | LdBPK_230290* | 2.49 | 0.8027 | 0.8474 | Abc-Thiol Transporter (Fragment) |
| 23 | LdBPK_230300 | 2.47 | 0.8735 | 0.9026 | Argininosuccinate Synthase, Putative |
| 23 | LdBPK_230250* | 2.47 | 0.0540 | 0.0905 | Hypothetical Protein, Conserved |
| 35 | LdBPK_353690 | 2.38 | 0.0001 |
| Hypothetical Protein, Conserved |
| 22 | LdBPK_221370 | 2.36 | 0.0350 | 0.0616 | 40S Ribosomal Protein L14, Putative |
| 31 | LdBPK_310470 | 2.28 | 0.6628 | 0.7405 | Hypothetical Protein |
| 23 | LdBPK_230370 | 2.27 | 0.0000 |
| Hypothetical Protein, Conserved |
| 23 | LdBPK_230260* | 2.22 | 0.1946 | 0.2790 | Hypothetical Protein, Conserved |
| 25 | LdBPK_020010 | 2.20 | 0.1855 | 0.2691 | Aminopeptidase P1, Putative |
| 15 | LdBPK_150430 | 2.19 | 0.9484 | 0.9561 | 60S Acidic Ribosomal Protein, Putative |
| 27 | LdBPK_271490 | 2.13 | 0.1280 | 0.1947 | Hypothetical Protein, Conserved |
| 11 | LdBPK_111070 | 2.13 | 0.3303 | 0.4289 | Hypothetical Protein |
| 28 | LdBPK_282050 | 2.12 | 0.4786 | 0.5652 | Zinc Transporter 3, Putative |
| 34 | LdBPK_341720 | 2.10 | 0.9113 | 0.9300 | Amastin-Like Surface Protein, Putative (Fragment) |
Benjamini-Hochberg critical value for a false discovery rate < 0.05 was considered significant and marked in bold
Genes previously reported with high copy number variations between a cutaneous and a visceral isolate from Sri Lanka [14]
Tandem gene arrays with highly variable copy numbers between L. donovani CL-SL and VL-SL isolates
| Chr | OrthoMCL ID | Product description | No of genes in ref | Haploid number CL | Haploid number VL | VL/CL ratio |
|---|---|---|---|---|---|---|
| 27 | OG5_127093 | D-lactate dehydrogenase-like protein | 1 | 4 | 33 | 7.96 |
| 27 | OG5_126731 | Branched-chain amino acid aminotransferase, putative | 1 | 5 | 35 | 6.84 |
| 29 | OG5_131290 | Nodulin-like, putative | 1 | 1 | 6 | 4.41 |
| 23 | OG5_154577 | Hypothetical protein, conserved | 1 | 1 | 4 | 4.22 |
| 23 | OG5_181194 | Hypothetical protein, conserved | 1 | 1 | 5 | 3.86 |
| 23 | OG5_150800 | Checkpoint protein Hus1 | 1 | 1 | 5 | 3.85 |
| 21 | OG5_126575 | Histone H4 | 1 | 1 | 2 | 3.80 |
| 23 | OG5_127096 | Tryptophanyl-trna synthetase, putative | 1 | 1 | 5 | 3.47 |
| 23 | OG5_180874 | Zinc finger, C3Hc4 type (ring finger) containing protein, putative | 1 | 1 | 4 | 3.06 |
| 23 | OG5_133937 | Pteridine reductase 1 | 1 | 1 | 4 | 3.02 |
| 10 | OG5_126749 | Major surface protease Gp63, putative | 2 | 7 | 21 | 3.00 |
| 34 | OG5_130729 | Amastin-like surface protein-like protein | 2 | 4 | 2 | 0.50 |
| 22 | OG5_126823 | Ribonucleoside-diphosphate reductase small chain, putative | 3 | 2 | 1 | 0.50 |
| 12 | OG5_136657 | Protein of unknown function (Duf962), putative | 2 | 2 | 1 | 0.50 |
| 8 | OG5_132981 | Amastin, putative | 1 | 2 | 1 | 0.44 |
Effects of SNPs and short indels identified in Sri Lankan L. donovani isolates on coding regions
| Variant type | Common to VL & CL | Unique to CL | Unique to VL |
|---|---|---|---|
| Frame shift | 52 | 15 | 31 |
| Stop gained | 21 | 3 | 7 |
| Stop lost | 8 | 2 | 4 |
| Start lost | 2 | 2 | 1 |
| Stop retained | 13 | 3 | 3 |
| Non synonymous coding | 12,046 | 1576 | 2804 |
| Synonymous coding | 12,181 | 1558 | 2507 |
| Initiator codon variant | 1 | – | – |
| Disruptive inframe deletion | 30 | 4 | 3 |
| Disruptive inframe insertion | 35 | 4 | 10 |
Fig. 3Genes with highly variable copy numbers on chromosome 27 among CL-SL and VL-SL isolates. LdBPK_271940 (Chr27: 798359–799,840) and LdBPK_271950 (Chr27: 803591–804,793). The two genes show a drastic increase in copy number among VL-SL isolates (VL28 and VL38) compared to CL-SL isolates (L2339, L2343, L2372, R5 and R7). As an exception L2331 CL isolate (red) shows a somewhat higher increase in copy number of the genes than other CL-SL isolates. Figure was generated using R graphics [73]
Fig. 4Enriched GO terms within the highly variable genes among CL-SL and VL-SL isolates. GO terms found to be overrepresented with a p < 0.05 are shown for biological processes in VL-SL (Panel a) and CL-SL (Panel b). A web-based tool, REVIGO was used to remove redundant GO terms among the gene set based on a chosen semantic similarity measure of 0.5. Remaining GO terms were represented as circles in a two dimensional scatter plot, where the size of the circles indicates the frequency of the GO term it represents and the distance between the circles indicates the semantic similarity between the corresponding GO terms. Circles closer to each other in the plot represent semantically similar GO terms. Circle color indicates the enriched GO term p-values according to the color key given in the plot. Number within each circle corresponds to the GO term as per Additional file 1: Table S9. Plot axes x and y is a general representative measure of semantic space
Fig. 5Relatedness and genetic divergence levels between the 8 Sri Lankan L. donovani isolates. a Heat map and associated dendrogram based on the similarity between the SNP profiles of 8 isolates. Each colored square in heat map represents the level of SNP similarity for an isolate listed along the y-axis compared to the isolate listed on the x-axis according to the colour key given. The colour bar below the dendrogram represents the phenotype of isolate: Red-CL-SL and Black-VL-SL. Three distinct clades are visible. CL-SL, VL-SL and CL-PR. L2331 CL isolate is an exception that it clusters with the VL group. b Principal component analysis based on a pruned SNP set of 8 isolates shows the same results as above (c) Phylogenetic tree based on maximum-likelihood method showing the genetic divergence levels between different Leishmania strains
Heterozygosity and homozygosity within the CL-SL and VL-SL genomes
| Isolate | Ho | He | Total SNPs | F (inbreeding coefficient) |
|---|---|---|---|---|
| VL38 | 26,663 | 57,592 | 84,255 | −1.28637 |
| L2331 | 38,248 | 58,153 | 96,401 | −1.21113 |
| VL28 | 13,098 | 24,524 | 47,622 | −1.10192 |
| R7 | 39,283 | 41,672 | 80,955 | −0.77418 |
| R5 | 44,411 | 40,095 | 84,506 | −0.65821 |
| L2343 | 56,029 | 6051 | 62,080 | 0.74412 |
| L2339 | 57,733 | 4071 | 61,804 | 0.83690 |
| L2372 | 52,717 | 2273 | 54,990 | 0.90426 |
Heterozygosity and homozygosity identified within the 8 isolate genomes, among a total of 100,318 SNPs. Number of heterozygous SNPs (He), homozygous SNPs (Ho) and inbreeding coefficient (F) were calculated using VCFtools
Comparison of SNP/Indel profiling against different sequences of L. donovani
| SNP count | Indel count | |||
|---|---|---|---|---|
| LdBPK282A1 (Standard reference) | LdBPKPAC2016beta (PacBio sequence) | LdBPK282A1 (Standard reference) | LdBPKPAC2016beta (PacBio sequence) | |
| Shared by both CL and VL | 56,560 | 57,063 | 1674 | 1669 |
| Unique to CL | 21,799 | 24,858 | 2109 | 2140 |
| Unique to VL | 8688 | 7920 | 159 | 161 |
Fig. 6Comparison of read coverage across H-locus region of chromosome 23 among Sri Lankan L. donovani CL and VL isolates. a L. donovani H-locus region is defined between the coordinates 80,157 bp – 102,529 bp on chromosome 23. Few isolates were chosen randomly among CL and VL groups to clearly represent the difference in amplification of the H locus region; CL isolates - L2339 (Green), L2343 (Blue), R5 (Black) and VL isolate - VL38 (Red). Amplification of the H-locus region in VL-SL compared to CL-SL isolates were clearly observed in genes; Terbinafine resistance locus protein (LdBPK_230280: 89042–89,611), MRPA (LdBPK_230290: 91285–95,325) and Argininosuccinate synthase gene (LdBPK_230300: 96936–98,192). Figure was generated using Artemis tool (b) Read coverage across chromosome 23 in six CL-SL and two VL-SL L. donovani isolates. Amplification of the H-locus (80,157 bp – 102,529 bp) is clearly seen among VL-SL VL38 isolate (In VL28 the H-locus amplification was not prominent probably due to comparatively low coverage of the sample) and one CL-SL isolate was an exception (L2331). Figure was generated using R graphics
Genes with highly variable copy numbers between L. donovani CL-WT and CL-PR
| Chr | GeneID | CL-PR/CL-WT ratio | MWU-test p-value | Benjamini-Hochberg p-value | Product Description |
|---|---|---|---|---|---|
| 32 | LdBPK_323700 | 3.42 | 0.0032 | 0.0087 | Hypothetical Protein, Conserved |
| 34 | LdBPK_044270 | 2.35 | 0.1976 | 0.2635 | Hypothetical Protein, Conserved |
| 17 | LdBPK_081310 | 2.26 | 0.4180 | 0.4932 | Hypothetical Protein, Conserved |
| 34 | LdBPK_341300 | 1.86 | 0.0102 | 0.0221 | Hypothetical Protein, Conserved |
| 1 | LdBPK_010160 | 1.83 | 0.0167 | 0.0347 | Lsm Domain Containing Protein, Putative |
| 32 | LdBPK_322060 | 1.80 | 0.0245 | 0.0471 | Hypothetical Protein |
| 34 | LdBPK_341720 | 1.80 | 0.0839 | 0.1364 | Amastin-Like Surface Protein, Putative (Fragment) |
| 12 | LdBPK_120667 | 1.79 | 0.0000 | 0.0000 | Hypothetical Protein (Fragment) |
| 12 | LdBPK_120210 | 1.70 | 0.1487 | 0.2092 | Hypothetical Protein, Conserved |
| 19 | LdBPK_191070 | 1.69 | 0.0054 | 0.0134 | Hypothetical Protein, Unknown Function |
| 9 | LdBPK_090180 | 1.68 | 0.4268 | 0.4932 | Atg8/Aut7/Apg8/Paz2, Putative |
| 21 | LdBPK_211640 | 1.68 | 0.0000 | 0.0001 | Zinc Finger Domain-Like Protein |
| 30 | LdBPK_301560 | 1.68 | 0.3125 | 0.3964 | Hypothetical Protein |
| 8 | LdBPK_080950 | 1.67 | 0.0981 | 0.1545 | Cathepsin L-Like Protease (Fragment) |
| 15 | LdBPK_151140 | 1.66 | 0.0000 | 0.0000 | Tryparedoxin Peroxidase |
Tandem gene arrays with highly variable copy numbers between L. donovani CL-WT and CL-PR
| Chr | OrthoMCL ID | Product description | Haploid CL-WT | Haploid CL-PR | CL-PR/CL-WT ratio |
|---|---|---|---|---|---|
| 32 | OG5_134607 | Hypothetical protein, conserved | 1 | 5 | 3.42 |
| 34 | OG5_131023 | Quinonoid dihydropteridine reductase | 1 | 3 | 2.35 |
| 34 | OG5_143904 | Amastin surface glycoprotein, putative | 3 | 5 | 1.80 |
| 12 | OG5_145891 | Hypothetical protein, conserved | 1 | 2 | 1.70 |
| 21 | OG5_136877 | Palmitoyl acyltransferase 3, putative | 1 | 2 | 1.68 |
| 32 | OG5_157991 | Hypothetical protein, conserved | 1 | 2 | 1.65 |
| 32 | OG5_166761 | GIPL galf transferase, putative | 1 | 2 | 1.65 |
| 36 | OG5_184184 | Hypothetical protein, conserved | 1 | 2 | 1.62 |
| 35 | OG5_204252 | Hypothetical protein, conserved | 1 | 2 | 1.59 |
| 4 | OG5_148241 | Hypothetical protein, conserved | 2 | 3 | 1.58 |
| 36 | OG5_126983 | 40S ribosomal protein S9, putative | 1 | 2 | 1.57 |
| 36 | OG5_158724 | Histidine secretory acid phosphatase, putative | 1 | 2 | 1.56 |
| 35 | OG5_130002 | Yos1-like, putative | 1 | 2 | 1.51 |
| 36 | OG5_127039 | Sugar efflux transporter for intercellular exchange, putative | 1 | 2 | 1.50 |
| 35 | OG5_128319 | ADP-ribosylation factor-like 2, arl2, putative | 1 | 2 | 1.50 |
| 27 | OG5_127093 | D-lactate dehydrogenase-like protein | 5 | 2 | 0.41 |