| Literature DB >> 31830038 |
Malgorzata A Domagalska1, Hideo Imamura1, Mandy Sanders2, Frederik Van den Broeck1, Narayan Raj Bhattarai3, Manu Vanaerschot1, Ilse Maes1, Erika D'Haenens1, Keshav Rai3, Suman Rijal3, Matthew Berriman2, James A Cotton2, Jean-Claude Dujardin1,4.
Abstract
Whole genome sequencing (WGS) is increasingly used for molecular diagnosis and epidemiology of infectious diseases. Current Leishmania genomic studies rely on DNA extracted from cultured parasites, which might introduce sampling and biological biases into the subsequent analyses. Up to now, direct analysis of Leishmania genome in clinical samples is hampered by high levels of human DNA and large variation in parasite load in clinical samples. Here, we present a method, based on target enrichment of Leishmania donovani DNA with Agilent SureSelect technology, that allows the analysis of Leishmania genomes directly in clinical samples. We validated our protocol with a set of artificially mixed samples, followed by the analysis of 63 clinical samples (bone marrow or spleen aspirates) from visceral leishmaniasis patients in Nepal. We were able to identify genotypes using a set of diagnostic SNPs in almost all of these samples (97%) and access comprehensive genome-wide information in most (83%). This allowed us to perform phylogenomic analysis, assess chromosome copy number and identify large copy number variants (CNVs). Pairwise comparisons between the parasite genomes in clinical samples and derived in vitro cultured promastigotes showed a lower aneuploidy in amastigotes as well as genomic differences, suggesting polyclonal infections in patients. Altogether our results underline the need for sequencing parasite genomes directly in the host samples.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31830038 PMCID: PMC6932831 DOI: 10.1371/journal.pntd.0007900
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Fig 1Performance of SuSL-sequencing on clinical samples.
A. Enrichment of Leishmania DNA from clinical samples by SureSelect; Y axis shows percentage of reads mapping to the L. donovani reference genome following SureSelect enrichment. X axis is the pre-enrichment percentage of Leishmania DNA, estimated by either whole-genome sequencing or qPCR; dotted lines indicate no enrichment and 10-, 100- and 1000-fold enrichment. B. Read mapping statistics in the 63 samples after SureSelect enrichment. Upper panel shows evenness of genome coverage as the proportion of bases covered with a minimum of number of sequencing reads from 1 to 100 as indicated by the seven different lines. Lower panel shows the percentage of reads mapping to the Leishmania reference genome for each sample, shown on the x-axis. The x-axis is common to both panels. BM, Bone Marrow; SP, Spleen aspirate.
Fig 2Genomic diversity among clinical samples.
A. SNPs. Phylogenetic tree based on SureSelect enriched clinical samples (black circle) and previously sequenced isolates [8]. Diagram is a neighbor-joining phylogeny based on 197 variable sites. ISC2-ISC10 were sub-populations previously defined [8]; ISC11 is a novel group that includes clinical samples and isolates previously designated as ‘ungrouped’. Full sample identifiers are shown in S5A Fig. B. Karyotypes (High quality samples). Y axis shows normalized somy estimate for each chromosome (x axis). Points show central estimate and bars show one standard deviation around these estimates, calculated by the binned depth method. C. Local CNVs. Average copy number per cell of H- and M-loci per ISC group in SureSelect enriched clinical samples (Bone Marrow or Spleen aspirate; BM/SP) and in cultured isolates (promastigotes, Prom). Error bars show one standard deviation around the mean estimate. Only a single sample was available from bone marrow aspirates in ISC1, hence no standard deviation is shown.
Genotype differences observed in 12 paired samples: Clinical samples analyzed by SureSelect and derived isolates sequenced classically.
| Sample code | Clinical sample ISC genotype | Derived isolate ISC genotype | Total base difference* | Homozygous difference | Heterozygous difference | Indel difference | Karyotype (bone marrow) | Karyotype (derived isolate) |
|---|---|---|---|---|---|---|---|---|
| BPK77 | ISC4 | ISC6 | 47 | 20 | 7 | 4 | almost diploid** | trisomy on ch5 and 11 |
| BPK80 | UG | ISC4 | 42 | 17 | 8 | 3 | almost diploid | trisomic on ch2, 4, 7, 8, 9, 11, 12, 20, 26, 33 and 35. tetrasomic on ch23 |
| BPK87 | ISC6 | ISC4 | 55 | 21 | 13 | 4 | almost diploid | regular |
| BPK91 | ISC3 | ISC4 | 36 | 14 | 8 | 8 | almost diploid | trisomic on ch20, 26 and 33 |
| BPK156 | ISC3 | ISC1 | 68459 | 34146 | 167 | 6275 | almost diploid | monosomic on ch17 |
| BPK157 | ISC4 | ISC1 | 64254 | 29942 | 4370 | 8281 | almost diploid | trisomic on ch23, 33 and 35 |
| BPK177 | ISC9 | ISC5 | 55 | 26 | 3 | 3 | almost diploid | trisomic on ch2, 5, 8, 9, 11, 12, 13, 14, 20, 23, 27 and 32. tetrasomic on ch33 |
| BPK276 | ISC5 | ISC4 | 40 | 18 | 4 | 1 | monosomic on ch17 | trisomic on ch4, 11, 20, 23. tetrasomic on ch5 |
| BPK280 | ISC4 | UG | 21 | 9 | 3 | 4 | almost diploid | trisomic on ch1, 7, 8, 10, 13. 15, 20, 22, 23. tetrasomic on ch9 |
| BPK81 | ISC4 | ISC4 | 13 | 3 | 7 | LdAQP1: frameshift G/GA insertion | almost diploid | regular |
| BPK181 | ISC6 | ISC6 | 5 | 0 | 5 | 0 | almost diploid | trisomic on ch8, 9, 13 and 23 |
| BPK208 | ISC4 | ISC4 | 11 | 1 | 9 | 0 | almost diploid | trisomic on ch1, 7, 8, 10, 13, 15, 20, 22 and 23. tetrasomic on ch9 |
Total base difference* = Homozygous difference x 2 + Heterozygous difference. Almost diploid**: S-value was 2 in all chromosomes except the tetrasomic chromosome 31. Notes: (1) the S-values of chromosome 11 in all bone marrow samples were slightly higher due to amplification artefact and therefore, their values were assumed to be disomic. (2) The S-values of chromosome 31 of BPK87BM and BPK80BM were lower than the expected tetrasomic values due to lower genome coverage.
Fig 3Somy comparison between matched clinical samples and culture parasites.
A. Inferred somy for 3 samples for which SureSelect-enriched bone marrow (BM) samples and cultured isolates (MO) collected at the same time from the same clinical samples had matching ISC genotypes. Y axis shows normalized somy estimate for each chromosome (x axis). Points show central estimate and bars show one standard deviation around these estimates. Somy estimates and standard deviations for the bone marrow samples were based on the binned depth method while corresponding values for isolates were based on depth of each position. B. Evidence for polyclonal infections. Bars represent the proportion of sequencing reads showing the ISC-specific genotype or the reference (REF) genotype at loci with ISC-specific alleles. 1) Genotypes at ISC9 and ISC4 loci for SureSelect-enriched bone marrow sample (BPK471BM). 2) Genotypes at ISC4 and ISC3 loci for SureSelect-enriched bone marrow sample BPK296BM.