| Literature DB >> 31616465 |
Stephen R Doyle1, Geetha Sankaranarayanan1, Fiona Allan2, Duncan Berger1, Pablo D Jimenez Castro3,4, James Bryant Collins3, Thomas Crellen1,5, María A Duque-Correa1, Peter Ellis1, Tegegn G Jaleta6, Roz Laing7, Kirsty Maitland7, Catherine McCarthy1, Tchonfienet Moundai8, Ben Softley1, Elizabeth Thiele9, Philippe Tchindebet Ouakou8, John Vianney Tushabe1,10, Joanne P Webster11, Adam J Weiss12, James Lok6, Eileen Devaney7, Ray M Kaplan3, James A Cotton1, Matthew Berriman1, Nancy Holroyd1.
Abstract
Whole-genome sequencing is being rapidly applied to the study of helminth genomes, including de novo genome assembly, population genetics, and diagnostic applications. Although late-stage juvenile and adult parasites typically produce sufficient DNA for molecular analyses, these parasitic stages are almost always inaccessible in the live host; immature life stages found in the environment for which samples can be collected non-invasively offer a potential alternative; however, these samples typically yield very low quantities of DNA, can be environmentally resistant, and are susceptible to contamination, often from bacterial or host DNA. Here, we have tested five low-input DNA extraction protocols together with a low-input sequencing library protocol to assess the feasibility of whole-genome sequencing of individual immature helminth samples. These approaches do not use whole-genome amplification, a common but costly approach to increase the yield of low-input samples. We first tested individual parasites from two species spotted onto FTA cards-egg and L1 stages of Haemonchus contortus and miracidia of Schistosoma mansoni-before further testing on an additional five species-Ancylostoma caninum, Ascaridia dissimilis, Dirofilaria immitis, Strongyloides stercoralis, and Trichuris muris-with an optimal protocol. A sixth species-Dracunculus medinensis-was included for comparison. Whole-genome sequencing followed by analyses to determine the proportion of on- and off-target mapping revealed successful sample preparations for six of the eight species tested with variation both between species and between different life stages from some species described. These results demonstrate the feasibility of whole-genome sequencing of individual parasites, and highlight a new avenue toward generating sensitive, specific, and information-rich data for the diagnosis and surveillance of helminths.Entities:
Keywords: DNA extraction; diagnostics; genomics; helminths; low input; whole-genome sequencing
Year: 2019 PMID: 31616465 PMCID: PMC6764475 DOI: 10.3389/fgene.2019.00826
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Summary of the species, life stages, and conditions tested using one or more of the DNA extraction approaches followed by whole-genome sequencing.
| Samples sequenced per | ||||||||
|---|---|---|---|---|---|---|---|---|
| Species | Species | Life stage | Life stage | BSP | CGP | FGM | NXT | PIP |
|
| HC | Egg | EGG | 10 | 10 | 4 | 10 | 10 |
| HC | Egg (frozen) | EGGf | 6 | 6 | 0 | 6 | 6 | |
| HC | Larval stage L1 | LS1 | 10 | 10 | 4 | 10 | 10 | |
|
| SM | Miracidia | MIR | 59 | 36 | 10 | 51 | 12 |
|
| AC | Egg | EGG | 0 | 8 | 0 | 0 | 0 |
| AC | Larval stage L1 | LS1 | 0 | 8 | 0 | 0 | 0 | |
| AC | Larval stage L3 | LS3 | 0 | 8 | 0 | 0 | 0 | |
|
| AD | Egg | EGG | 0 | 8 | 0 | 0 | 0 |
|
| DI | Microfilaria | MFL | 0 | 8 | 0 | 0 | 0 |
|
| SS | Free living females | FL | 0 | 8 | 0 | 0 | 0 |
| SS | Larval stage L1 | LS1 | 0 | 8 | 0 | 0 | 0 | |
| SS | Larval stage L3 | LS3 | 0 | 8 | 0 | 0 | 0 | |
|
| TM | Egg | EGG | 0 | 8 | 0 | 0 | 0 |
| TM | Egg (bleached) | EGGb | 0 | 8 | 0 | 0 | 0 | |
| TM | Larval stage L1 | LS1 | 0 | 8 | 0 | 0 | 0 | |
|
| DM | Larval stage L1 | LS1 | 0 | 0 | 0 | 129 | 0 |
| Total samples per kit (n = 497 total) | 85 | 150 | 18 | 206 | 38 | |||
Breakdown of sequencing strategies per species based on whole-genome sequencing at 30× coverage and whole-genome sequencing to achieve 100× whole mitochondrial genome coverage.
| Species / stage | Genome assembly | Number of samples that can be multiplexed 2 | mtDNA / | Sampled multiplexed targeting mtDNA | Nuclear genome coverage per 100× mtDNA genome |
|---|---|---|---|---|---|
|
| 409 | 173 | 142.77 | 7418 | 0.70 |
|
| 283 | 250 | 70.76 | 5313 | 1.41 |
|
| 283 | 250 | 44.55 | 3345 | 2.24 |
|
| 103 | 688 | 70.81 | 14608 | 1.41 |
|
| 88 | 805 | 14.27 | 3446 | 7.01 |
|
| 42 | 1687 | 11.06 | 5597 | 9.04 |
1Genome size: Obtained from Wormbase Parasite v12.
2Multiplex: total number of samples per NovaSeq 6000 sequencing run with S4 2 × 150 bp PE chemistry generating 2.5 Tb of data and accounting for 85% mapping rate. Theoretically possible, but may be limited by barcoding available.
3mtDNA/nuclear ratio: based on coverage of mitochondrial and nuclear-derived sequencing reads, normalized to the nuclear genome coverage.
Figure 1Comparison of sequencing library concentration (ng/µl) as a proxy for relative DNA extraction efficiency for the five DNA extraction protocols tested. DNA from multiple life stages were extracted, including from freshly collected egg (EGG), frozen eggs (EGGf), and L1 stage larvae (LS1) of Haemonchus contortus, and miracidia (MIR) of Schistosoma mansoni. The number of samples in each comparison is presented in , and the raw data used are described in .
Figure 2Impact of sequencing library concentration on mapped on-target reads for five low-input DNA extraction protocols. Each point represents an individual sequencing library sample, colored by extraction protocol (A) or proportion of putative contaminating reads (100 - percent_unclassified_reads) (B). The dashed grey vertical line represents a library concentration of 0.25 ng/µl. Extraction protocols were performed on Haemonchus contortus eggs (EGG), frozen eggs (EGGf), and L1 (LS1) stages, and Schistosoma mansoni miracidia (MIR). The number of samples in each comparison is presented in , and the raw data used are described in .
Figure 3Impact of sequencing library concentration on mapped on-target reads from multiple life stages of seven helminth species. Each point represents an individual sequencing library samples, colored by life stage (A) or proportion of putative contaminating reads (100 - percent_unclassified_reads) (B). DNA was extracted from all samples using the CGP protocol. Consistent with , the dashed grey vertical line represents 0.25 ng/µl. Egg, untreated egg; EGGb, bleached egg; EGGf, frozen egg; FL, full-length female; LS1, larval stage L1; LS3, larval stage L3; MFL, microfilaria; MIR, miracidia. The number of samples in each comparison is presented in .