| Literature DB >> 27027771 |
Nils Pilotte1,2, Marina Papaiakovou1, Jessica R Grant1, Lou Ann Bierwert1, Stacey Llewellyn3, James S McCarthy3, Steven A Williams1,2.
Abstract
BACKGROUND: The soil transmitted helminths are a group of parasitic worms responsible for extensive morbidity in many of the world's most economically depressed locations. With growing emphasis on disease mapping and eradication, the availability of accurate and cost-effective diagnostic measures is of paramount importance to global control and elimination efforts. While real-time PCR-based molecular detection assays have shown great promise, to date, these assays have utilized sub-optimal targets. By performing next-generation sequencing-based repeat analyses, we have identified high copy-number, non-coding DNA sequences from a series of soil transmitted pathogens. We have used these repetitive DNA elements as targets in the development of novel, multi-parallel, PCR-based diagnostic assays. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2016 PMID: 27027771 PMCID: PMC4814118 DOI: 10.1371/journal.pntd.0004578
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Fig 1Workflow for repeat analysis.
Output data from a next-generation sequencing run are uploaded to the RepeatExplorer Galaxy-based platform. During the QC and manipulation phase, the FASTQ Groomer tool is used to convert sequence reads into Sanger format. The FASTQ: READ QC tool is then used to verify the quality of the reads before removing unnecessary sequence (i.e. adapter sequences, etc.) from the ends of each read using the FASTQ Trimmer tool. The QC analysis is then repeated, and the FASTQ to FASTA converter tool is used to convert each read into FASTA format. Using these DNA sequence reads as input, sequences undergo clustering, during which an “all-to-all” sequence comparison is performed, and similar sequences are grouped together into clusters. Clusters containing the most highly repetitive sequences are then selected as putative diagnostic targets to be used for primer and probe-based real-time PCR assay design.
Fig 2Illustrative output from RepeatExplorer analysis of Necator americanus.
During “clustering” each nucleotide within a cluster is assigned a number. That number corresponds to how many individual next-generation sequencing reads that particular nucleotide appeared in. Using this output, a stretch of the most abundant nucleotides (depicted in green within the larger cluster’s sequence) is selected, and the corresponding nucleotides (highlighted in yellow) are selected as the candidate sequence from which the primers and probe are designed.
Fig 3Comparative probe testing.
For each novel probe design, FAM-TAMRA and double quenched FAM-ZEN-IOWA BLACK probes were synthesized. Comparative testing revealed that double quenched probes outperformed traditional probes, as evidenced by lower Ct values and greater ΔRn values. The plot above demonstrates these findings with the amplification of three concentrations of N. americanus template DNA using both double quenched (yellow) and traditional (blue) probe designs.
Selected primer and probe sequences for each multi-parallel assay.
| 5’- CCAGAATCGCCACAAATTGTAT -3’ | 5’- GGGTTTGAGGCTTATCATAAAGAA -3’ | 5'- /56-FAM/CCCGATTT G/ZEN/AGCTGAATTGTCA AA/3IABkFQ/ -3' | |
| 5’- GTATTTCACTCATATGATCGAGTGTTC -3’ | 5’- GTTTGAATTTGAGGTATTTCGACCA -3’ | 5'- /56-FAM/TGACAGTG T/ZEN/GTCATACTGTGGA AA/3IABkFQ/ -3' | |
| 5’- GGCGTAGAGGAGCGATTT -3’ | 5’- TACTACCCATCACACATTAGCC -3’ | 5'- /56-FAM/TTTGCGGGC/ZEN/G AGAACGGAAATA TT/3IABkFQ/ -3' | |
| 5’- CGCTCCAGAATTAGTTCCAGTT -3’ | 5’- GCAGCTTAGTCGAAAGCATAGA -3’ | 5'- /56-FAM/ACAGTCTC C/ZEN/AGTTCACTCCAGA AGAGT/3IABkFQ/ -3' | |
| 5’- GTAATAGCAGTCGGCGGTTTCTT -3’ | 5’- GCCCAACATGCCACCTATTC -3’ | 5’- /56-FAM/ TTGGCGGACAATTGCATGCGAT/MBG/ -3’ |
Comparative assay results for each species of parasite.
| 48 | 0 | |
| 12 | 19 | |
| 40 | 0 | |
| 7 | 32 | |
| 10 | 4 | |
| 8 | 57 | |
| 1 | 0 | |
| 0 | 78 | |
| 0 | 22 | |
| 0 | 57 |
* Two of these four samples contained T. ovis. The identity of the pathogen in the remaining 2 samples could not be determined due to a lack of material.
** 21 of these 22 samples contained A. ceylanicum. The identity of the pathogen in the 22nd sample could not be determined due to a lack of material.
Estimated Minimum Quantities of DNA in the Eggs of Each Species of STH.
| 244 Mb[ | 0.54 pg | 4–8 | 2.16 pg– 4.32 pg | |
| 317 Mb (GCA_000951055.1) | 0.70 pg | As few as 1 | 0.70 pg | |
| 75 Mb (GCA_000613005.1) | 0.17 pg | 1 | 0.17 pg | |
| 332 Mb (GCA_000816745.1) | 0.73 pg | 4–8 | 2.92 pg– 5.84 pg | |
| 42–60 Mb[ | 0.09 pg– 0.13 pg | 2–8 | 0.18 pg– 1.04 pg |
* As S. stercoralis eggs typically hatch in the intestinal lumen, this parasite is released into the stool as rhabditiform larvae which possess an even greater number of cells. Therefore, a patient stool sample harboring a single larval worm will contain a much higher quantity of parasite DNA than the quantity listed here.