| Literature DB >> 26318339 |
Stephen J Hadfield1, Justin A Pachebat2, Martin T Swain3, Guy Robinson4, Simon Js Cameron5, Jenna Alexander6, Matthew J Hegarty7, Kristin Elwin8, Rachel M Chalmers9.
Abstract
BACKGROUND: Whole genome sequencing (WGS) of Cryptosporidium spp. has previously relied on propagation of the parasite in animals to generate enough oocysts from which to extract DNA of sufficient quantity and purity for analysis. We have developed and validated a method for preparation of genomic Cryptosporidium DNA suitable for WGS directly from human stool samples and used it to generate 10 high-quality whole Cryptosporidium genome assemblies. Our method uses a combination of salt flotation, immunomagnetic separation (IMS), and surface sterilisation of oocysts prior to DNA extraction, with subsequent use of the transposome-based Nextera XT kit to generate libraries for sequencing on Illumina platforms. IMS was found to be superior to caesium chloride density centrifugation for purification of oocysts from small volume stool samples and for reducing levels of contaminant DNA.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26318339 PMCID: PMC4552982 DOI: 10.1186/s12864-015-1805-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Properties of Cryptosporidium oocyst and DNA preparations from two pilot phase stool samples
| Isolate, species and gp60 subtype | Preparation stage | qPCR 18S rRNA gene (CT) from original stool sample | Quantification of | Bacterial DNA calculated from qPCR 16 s rRNA gene (ng) (% reduction compared with salt-floated suspension) | Total DNA from Qubit (ng) | |
|---|---|---|---|---|---|---|
| Numbers of oocysts counted; genome copies derived from microscopy counts | Genome copies calculated from qPCR 18 s rRNA gene | |||||
| UKP2, | Salt-floated suspension | 28.8 | 1.0 ×106; 4.0×106 | 4.27×105 | 323.1 | ND |
| Post IMS, bleach treated suspension | 3.7×105; 1.5×106 | 2.99×105 | 4.2 (98.7 % reduction) | 2.44 | ||
| Post CsCl, bleach treated suspension | 1.3×105; 5.2 ×105 | 1.99×105 | NDa | 1.52 | ||
| UKH3, | Initial suspension | 26.7 | 1.0×106; 4.0×106 | 4.6×105 | 86.7 | ND |
| Post IMS, bleach treated suspension | 2.9×105; 1.2×106 | 1.3×106 | 4.4 (94.9 % reduction) | 2.68 | ||
ND, not done
aNot tested in order to preserve DNA as total DNA level was low
Assessment of Cryptosporidium preparation from 17 stool samples containing C. hominis or C. parvum oocysts
| Initial suspension prior to processing by IMS | DNA extracted from IMS purified oocysts | |||||||
|---|---|---|---|---|---|---|---|---|
| Sample | Species and | qPCR 18S rRNA CT | Number of oocysts counted | Visual condition of oocysts | qPCR 18S rRNA CT | Calculated genome copies | Mass of DNA (ng) | >1 ng DNA available for library preparation (isolate reference number) |
| 1 |
| ND | 1.9×107 | Good | 17.7 | 5.53×107 | 77 | Yes (UKH4) |
| 2 |
| 23.5 | 1.6×107 | Good | 20.0 | 1.24×107 | 49.55 | Yes |
| (UKH5) | ||||||||
| 3 |
| 29.3 | 2.1×105 | Good | 25.7 | 1.64×105 | Below threshold | No |
| 4 |
| 31.6 | 7.5×104 | Good | 25.9 | 1.49×105 | 0.00 | No |
| 5 |
| 31.1 | 5.0×104 | Good | 25.1 | 2.36×105 | 0.00 | No |
| 6 |
| 28.1 | 3.6×105 | OK - a few empty oocysts, didn't stain well | 28.6 | 2.66×104 | 0.00 | No |
| 7 |
| 29.1 | 1.5×106 | Good | 22.2 | 2.99×106 | 5.45 | Yes |
| (UKP3) | ||||||||
| 8 |
| 25.5 | 2.8×106 | Good | 20.1 | 5.99×106 | 18.65 | Yes |
| (UKP4) | ||||||||
| 9 |
| 27.5 | 9.1×105 | Good | 23.2 | 1.52×106 | 2.68 | Yes |
| (UKP5) | ||||||||
| 10 |
| 27.1 | 2.7×106 | Good | 23.4 | 1.32×106 | 4.40 | Yes |
| (UKP6) | ||||||||
| 11 |
| 23.8 | 9.0×106 | Good | 20.3 | 1.01×107 | 28.65 | Yes |
| (UKP7) | ||||||||
| 12 |
| 30.4 | 1.1×106 | Good | 24.4 | 3.74×105 | 0.8 | No |
| 13 |
| 29.2 | 1.1×106 | Good | 28.7 | 2.53×104 | 0.00 | No |
| 14 |
| 30.6 | 5.2×105 | OK | 31.8 | 3.58×103 | 0.00 | No |
| 15 |
| 34.0 | 3.9×104 | OK - a few empty oocysts | No amplicon detected | 0 | ND | No |
| 16 |
| 28.4 | 2.3×106 | Good | 21.9 | 3.62×106 | 6.55 | Yes |
| (UKP8) | ||||||||
| 17 |
| 30.9 | 1.2×106 | Good | 23.9 | 1.01×106 | 2.88 | Yesa |
aA technical problem resulted in no data being produced for this sample
Fig. 1Relationship between total oocyst numbers, CT values and concentration of recovered DNA. Oocyst numbers (counted by microscopy), and CT values in the salt-floated suspension were determined prior to processing, and compared to final DNA recovery. Red, WGS successful; Blue, <1 ng DNA available (below manufacturer’s threshold for library preparation)
Fig. 2Relationship between Cryptosporidium genome copies and total DNA in extracts prior to whole genome sequencing
Quality of whole genome sequences generated directly from stools containing C. hominis and C. parvum
| Isolate; species and | Total # base pairs sequenced (after merging reads) | Total # base pairs mapped to reference sequence | Proportion of | % of reference genome covered | Mean depth on reference sequence | % AT content of reads mapped to reference sequence |
|---|---|---|---|---|---|---|
| Pilot phase | ||||||
| UKH3 (IMS preparation) | 337,791,948 | 305,024,423a | 90.30 % | 98 % | 34.71 × | 67.49 % |
|
| ||||||
| PRJNA253834 | ||||||
| UKP2 (IMS preparation) | 510,081,295 | 471,881,392b | 92.51 % | 100 % | 51.80 × | 67.92 % |
|
| ||||||
| PRJNA253836 | ||||||
| UKP2 (CsCl preparation) | 521,277,984 | 426,692,177b | 81.86 % | 100 % | 46.84 × | 67.85 % |
|
| ||||||
| PRJNA253836 | ||||||
| Main phase | ||||||
| UKH4 | 2,164,426,378 | 1,828,866,488a | 84.50 % | 96 % | 209.17 × | 62.72 % |
|
| ||||||
| PRJNA253838 | ||||||
| UKH5 | 2,182,317,271 | 1,765,458,438a | 80.90 % | 96 % | 201.92 × | 63.03 % |
|
| ||||||
| PRJNA253839 | ||||||
| UKP3 | 1,703,132,267 | 1,514,828,932b | 88.94 % | 99 % | 166.42 × | 63.55 % |
|
| ||||||
| PRJNA253840 | ||||||
| UKP4 | 1,967,147,533 | 1,751,979,030b | 89.06 % | 99 % | 192.48 × | 63.60 % |
|
| ||||||
| PRJNA253843 | ||||||
| UKP5 | 288,922,509 | 244,528,063b | 84.63 % | 99 % | 26.86 × | 67.67 % |
|
| ||||||
| PRJNA253845 | ||||||
| UKP6 | 1,169,379,989 | 954,176,437b | 81.60 % | 99 % | 104.83 × | 67.73 % |
|
| ||||||
| PRJNA253846 | ||||||
| UKP7 | 795,715,168 | 708,613,859b | 89.05 % | 99 % | 77.85 × | 63.64 % |
|
| ||||||
| PRJNA253847 | ||||||
| UKP8 | 1,896,616,473 | 1,587,380,412b | 83.70 % | 98 % | 174.39 × | 63.20 % |
|
| ||||||
| PRJNA253848 |
amapped against C. hominis TU502 v1 reference sequence (Bioproject PRJNA15585)
bmapped against the C. parvum IOWA II v1 reference sequence (Bioproject PRJNA15586) using default mapping conditions
Summary of improvements made to the initial genome assemblies by PAGIT
| Isolate | Initial assembly statistics: No. N50 Av. (kb) | Gaps closed by IMAGE | ICORN Sequence corrections: SNP Indel | RATT Gene annotations transferred | Final assembly statistics: No. N50 Av. Size (kb) | Paired reads aligning concordantly (%) | Contaminant sequence NC BCT (kb) |
|---|---|---|---|---|---|---|---|
| UKH3 | 1902 94.7 4.9 | 330 | 210 27 | 3649 | 34 1000 268.7 9136 | 87.5 | 41.1 0.0 |
| UKH4 | 2375 29.1 4.0 | 343 | 584 99 | 3503 | 110 1032 84.4 9293 | 93.5 | 40.1 14.1 |
| UKH5 | 1305 29.9 7.0 | 437 | 189 61 | 3569 | 65 1006 142.4 9257 | 96.2 | 6.9 2.4 |
| UKP2 | 3084 105.2 3.1 | 174 | 182 48 | 3600 | 67 1011 136.7 9164 | 85.1 | 89.6 1.6 |
| UKP3 | 919 57.5 10.0 | 259 | 226 60 | 3546 | 55 1009 167.7 9224 | 90.2 | 2.0 0.0 |
| UKP4 | 1966 49.3 4.8 | 334 | 170 108 | 3553 | 52 891 177.4 9222 | 91.0 | 7.3 0.7 |
| UKP5 | 3830 9.7 2.4 | 1435 | 341 80 | 3527 | 47 1031 198.4 9329 | 88.6 | 31.4 14.6 |
| UKP6 | 16895 40.3 0.8 | 274 | 3144 394 | 3565 | 417 1014 22.7 9471 | 81.2 | 685.7 69.5 |
| UKP7 | 1158 30.2 7.8 | 321 | 185 80 | 3556 | 55 1026 168.7 9278 | 90.4 | 2.5 0.0 |
| UKP8 | 1792 42.0 5.2 | 356 | 152 47 | 3545 | 63 1015 147.4 9290 | 93.5 | 37.9 24.4 |
The assembly statistics (initial and final) include the number of scaffolds, scaffold N50 metric, scaffold mean length (Av.), and the total size of the final assembly. Gene annotations were transferred by RATT out of a total of 3805 gene annotations in the reference assembly. The “Contaminant sequence” column refers to the total length non-concordantly mapping read pairs that have been assembled separately and identified as non-cryptosporidium sequence (NC) and bacterial sequence (BCT)