| Literature DB >> 27338614 |
Karin Troell1, Björn Hallström2, Anna-Maria Divne3, Cecilia Alsmark4,5, Romanico Arrighi2, Mikael Huss6, Jessica Beser2, Stefan Bertilsson7.
Abstract
BACKGROUND: Infectious disease involving multiple genetically distinct populations of pathogens is frequently concurrent, but difficult to detect or describe with current routine methodology. Cryptosporidium sp. is a widespread gastrointestinal protozoan of global significance in both animals and humans. It cannot be easily maintained in culture and infections of multiple strains have been reported. To explore the potential use of single cell genomics methodology for revealing genome-level variation in clinical samples from Cryptosporidium-infected hosts, we sorted individual oocysts for subsequent genome amplification and full-genome sequencing.Entities:
Keywords: Apicomplexa; Cryptosporidium; FACS; Multiple infection; Single cell genomics; Whole genome amplification
Mesh:
Year: 2016 PMID: 27338614 PMCID: PMC4917956 DOI: 10.1186/s12864-016-2815-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Conceptual graph of single cell genomics workflow applied to Cryptosporidium parvum. a Scattergram of side scatter and antigen-based fluorescence used for sorting individual cells. The gate for sorting is indicated by a circle and individual analyzed cells are marked by color. Panel with antibody stained oocysts to the left. b. Real-time MDA kinetics for individual amplified genomes color coded as in (a). Blanks are given in brown. c Small subunit rRNAscreening of amplified cells with gel electrophoreses and sequencing. Negative control and positive control with 10 cells are included for comparison
Genome properties. Reads were either mapped to the Cryptosporidium parvum IowaII reference genome or assembled de novo using Spades
| Total sequence (Mbp) | Read length | Mapped reads (%) | Fraction of genome covered (%) | Sequencing coverage | De Novo Genome Assembly Statistics | MDA statistics | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Size (% of reference) | Longest contig (bp) | NG50 | Ct (h)a | Effic-iencyb | Yield (AU)c | ||||||
| Cell 1 | 920 | 2 × 300 | 99.8 % | 94.7 % | 90.3 ± 157 | 91.8 % | 225 980 | 57 459 | 2.3 | 70 | 0.85 |
| Cell 2 | 1366 | 2 × 300 | 99.9 % | 93.8 % | 127 ± 225 | 90.7 % | 227 055 | 64 805 | 3.0 | 65 | 0.74 |
| Cell 3 | 2013 | 2 × 300 | 99.8 % | 87.8 % | 169 ± 540 | 80.6 % | 147 412 | 20 114 | 3.0 | 123 | 1.69 |
| Cell 4 | 341 | 2 × 300 | 99.6 % | 69.4 % | 30.6 ± 79.4 | 60.0 % | 67 943 | 1 513 | 2.8 | 99 | 0.69 |
| Cell 5 | 1218 | 2 × 300 | 99.8 % | 74.9 % | 105 ± 307 | 66.3 % | 119 204 | 4 309 | 3.3 | 91 | 0.56 |
| Cell 6 | 1460 | 2 × 300 | 99.9 % | 68.4 % | 111 ± 509 | 57.1 % | 73 330 | 1 398 | 2.3 | 97 | 2.17 |
| Cell 7 | 1470 | 2 × 300 | 99.8 % | 78.8 % | 90.3 ± 304 | 67.6 % | 203 151 | 5 364 | 2.8 | 102 | 0.60 |
| Cell 8 | 1460 | 2 × 300 | 99.8 % | 85.5 % | 112 ± 284 | 77.5 % | 117 185 | 11 229 | 2.3 | 192 | 1.52 |
| Cell 9 | 1360 | 2 × 300 | 99.8 % | 94.9 % | 118 ± 194 | 91.6 % | 547 655 | 65 774 | 3.3 | 44 | 0.50 |
| Cell 10 | 1877 | 2 × 300 | 99.8 % | 66.9 % | 115 ± 767 | 49.6 % | 57 954 | 687 | 3.5 | 72 | 0.60 |
| Meta-genome | 6273 | 2 × 250 | 84.2 % | 97.8 % | 535 ± 829 | 94.4 % | 95 102 | 7 076 | N.A. | N.A. | N.A. |
aThe number of hours to reach above the background fluorescence threshold
bThe exponential amplification of DNA in the exponential amplification phase where a value of 100 % represents a doubling in DNA concentration every 15 min
cThe total DNA fluorescence (arbitrary fluorescence units) at the end of the 16 h MD
Fig. 2Coverage statistics for single cell genomes of Cryptosporidium parvum. a Color coded heat map for the eight chromosomes of individual cells where red is < 1 × coverage (sequence missing), orange is 1–5 × coverage, yellow is 5–20 × coverage and green is >20 × coverage. Positions with ambiguity characters in the reference genome are colored gray. The combined data for all 10 single cell genomes (all cells combined), the full parent metagenome and the same metagenome subsampled to equal sequencing depth as an average single cell genome (1.3 Gbp) are included for reference. b Density profiles showing more detailed view of the sequencing depth for chromosome 3 with the ten single cell genomes given in color according to (a). “All cells combined” are shown with gray shading. Similar plots for all chromosomes can be found in Additional file 4. The bold black line represents the metagenome
Fig. 3Fractions of genome shared among single amplified genomes (0 to 10 cells) from Cryptosporidium sp. The “fraction of genome accounted for by N cells” at 0 “number of cells above threshold” represents the portion of the reference genome not accounted for in any of the cells at the indicated coverage. In contrast, the portion of the genome accounted for in all cells is found at 10 cells on the X-axis. Observed shared fraction of the genome is given at coverage cutoffs of 1, 5 and 20 ×
Genomic variation relative the Cryptosporidium parvum IowaII reference genome or the Uppsala1499 metagenome
| SNPs | Single cellsavs. IowaII | Single cellsa vs. Uppsala1499 | Uppsala1499 vs. IowaII |
|---|---|---|---|
| Total | 3350 | 210 | 3874 |
| C > T | 545 | 91 | 622 |
| G > A | 583 | 105 | 669 |
| A > G | 584 | 1 | 679 |
| T > C | 559 | 5 | 667 |
| T > A | 174 | 1 | 204 |
| A > T | 172 | 2 | 197 |
| C > A | 151 | 3 | 153 |
| G > T | 198 | 0 | 227 |
| G > C | 47 | 1 | 58 |
| C > G | 57 | 1 | 67 |
| T > G | 142 | 0 | 170 |
| A > C | 138 | 0 | 161 |
aStatistics refer to the 10 single cell genomes combined
Effect of detected mutations
| Comparison | All vs. IowaII | Single cells vs. Uppsala1499 |
|---|---|---|
| Downstream | 28 (0.8 %) | 2 (1.0 %) |
| Upstream | 1066 (31.8 %) | 38 (18.1 %) |
| Synonymous | 1063 (31.7 %) | 40 (19.0 %) |
| Missense | 1180 (35.2 %) | 118 (56.2 %) |
| Stop gained | 3 (0.1 %) | 12 (5.7 %) |
| Start lost | 1 (0.0 %) | 0 (0.0 %) |