| Literature DB >> 30166611 |
Claudia Wylezich1, Anna Papa2, Martin Beer3, Dirk Höper4.
Abstract
Metagenomics is currently the only generic method for pathogen detection. Starting from RNA allows the assessment of the whole sample community including RNA viruses. Here we present our modular concerted protocol for sample processing for diagnostic metagenomics analysis of human, animal, and food samples. The workflow does not rely on dedicated amplification steps at any stage in the process and, in contrast to published methods, libraries prepared accordingly will yield only minute amounts of unclassifiable reads. We confirmed the performance of the approach using a spectrum of pathogen/matrix-combinations showing it has the potential to become a commonly usable analytical framework.Entities:
Mesh:
Year: 2018 PMID: 30166611 PMCID: PMC6117295 DOI: 10.1038/s41598-018-31496-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Sketch of the basic idea of a “one serves all” analytical framework[20]. Images used as symbols were obtained from the free websites https://pixabay.com/.
Figure 2Overview of the workflow and trials for its improvement. (A) Overview of the main steps of sample processing. The red numbers refer to the steps given in the procedure (Supplementary File 1). (B) Summary of different trials conducted to improve the workflow. NA, nucleic acid. Images used as symbols were obtained from the free websites https://pixabay.com/.
Figure 3Comparison of different disintegration methods. Shown is the RNA quality analysed using RNA 6000 Pico Chip (Bioanalyzer, Agilent). (A) Bacterial suspensions of exponentially growing Bacillus subtilis (B. subtilis – EX) or endospores (B. subtilis – SP), exponentially growing Staphylococcus aureus, or Escherichia coli; (B) samples of swine faeces; (C) pools of midges. Legend: L, ladder; (−), without disintegration step; C, cryoPREP; M, Micro-Dismembrator; T, TissueLyser. The labelling for the ladder (fragment lengths, nucleotides) is given on the left side of each image.
Tested Mycobacterium-containing tissues, the grade of infection as investigated by histology and cultivation (given for high-infected (+++) and low-infected (+) samples) and corresponding Cq values. MAP, Mycobacterium avium paratuberculosis; Cq, quantification cycle; C, cryoPREP; T, TissueLyser.
| Animal | Goat tissue | MAP infection | Cq value for C | Cq value for T |
|---|---|---|---|---|
| 13 | Lymph node | +++ | ||
| 14 | Lymph node | +++ | ||
| 15 | Lymph node | +++ | ||
| 8 | Ileum (Peyer’s patches) | +++ | no Cq | |
| 21 | Lymph node | + | ||
| 23 | Lymph node | + | no Cq | no Cq |
| 3 | Ileum (Peyer’s patches) | + | no Cq | no Cq |
Overview of pre-diagnosed samples containing known verified pathogens.
| Category | Sample | Host Species | Known Verified Pathogen | Pathogen Genome | Library ID | DNA/RNA | Total Number Reads | % Pathogen Reads | % Unclassified Reads | Platform | Accession numbers and reference |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Liquid | Cell-culture supernatant | Mouse | Rabies virus | ssRNA | 1343 | RNA | 60,275 | 17.0 | 0.64 | PGM | PRJEB21530[ |
| Cell-culture supernatant | Mouse | Rabies virus | ssRNA | 1233 | RNA | 145,158 | 84.9 | 0.62 | PGM | PRJEB21530[ | |
| Cell-culture supernatant | Sheep | Betacoronavirus | ssRNA | 2172 | RNA | 652,693 | 57.7 | 0.39 | MiSeq | PRJEB27711, | |
| Cell-culture supernatant | Green monkey | Rotavirus A | dsRNA | 2173 | RNA | 360,375 | 24.4 | 0.42 | MiSeq | PRJEB27711, | |
| Cell-culture supernatant | Green monkey | Arrabida virus | ssRNA | 921 | RNA | 911,414 | 37.7 | 0.046 | MiSeq | KP863799-801[ | |
| Faeces | Faeces | Pig | PEDV | ssRNA | 721 | RNA | 2,224,040 | 0.77 | 1.02 | MiSeq | PRJEB19039[ |
| Faeces | Pig | PEDV | ssRNA | 799 | RNA | 2,670,508 | 10.8 | 2.68 | MiSeq | PRJEB19039[ | |
| Faeces | Pig | PEDV | ssRNA | 1012 | RNA | 1,831,855 | 5.4 | 1.84 | MiSeq | PRJEB19039[ | |
| Faeces | Pig | PEDV | ssRNA | 1060 | RNA | 2,020,926 | 0.28 | 0.79 | MiSeq | PRJEB19039[ | |
| Faeces | Pig | PEDV | ssRNA | 1420 | RNA | 1,282,824 | 19.5 | 0.38 | MiSeq | PRJEB19039[ | |
| Stool (ethanol-fixed) | Human | Giardia | DNA | 2178 | RNA | 232,189 | 0.0009 | 0.33 | PGM | This study | |
| Stool (ethanol-fixed) | Human | Blastocystis | DNA | 2178 | RNA | 232,189 | 0.04 | 0.33 | PGM | This study | |
| Tissue | Brain | Dog | Rabies virus | ssRNA | 417 | RNA | 1,257,233 | 0.52 | 0.39 | MiSeq | LM645022[ |
| Brain | Red fox | Rabies virus | ssRNA | 1188 | RNA | 2,551,046 | 0.0014 | 0.98 | PGM | PRJEB27711 | |
| Brain | Arctic fox | Rabies virus | ssRNA | 985 | RNA | 329,625 | 2.8 | 1.01 | MiSeq | LT598540[ | |
| Brain | Cat | Rabies virus | ssRNA | 325 | RNA | 1,117,539 | 0.087 | 0.068 | MiSeq | LM645046[ | |
| Brain | Sheep | Rabies virus | ssRNA | 300 | RNA | 2,457,633 | 0.16 | 0.059 | MiSeq | LM645044[ | |
| Brain | Arctic fox | Rabies virus | ssRNA | 455 | RNA | 1,507,356 | 0.22 | 1.66 | MiSeq | LM645019[ | |
| Liver | Chicken | Sendai virus | ssRNA | 2019# | RNA | 1,249,386 | 0.017 | 0.53 | PGM | PRJEB27711 | |
| Liver | Chicken | Sendai virus | ssRNA | 2043# | RNA | 2,521,313 | 0.017 | 0.25 | PGM | PRJEB27711 | |
| Lymph node | Goat | MAP | DNA | 2099 | DNA | 2,356,712 | 0.001 | 0.17 | PGM | PRJEB27711 | |
| Intestine | Goat | MAP | DNA | 2100 | DNA | 1,677,552 | 0.0 | 0.17 | PGM | PRJEB27711 | |
| Food | Frozen berries | Strawberries | Norovirus | ssRNA | 1962 | RNA | 933,881 | 0.0 | 0.33 | PGM | This study |
| Muscle | Wild boar | Trichina | DNA | 1806 | RNA | 1,627,079 | 0.17 | 0.38 | PGM | PRJEB27711 | |
| Muscle | Wild boar | Liver fluke | DNA | 1949$ | RNA | 568,673 | 0.0065 | 0.68 | PGM | PRJEB27711 | |
| Muscle | Wild boar | Liver fluke | DNA | 1950$ | RNA | 491,774 | 0.0037 | 0.20 | PGM | PRJEB27711 | |
| Muscle | Wild boar | Liver fluke | DNA | 1951$ | RNA | 555,425 | 0.0074 | 0.25 | PGM | PRJEB27711 |
A graphical representation of read counts for the 100 most abundant families for each of the samples is provided in Supplementary File 2. Some datasets are not publicly available due to the EU General Data Protection Regulation and the Nagoya Protocol but are available from the corresponding authors on reasonable request.
# and $ technical replicates.
Abbreviations: MAP, Mycobacterium avium paratuberculosis; PEDV, porcine epidemic diarrhoea virus; PGM, Ion Torrent Personal Genome Machine; MiSeq, Illumina MiSeq.
Figure 4Proportion of bacterial reads found in datasets generated from selected consumables. The samples were processed with the present workflow starting from RNA or from DNA. The read abundance of bacterial reads (top twelve) is given as the percentage of the total dataset.
Overview of analysed samples of initially unknown pathogen content.
| Category | Sample | Host Species | Library ID | DNA/ RNA | Total Number Reads | % Archaeal Reads | % Bacterial Reads | % Eukaryotic Reads | % Viral Reads | % Unclassified Reads | Confirmed Pathogen | Platform | Accession numbers and reference |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Liquid | Cell-culture supernatant | Penguin | 1003 | DNA | 982,590 | 0.014 | 0.041 | 72.9 | 0.61 | 25.3 | Alphaherpesvirus | PGM | LT608136[ |
| Cell-culture supernatant | Penguin | 1004 | DNA | 1,413,128 | 0.041 | 0.046 | 74.1 | 0.74 | 23.8 | Alphaherpesvirus | PGM | LT608135[ | |
| Serum | Cattle | 126/11 | RNA | 26,749 | 0.022 | 26.7 | 69.8 | 0.42 | 2.7 | Schmallenberg virus | GS FLX | HE649912-14[ | |
| Serum | Cattle | 127/11 | RNA | 15,738 | 0.076 | 30.3 | 65.0 | 0.43 | 3.6 | ND | GS FLX | HE649912-14[ | |
| Serum | Cattle | 128/11 | DNA | 75,124 | 0.0013 | 0.091 | 98.0 | 0.004 | 1.7 | ND | GS FLX | HE649912-14[ | |
| Serum | Cattle | 129/11 | DNA | 83,988 | 0.0024 | 0.014 | 98.5 | ND | 1.3 | ND | GS FLX | HE649912-14[ | |
| Chest-cavity fluid | Squirrel | 651 | RNA + DNA | 37,816 | ND | 0.26 | 78.0 | 0.67 | 20.8 | VSBV-1 | MiSeq | PRJEB27711[ | |
| Oropharyngeal swab | Squirrel | 648 | RNA + DNA | 1,000,000# | 0.0069 | 0.70 | 76.3 | 0.081 | 22.9 | VSBV-1 | MiSeq | PRJEB27711[ | |
| Tap water$ | NA | 2091 | RNA | 50,602 | 0.012 | 51.6 | 44.1 | 0.6 | 3.1 | ND | PGM | This study | |
| Tap water | NA | 2092 | RNA | 73,681 | 0.012 | 71.5 | 26.0 | 0.77 | 1.7 | ND | PGM | This study | |
| Tap water$ | NA | 2093 | DNA | 2,277,162 | 0.12 | 54.9 | 2.2 | 0.12 | 38.5 | ND | PGM | This study | |
| Tap water | NA | 2094 | DNA | 1,864,200 | 0.13 | 58.4 | 2.8 | 0.054 | 35.0 | ND | PGM | This study | |
| Rumen | Cattle | 1005 | DNA | 917,596 | 1.1 | 33.5 | 6.3 | 0.19 | 56.1 | ND | PGM | PRJEB27711 | |
| Faeces | Bird faeces | NA | 2177 | RNA | 2,264,941 | 0.0015 | 51.0 | 36.8 | 0.44 | 10.2 | ND | PGM | This study |
| Tissue | Organ-pool (kidney, liver lung) | Squirrel | 652 | RNA + DNA | 349,819 | 0.0034 | 0.15 | 71.1 | 0.14 | 28.7 | VSBV-1 | MiSeq | PRJEB27711[ |
| Organ-pool (Heart, brain) | Squirrel | 653 | RNA + DNA | 367,299 | 0.0035 | 0.31 | 76.8 | 0.27 | 22.6 | VSBV-1 | MiSeq | PRJEB27711[ | |
| Brain | Cattle | 852 | RNA | 388,206 | 0.0093 | 49.4 | 44.2 | 0.47 | 5.8 | Bovine astrovirus | MiSeq | LN879482[ | |
| Organ-pool (brain, spinal cord, spleen) | Sheep | 1454 | RNA | 971,433 | 0.0001 | 0.005 | 95.8 | 0.13 | 0.66 | Ovine astrovirus | PGM | LT706531[ | |
| Organ-pool (brain, spinal cord, spleen) | Sheep | 1455 | RNA | 993,038 | ND | 0.004 | 95.1 | 0.91 | 0.67 | Ovine astrovirus | PGM | LT706530[ | |
| Vector | Pooled ticks | NA | 1163 | RNA | 192,549 | 0.001 | 8.4 | 89.4 | 0.009 | 0.59 | ND | PGM | This study |
| Pooled ticks | NA | 1164 | RNA | 2,210,546 | 0.0005 | 6.4 | 91.0 | 0.0005 | 0.58 | ND | PGM | This study | |
| Pooled midges | NA | 1081 | RNA | 2,545,182 | 0.047 | 7.1 | 74.5 | 0.18 | 18.2 | ND | MiSeq | This study | |
| Pooled midges | NA | 1082 | RNA | 1,429,726 | 0.013 | 3.1 | 91.4 | 0.023 | 5.5 | ND | MiSeq | This study | |
| Food | Leaf | Rocket | 1497 | RNA | 439,328 | ND | 0.064 | 97.6 | 0.15 | 0.32 | ND | PGM | PRJEB27711 |
| Fruiting body | Mushroom | 1469 | RNA | 2,826,378 | ND | 0.42 | 92.9 | 0.094 | 0.32 | ND | MiSeq | PRJEB27711 | |
| Pizza with mushrooms | NA | 1960 | RNA | 427,509 | 0.0002 | 0.55 | 94.5 | 0.054 | 1.5 | ND | PGM | PRJEB27711 | |
| Crude ham | Pig | 1496 | RNA | 582,692 | ND | 0.16 | 96.7 | 0.063 | 0.73 | ND | PGM | PRJEB27711 | |
| Meat loaf | NA | 1488 | RNA | 360,151 | 0.0006 | 3.6 | 93.3 | 0.11 | 0.76 | ND | PGM | PRJEB27711 |
A graphical representation of read counts for the 100 most abundant families for each of the samples is provided in Supplementary File 2. Some datasets are not publicly available due to the EU General Data Protection Regulation and the Nagoya Protocol but are available from the corresponding authors on reasonable request.
#partial random subset of the original published dataset.
$w/o CryoPrep treatment prior to nucleic acid extraction; the tap water libraries are derived from one sample split into four subsamples.
Abbreviations: VSBV-1 Variegated squirrel bornavirus 1; PGM, Ion Torrent Personal Genome Machine; MiSeq, Illumina MiSeq; GS FLX, Genome Sequencer FLX; ND, none detected.