| Literature DB >> 26754142 |
Dingchen Li1,2, Zongwei Li3,4, Zhe Zhou5,6, Zhen Li7,8, Xinyan Qu9,10, Peisong Xu11, Pingkun Zhou12, Xiaochen Bo13,14, Ming Ni15,16.
Abstract
Next-generation sequencing (NGS) enables the recovery of pathogen genomes from clinical samples without the need for culturing. Depletion of host/microbiota components (e.g., ribosomal RNA and poly-A RNA) and whole DNA/cDNA amplification are routine methods to improve recovery results. Using mixtures of human and influenza A virus (H1N1) RNA as a model, we found that background depletion and whole transcriptome amplification introduced biased distributions of read coverage over the H1N1 genome, thereby hampering genome assembly. Influenza serotyping was also affected by pretreatments. We propose that direct sequencing of noncultured samples without pretreatment is a favorable option for pathogen genome recovery applications.Entities:
Mesh:
Year: 2016 PMID: 26754142 PMCID: PMC4710016 DOI: 10.1186/s13062-016-0105-x
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Fig. 1Identification, serotyping and genome recovery of influenza A virus based on reads. Expected proportions of H1N1 within mixed RNA samples were 0.55 % (a, and c) and 1.5 % (b, and d). a and b Percentages of reads aligned to influenza A virus among all reads passing quality control. Black dashed lines denote expected proportion of influenza A virus. c and d Influenza A virus-aligned read distribution for influenza A viral RNA segments. Influenza A virus-aligned reads of “BD + WTA (8 h)” are insufficient; thus, their distribution is not shown. e Site depths based on NGS read alignments on eight H1N1 RNA segments are shown as a filled area graph, colored by experimental condition. Segment names and their positions are labeled at top and bottom, respectively. Assembled contig alignments are denoted by thick lines (Velvet assembly) or thin lines (Trinity assembly) up the site depth profiles. f H1N1 genome recovery coverage with increasing numbers of random sampling reads. For a given read number, influenza A virus-aligned reads were randomly sampled 10 times for de novo assembly, and the average coverage values are shown
Genome de novo assembly
| Treatmenta | Trinity assembler | Velvet assembler | IDBA-UD assembler | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Contig total size (bp)b | Genome coverage (%) | H1 + N1 coverage (%) | Contig total size (bp)b | Genome coverage (%) | H1 + N1 coverage (%) | Contig total size (bp)b | Genome coverage (%) | H1 + N1 coverage (%) | |
| BD (0.55 %)a | 12,987 | 95.3 | 98.3 | 12,163 | 89.2 | 89.5 | 13,035 | 95.6 | 95.0 |
| No pretreatment (0.55 %)a | 13,351 | 97.9 | 98.5 | 12,993 | 95.3 | 96.5 | 13,077 | 95.9 | 95.5 |
| BD + 8-h WTA (0.55 %)a | — | — | — | — | — | — | — | — | — |
| 8-h WTA (0.55 %)a | 12,912 | 94.7 | 97.3 | 12,649 | 92.8 | 90.9 | 10,723 | 78.7 | 83.8 |
| BD + 2-h WTA (1.50 %)a | 13,431 | 98.5 | 97.3 | 12,555 | 92.1 | 91.3 | 13,083 | 95.9 | 91.7 |
| No pretreatment (1.50 %)a | 13,461 | 98.7 | 98.5 | 13,064 | 95.8 | 99.6 | 13,240 | 97.1 | 94.4 |
aExpected proportions of H1N1 within mixed RNA samples are indicated in parentheses
bContigs generated by assemblers were aligned to the H1N1 reference genome. Contig total sizes were obtained from aligned contigs. Overlapping contig regions were counted only once