| Literature DB >> 34687159 |
Na Zhao1, Jiabao Cao1,2, Jiayue Xu1, Beibei Liu3, Bin Liu4, Dingqiang Chen5, Binbin Xia1,2, Liang Chen1, Wenhui Zhang1,2, Yuqing Zhang1,2, Xuan Zhang1, Zhimei Duan4, Kaifei Wang4, Fei Xie4, Kun Xiao4, Wei Yan3, Lixin Xie4, Hongwei Zhou5, Jun Wang1,2.
Abstract
Fast and accurate identification of microbial pathogens is critical for the proper treatment of infections. Traditional culture-based diagnosis in clinics is increasingly supplemented by metagenomic next-generation-sequencing (mNGS). Here, RNA/cDNA-targeted sequencing (meta-transcriptomics using NGS (mtNGS)) is established to reduce the host nucleotide percentage in clinic samples and by combining with Oxford Nanopore Technology (ONT) platforms (meta-transcriptomics using third-generation sequencing, mtTGS) to improve the sequencing time. It shows that mtNGS improves the ratio of microbial reads, facilitates bacterial identification using multiple-strategies, and discovers fungi, viruses, and antibiotic resistance genes, and displaying agreement with clinical findings. Furthermore, longer reads in mtTGS lead to additional improvement in pathogen identification and also accelerate the clinical diagnosis. Additionally, primary tests utilizing direct-RNA sequencing and targeted sequencing of ONT show that ONT displays important potential but must be further developed. This study presents the potential of RNA-targeted pathogen identification in clinical samples, especially when combined with the newest developments in ONT.Entities:
Keywords: Oxford Nanopore Technology; direct RNA sequencing; metagenome; metatranscriptome
Mesh:
Substances:
Year: 2021 PMID: 34687159 PMCID: PMC8655164 DOI: 10.1002/advs.202102593
Source DB: PubMed Journal: Adv Sci (Weinh) ISSN: 2198-3844 Impact factor: 16.806
Summary of the clinical characteristics of the samples used in this study
| Characteristic | Overall | BALF | Blood | CSF |
|---|---|---|---|---|
|
| ||||
| Mean yr | 59.8 | 66.2 | 34.5 | 25.4 |
|
| ||||
| 0–1 yr | 7 (11.3) | 0 | 2 (14.3) | 5 (55.6) |
| 1–20 yr | 8 (12.9) | 3 (7.7) | 5 (35.7) | 0 |
| 21–60 yr | 17 (27.4) | 10 (5.6) | 4 (28.6) | 3 (33.3) |
| ≥60 yr | 30 (48.4) | 26 (66.7) | 3 (21.4) | 1 (11.1) |
|
| ||||
| Male | 50 (80.6) | 33 (84.6) | 11 (78.6) | 6 (66.7) |
| Female | 12 (19.4) | 6 (15.4) | 3 (21.4) | 3 (33.3) |
Figure 1Sequencing efficiency regarding the proportion of microbial reads and correlation in bacterial abundances between mtNGS and mNGS. A total of 62 samples, including 39 BALF samples, 14 blood samples, and nine CSF samples, were available for comparison in mtNGS and mNGS sequencing. A) The proportion of microbial mapping reads in combined samples and each sample type separately in mtNGS and mNGS. Percentages were calculated by dividing by total reads. For BALF and blood samples, an overall increase in microbial read ratios was observed for mtNGS (above the 1:1 line, relative to mNGS samples). B–D) Correlations of shared bacterial species calculated using multiple gene markers across all samples, as well as in each sample type in mtNGS and mNGS. The relative abundance of bacterial was defined as the proportion of marker reads for the respective bacteria relative to all reads and correlations were calculated using log(10) transformed nonzero proportions. R 2 denotes the square of the pearson correlation coefficient calculated using the stat_cor function. Markers were B) the 16S rRNA gene, C) 23S rRNA genes, D) and MetaPhlAn marker genes. The top ten species in terms of relative abundance across samples are indicated by colored dots, while others are indicated in gray. Total: All sample types involved in our study; BALF: bronchoalveolar lavage fluid samples; Blood: blood samples; CSF: cerebrospinal fluid samples.
Figure 2Improved viruses detection in mtNGS in comparison to mNGS. A) The correlation between the relative abundance of shared viruses using all samples and each individual type of sample in mtNGS and mNGS (n = 62 in each sequencing assay). The abundances of viruses were calculated by mapping against the viral genome from the NCBI viral genome database and the mapped number of reads was then divided into total reads and nonzero abundances were retained. R 2 and statistical significant were assessed using stat_cor function. The top ten viruses in terms of relative abundance across samples are indicated by colored dots, while others are indicated in gray. B) Venn diagram illustrating the overlap of the viral composition between mNGS and mtNGS. C) Distribution of relative abundances (log10‐transformed) of major virus groups in mtNGS and mNGS, showing approximately two‐magnitude higher viral reads in the results of mtNGS. Total: All sample types involved in our study; BALF: bronchoalveolar lavage fluid samples; Blood: blood samples; CSF: cerebrospinal fluid samples.
Figure 3Improved fungi and ARGs detection in mtNGS compared with mNGS. A) Venn diagram illustrating the overlap of fungi composition between mNGS and mtNGS, showing that mtNGS discovers many more fungi and covers the majority of mNGS findings. B) The correlation between the nonzero relative abundance of shared fungi in all samples and each individual type of sample (n = 62 in each sequencing assay), calculated by the percentage of reads mapped to 18S rRNA genes divided by the total reads, followed by a log10 transformation. R 2 and statistical significant were assessed using stat_cor function. The top ten fungi in terms of relative abundance across samples are indicated by colored dots, while others are indicated in gray. Total: All sample types involved in our study; BALF: bronchoalveolar lavage fluid samples; Blood: blood samples; CSF: cerebrospinal fluid samples. C) Distribution of high‐occurrence ARGs (61 ARGs appearing in 52–97% of the samples) across all samples and three different sample types, detected with the mtNGS and mNGS methods. ARGs in blue were detected using mNGS only, red using mtNGS only, and green using shared detection in both methods. The top and right‐hand stacked bars represent the number of ARGs and the frequency in each individual type of sample, respectively.
Figure 4mtTGS increased the mapping rate for major markers used for bacteria, fungi, and virus identifications. The percentage of microbial mapping reads (bacteria, fungi, and viruses) was calculated in each individual type of sample (n = 43) for mtTGS (green), mtNGS (red), and mNGS (blue). MtTGS had an overall approximately two‐magnitude higher percentage of microbial reads than mNGS, as well as a higher percentage of 18S/MetaPhlAn‐markers than mtNGS. Significant differences in pairwise comparisons between mtTGS versus mtNGS and between mtTGS versus mtNGS are shown. Statistical significant was assessed with Wilcoxon rank‐sum test. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001, ns, not significant; BALF: bronchoalveolar lavage fluid samples; Blood: blood samples; CSF: cerebrospinal fluid samples.
Figure 5Direct RNA sequencing further increased the mapping rate percentage. A) The proportion of microbial mapping reads in each individual type of sample (n = 11) using the mtNGS and ONT direct RNA platforms. B) The percentage of microbial mapping reads (bacteria using 16S, 23S, and MetaPhlAn markers, respectively; fungi using 18S; and viruses using reference genomes) was calculated in each individual type of sample (n = 11) by mtNGS and ONT direct RNA sequencing, followed by log10 transformed. BALF: bronchoalveolar lavage fluid samples; Blood: blood samples; CSF: cerebrospinal fluid samples.