Literature DB >> 33219095

Metagenomic Next-Generation Sequencing of Nasopharyngeal Specimens Collected from Confirmed and Suspect COVID-19 Patients.

Heba H Mostafa1, John A Fissel1, Brian Fanelli2, Yehudit Bergman1, Victoria Gniazdowski1, Manoj Dadlani2, Karen C Carroll1, Rita R Colwell2,3, Patricia J Simner4.   

Abstract

Metagenomic next-generation sequencing (mNGS) offers an agnostic approach for emerging pathogen detection directly from clinical specimens. In contrast to targeted methods, mNGS also provides valuable information on the composition of the microbiome and might uncover coinfections that may associate with disease progression and impact prognosis. To evaluate the use of mNGS for detecting severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and/or other infecting pathogens, we applied direct Oxford Nanopore long-read third-generation metatranscriptomic and metagenomic sequencing. Nasopharyngeal (NP) swab specimens from 50 patients under investigation for CoV disease 2019 (COVID-19) were sequenced, and the data were analyzed by the CosmosID bioinformatics platform. Further, we characterized coinfections and the microbiome associated with a four-point severity index. SARS-CoV-2 was identified in 77.5% (31/40) of samples positive by RT-PCR, correlating with lower cycle threshold (Ct) values and fewer days from symptom onset. At the time of sampling, possible bacterial or viral coinfections were detected in 12.5% of SARS-CoV-2-positive specimens. A decrease in microbial diversity was observed among COVID-19-confirmed patients (Shannon diversity index, P = 0.0082; Chao richness estimate, P = 0.0097; Simpson diversity index, P = 0.018), and differences in microbial communities were linked to disease severity (P = 0.022). Furthermore, statistically significant shifts in the microbiome were identified among SARS-CoV-2-positive and -negative patients, in the latter of whom a higher abundance of Propionibacteriaceae (P = 0.028) and a reduction in the abundance of Corynebacterium accolens (P = 0.025) were observed. Our study corroborates the growing evidence that increased SARS-CoV-2 RNA detection from NP swabs is associated with the early stages rather than the severity of COVID-19. Further, we demonstrate that SARS-CoV-2 causes a significant change in the respiratory microbiome. This work illustrates the utility of mNGS for the detection of SARS-CoV-2, for diagnosing coinfections without viral target enrichment or amplification, and for the analysis of the respiratory microbiome.IMPORTANCE SARS-CoV-2 has presented a rapidly accelerating global public health crisis. The ability to detect and analyze viral RNA from minimally invasive patient specimens is critical to the public health response. Metagenomic next-generation sequencing (mNGS) offers an opportunity to detect SARS-CoV-2 from nasopharyngeal (NP) swabs. This approach also provides information on the composition of the respiratory microbiome and its relationship to coinfections or the presence of other organisms that may impact SARS-CoV-2 disease progression and prognosis. Here, using direct Oxford Nanopore long-read third-generation metatranscriptomic and metagenomic sequencing of NP swab specimens from 50 patients under investigation for COVID-19, we detected SARS-CoV-2 sequences by applying the CosmosID bioinformatics platform. Further, we characterized coinfections and detected a decrease in the diversity of the microbiomes in these patients. Statistically significant shifts in the microbiome were identified among COVID-19-positive and -negative patients, in the latter of whom a higher abundance of Propionibacteriaceae and a reduction in the abundance of Corynebacterium accolens were observed. Our study also corroborates the growing evidence that increased SARS-CoV-2 RNA detection from NP swabs is associated with the early stages of disease rather than with severity of disease. This work illustrates the utility of mNGS for the detection and analysis of SARS-CoV-2 from NP swabs without viral target enrichment or amplification and for the analysis of the respiratory microbiome.
Copyright © 2020 Mostafa et al.

Entities:  

Keywords:  COVID-19; SARS-CoV-2; metagenomic next-generation sequencing; metagenomics; nasopharyngeal

Mesh:

Substances:

Year:  2020        PMID: 33219095      PMCID: PMC7686804          DOI: 10.1128/mBio.01969-20

Source DB:  PubMed          Journal:  mBio            Impact factor:   7.867


INTRODUCTION

Since the emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in December 2019, cases of coronavirus disease 2019 (COVID-19) have rapidly increased around the world. Research in this area has aggressively expanded, but to date, there have been few metagenomic next-generation sequencing (mNGS) studies of samples from COVID-19 patients. The first study identified SARS-CoV-2 via mNGS of RNA extracted from bronchoalveolar lavage (BAL) samples collected from two patients using an Illumina MiSeq platform (1). Within 6 days, the group was able to identify a novel coronavirus and report a complete genome, demonstrating the utility of mNGS in the early stages of novel pathogen discovery. Other approaches have included random primer metagenomic sequencing (sequence-independent single primer amplification [SISPA]) or metagenomic sequencing with spiked primer enrichment (MSSPE) to identify SARS-CoV-2 using mNGS methods with a limited sample size (2, 3). Most studies have applied an amplicon-based approach to detect and sequence SARS-CoV-2 directly from specimens (1, 2, 4–7). Perhaps one of the greatest advantages of mNGS is the ability to obtain a snapshot of the patient’s microbiome at a given sampling site to detect coinfections and determine other organisms that may impact patient outcomes. Understanding coinfection is important as it may lead to exacerbation of COVID-19, as was observed with Middle East respiratory syndrome (MERS), and may provide insight into the management of these patients (8). There have been limited reports of coinfection in COVID-19 patients, and thus far, the results have varied, perhaps due to limitations of the different methods used to detect coinfection, poor recovery or detection by standard-of-care methods due to broad-spectrum empirical coverage, lack of testing to understand coinfections, and the diverse geographic regions in which studies were conducted (9–11). Studies from China, the United States, and Europe have found various levels of coinfection within regions as well. These studies report wide ranges of coinfection, from as low as 2% to as high as 80% (12–18). While real-time reverse transcription-PCR (RT-PCR) detection of respiratory pathogens was the most common method used to detect coinfection, there is still a great deal of heterogeneity in the detection methods between these studies. The use of mNGS would not suffer from the same limitations as previous methods and may potentially reveal coinfections that may be missed by the targeted detection methods. In addition to evidence of true pathogens, there is growing evidence that the microbiome of the respiratory tract can have an impact on the health of patients, but the majority of this evidence focuses on interactions among the bacteria (19). There is growing evidence that it may be possible to predict which patients with respiratory tract infections are more likely to experience more serious disease by analyzing the microbiome (19, 20). There is also evidence that the bacterial burden and composition of the lung microbiome may impact the likelihood that mechanically ventilated critically ill patients will develop acute respiratory distress syndrome (ARDS) (21, 22). Metagenomic analysis of COVID-19 patients provides an opportunity to evaluate whether the microbiome plays a beneficial or deleterious role in patient outcomes (23).

RESULTS

Detection of SARS-CoV-2 by mNGS.

First, we demonstrated that SARS-CoV-2 could be detected by direct long-read third-generation metatranscriptomic sequencing from nasopharyngeal (NP) swab specimens of COVID-19 patients collected between 14 and 31 March 2020. SARS-CoV-2 was identified in 31/40 (77.5%) samples that were positive for SARS-CoV-2 by the diagnostic RT-PCR using the online CosmosID bioinformatics program (Table 1). Time to detection of SARS-CoV-2 reads ranged from 1 min (cycle threshold [Ct], 16.0) to 15 h (Ct, 33.4) after the start of the sequencing run, which correlated with the RT-PCR Ct values (Table 1). In the 8 samples where SARS-CoV-2 was unable to be identified, the Ct values ranged from 21.0 to 36.6, with mean and median values of 29.1 and 29.0, respectively (Table 1). We considered that perhaps an abundance of host reads could have masked the SARS-CoV-2 reads in these samples, but there was no relationship between the number of SARS-CoV-2 reads that were detected and human reads detected (Fig. 1a). Lower Ct values were associated with increased sequencing coverage of the SARS-CoV-2 reference genome, a decreased number of total sequencing reads compared to the first SARS-CoV-2 read detection, a greater proportion of SARS-CoV-2 total matches by CosmosID, and a decreasing number of days from the onset of symptoms (Fig. 1b to i). We observed that the most severe cases were spread out along the range of Ct values (Fig. 1d to f). No SARS-CoV-2 reads were identified aligning in any of the 10 samples obtained from patients that were suspected of having SARS-CoV-2 infection but were negative by RT-PCR. A summary of sequencing reads and taxonomic classification of sequencing reads are provided in Tables S1 and S2 in the supplemental material, respectively.
TABLE 1

Suspect COVID-19 sample characteristics and sequencing results

Patient IDSARS-CoV-2 RT-PCR Ct valueSeverity indexa No. of days from onsetb SARS-CoV-2 avg coverage depth (SD)SARS-CoV-2 genome coverage% of total CosmosID SARS-CoV-2 matchesSARS-CoV-2 read length, min–max (mean) [median]Time to SARS-CoV-2 detection (h:min)Putative pathogen causing coinfection (% genome coverage)c
COVID-19+d
    116.04119.21 (48.40)99.166.8279–14,387 (1,614.51) [1,260]0:01
    219.3211.83 (3.23)51.224.9462–9,233 (2,570.95) [1,316]2:40Moraxella catarrhalis (6.7)
    318.6430.57 (1.27)23.88.0504–3,357 (1,611.6) [1,376]5:30
    429.0450 (0)0.0No ID
    518.44331.15 (88.21)97.884.0229–9,676 (1,731.46) [1,409]0:02
    619.6431.80 (3.89)55.723.6233–4,381 (1,403) [1,161]0:35
    718.043.83 (8.19)78.239.9190–5,306 (1,839) [1,647]0:37
    833.41140.01 (0.07)0.50.2314–314 (314) [314]15:12
    915.634100.26 (237.48)100.096.982–14,560 (1,753) [1,466]0:00
    1020.9320.07 (0.42)2.40.7623–1,347 (988) [988]1:36
    1133.64120 (0)0.0No ID
    1220.341.20 (1.85)41.919.5475–8,505 (2,170.05) [1,611]0:37
    133100 (0)0.0No ID
    1421.040 (0)0.0No ID
    1521.140.53 (1.19)21.511.1576–6,104 (2,245.4) [1,778]1:26
    1625.14<70.12 (0.47)5.52.22,798–2,967 (2,882.50) [2,883]1:01
    1725.11<70.02 (0.17)1.1No ID749–749 (749) [749]2:28
    1820.6451.64 (2.72)54.527.4417–6,821 (1,874.82) [1,463]0:11
    1930.9470 (0)0.0No ID
    2036.640 (0)0.0No ID
    2117.1431.28 (2.11)43.720.3349–3,890 (1,505.74) [1,202]0:12Human alphaherpesvirus 1 (56.8)
    2218.0420.81 (1.13)45.016.4420–5,331 (1,889) [1,539]1:13
    2317.9410.51 (1.66)17.37.8505–4,245 (1,489.93) [875]0:07
    2424.2390.67 (1.68)19.17.3990–14,306 (3,240.86) [1,247]1:56
    2513.94114.14 (23.25)99.985.5218–16,599 (1,611.98) [1,068]0:02
    2617.445.74 (6.66)94.960.2312–9,804 (2,111.98) [1,781]0:02
    2717.39.15 (18.71)92.360.5290–20,914 (1943.81) [1,477]0:01
    2818.1426.76 (18.51)69.829.5318–5,930 (1,838.85) [1,627]0:08
    2920.640.03 (0.17)2.90.81,017–1,017 (1,017) [1,017]10:25Haemophilus influenzae (23.3)
    3014.211.73 (21.78)99.068.1307–10,590 (2,131.49) [1,793]0:11
    3120.1151.92 (5.10)34.517.2395–6,014 (1,816.36) [1,534]0:02
    3214.34729.77 (61.23)99.992.5299–17,295 (1,866.7) [1,551]0:08
    3325.8470.05 (0.21)4.41.13,034–3,034 (3,034) [3,034]12:42Haemophilus influenzae (5.2)
    3426.60.16 (0.44)12.03.9725–3,091 (2,216.75) [2,526]0:51
    3526.8350.02 (0.14)1.90.6663–663 (663) [663]4:01
    3622.41.19 (2.50)34.115.9822–5,207 (2034.61) [1,572]1:35
    3720.44140.28 (0.67)18.57.7796–2,696 (1,676.67) [1,567]6:33
    3818.0324.37 (7.29)70.542.3307–17,619 (1,835.43) [1,241]0:21
    3919.71.93 (4.25)41.020.5341–6,919 (1,828.77) [1,526]0:05
    4027.44<70 (0)0.0No IDHuman metapneumovirus (99.3)
COVID-19–e
    41NA40 (0)0No ID
    42NA430 (0)0No ID
    43NA0 (0)0No ID
    44NA180 (0)0No ID
    45NA440 (0)0No ID
    46NA410 (0)0No ID
    47NA460 (0)0No ID
    48NA430 (0)0No IDMoraxella catarrhalis (5.6)
    49NA440 (0)0No ID
    50NA40 (0)0No ID

The severity index was defined on a scale of 1 to 4, as follows: 4, not admitted; 3, admitted; 2, intensive care unit; and 1, required ventilator.

Days from onset is defined as the number of days from the initial onset of symptoms until the time of specimen collection.

Putative pathogens causing coinfections are defined as known pathogens, with bacteria having above 50% relative abundance and not considered microbiota.

Suspected COVID-19 specimens that were found to be positive for SARS-CoV-2 by RT-PCR.

Suspected COVID-19 specimens that were found to negative for SARS-CoV-2 by RT-PCR.

Ct, cycle threshold; SD, standard deviation; –, unknown information; No ID, samples that were not identified by CosmosID; NA, not applicable.

FIG 1

Relationships of metatranscriptomic sequencing sample characteristics to sequencing results. (a) Relationship between the proportion of reads mapped to SARS-CoV-2 and the number of reads matched to human sequences. (b to i) Plots of sequencing results against the Ct values of metatranscriptomic sequencing samples. Panels a to f are color coded by severity index values, and panels g to i are color coded by days from onset of symptoms. The severity index was defined on a scale of 1 to 4, as follows: 4, not admitted; 3, admitted; 2, intensive care unit; and 1, required ventilator. Black dots represent samples with unknown onset or severity index values. (b) Relationship of Ct values determined by LDT-RT-PCR to the number of days from symptom onset. (c) Relationship between Ct values and the proportion of total matches of SARS-CoV-2 by CosmosID. (d, g) Relationship between Ct values and the sequencing coverage across the SARS-CoV-2 strain Hu-1 genome. (e, h) Relationship between Ct values and the number of reads analyzed by CosmosID until the first SARS-CoV-2 read was detected. (f, i) Relationship of Ct values to the proportion of SARS-CoV-2 reads present in the sample. Simple linear regression analysis was performed for each set, and the null hypothesis is rejected for panels b to i (P < 0.01), and for each set, R2 is reported.

Suspect COVID-19 sample characteristics and sequencing results The severity index was defined on a scale of 1 to 4, as follows: 4, not admitted; 3, admitted; 2, intensive care unit; and 1, required ventilator. Days from onset is defined as the number of days from the initial onset of symptoms until the time of specimen collection. Putative pathogens causing coinfections are defined as known pathogens, with bacteria having above 50% relative abundance and not considered microbiota. Suspected COVID-19 specimens that were found to be positive for SARS-CoV-2 by RT-PCR. Suspected COVID-19 specimens that were found to negative for SARS-CoV-2 by RT-PCR. Ct, cycle threshold; SD, standard deviation; –, unknown information; No ID, samples that were not identified by CosmosID; NA, not applicable. Relationships of metatranscriptomic sequencing sample characteristics to sequencing results. (a) Relationship between the proportion of reads mapped to SARS-CoV-2 and the number of reads matched to human sequences. (b to i) Plots of sequencing results against the Ct values of metatranscriptomic sequencing samples. Panels a to f are color coded by severity index values, and panels g to i are color coded by days from onset of symptoms. The severity index was defined on a scale of 1 to 4, as follows: 4, not admitted; 3, admitted; 2, intensive care unit; and 1, required ventilator. Black dots represent samples with unknown onset or severity index values. (b) Relationship of Ct values determined by LDT-RT-PCR to the number of days from symptom onset. (c) Relationship between Ct values and the proportion of total matches of SARS-CoV-2 by CosmosID. (d, g) Relationship between Ct values and the sequencing coverage across the SARS-CoV-2 strain Hu-1 genome. (e, h) Relationship between Ct values and the number of reads analyzed by CosmosID until the first SARS-CoV-2 read was detected. (f, i) Relationship of Ct values to the proportion of SARS-CoV-2 reads present in the sample. Simple linear regression analysis was performed for each set, and the null hypothesis is rejected for panels b to i (P < 0.01), and for each set, R2 is reported. Summary of sequencing reads. Download Table S1, XLSX file, 0.03 MB. Taxonomic classification matrices for metagenomic and metatranscriptomic data. Download Table S2, XLSX file, 0.3 MB.

Identifying coinfections.

Next, we examined the samples for possible coinfections (Table 1). Of the 40 COVID-19-positive samples, 5 (12.5%) revealed organisms of clinical relevance detected in high abundance (>50% relative to normal levels in microbiota). These included Haemophilus influenzae (n = 2; 5%), Moraxella catarrhalis (n = 1; 2.5%), human metapneumovirus (hMPV) (n = 1; 2.5%), and human alphaherpesvirus 1 (n = 1; 2.5%) (Table 1). In our COVID-19-negative samples, Moraxella catarrhalis (n = 1; 10%) was identified. Unfortunately, standard-of-care testing was not performed to detect these pathogens. No fungal or protist coinfections were detected.

Evaluating the respiratory microbiome.

Beyond determining coinfections, we analyzed the metagenomic profiles of these patients in order to uncover potential shifts in the microbiome that could impact patient outcomes. Specimens obtained from SARS-CoV-2-positive patients did have a significant reduction in the diversity of their bacterial communities at the species level as measured by the Shannon diversity index (P = 0.0082), Chao richness estimate (P = 0.0097) (Fig. 2a and b), and Simpson diversity index (P = 0.018). We did not see significant differences at the genus and family levels. Given that we did see a decrease in diversity in the positive samples, we were interested in determining if there was decreasing diversity at lower Ct values. However, among these samples, we did not observe any relationship between Ct values and diversity (data not shown) using the same analyses described above. Further, we also assessed whether there was a relationship with days from symptom onset and found that there was no difference using these analyses (Fig. 2c).
FIG 2

Bacterial diversity analysis of metagenomic sequencing results. (a, b, c) Alpha diversity analysis of metagenomic sequencing results. SARS-CoV-2 positivity was determined by LDT-RT-PCR. (a, b) Shannon diversity plot and Chao diversity plot of SARS-CoV-2-negative and -positive samples at the species level. (c) Shannon diversity plot of SARS-CoV-2-negative samples and SARS-CoV-2-positive samples at different periods post-onset of symptoms. (d, e, f) Beta diversity principal-coordinate analysis of metagenomic sequencing results at the species level. (d) Bray-Curtis analysis of bacterial community composition diversity between SARS-CoV-2-negative and SARS-CoV-2-positive samples. (e) Bray-Curtis analysis of bacterial community composition diversity grouped by PCR and sequencing positivity. (f) Bray-Curtis analysis of bacterial diversity in different disease severity groups. (a, b) Wilcoxon rank sum tests were performed between positive and negative SARS-CoV-2 groups for the Shannon diversity index (P = 0.0097) and Chao diversity (P = 0.0082). (c) Wilcoxon rank sum tests were performed between disease onset groups (no significance). (d, f) PERMANOVA tests were performed on Bray-Curtis distance matrices for SARS-CoV-2-positive and -negative groups (P = 0.027) (d) and groups of disease severity (P = 0.022) (f). (e) Pairwise PERMANOVA tests were performed on Bray-Curtis distance matrices between groups defined by SARS-CoV-2 positivity by RT-PCR and sequencing. RT-PCR+/sequencing+ (PCR+/Seq+) versus RT-PCR–/sequencing– (PCR–/Seq–) (P = 0.007); PC1 and -2, principal components 1 and 2. The severity index was defined on a scale of 1 to 4, as follows: 4, not admitted; 3, admitted; 2, intensive care unit; and 1, required ventilator.

Bacterial diversity analysis of metagenomic sequencing results. (a, b, c) Alpha diversity analysis of metagenomic sequencing results. SARS-CoV-2 positivity was determined by LDT-RT-PCR. (a, b) Shannon diversity plot and Chao diversity plot of SARS-CoV-2-negative and -positive samples at the species level. (c) Shannon diversity plot of SARS-CoV-2-negative samples and SARS-CoV-2-positive samples at different periods post-onset of symptoms. (d, e, f) Beta diversity principal-coordinate analysis of metagenomic sequencing results at the species level. (d) Bray-Curtis analysis of bacterial community composition diversity between SARS-CoV-2-negative and SARS-CoV-2-positive samples. (e) Bray-Curtis analysis of bacterial community composition diversity grouped by PCR and sequencing positivity. (f) Bray-Curtis analysis of bacterial diversity in different disease severity groups. (a, b) Wilcoxon rank sum tests were performed between positive and negative SARS-CoV-2 groups for the Shannon diversity index (P = 0.0097) and Chao diversity (P = 0.0082). (c) Wilcoxon rank sum tests were performed between disease onset groups (no significance). (d, f) PERMANOVA tests were performed on Bray-Curtis distance matrices for SARS-CoV-2-positive and -negative groups (P = 0.027) (d) and groups of disease severity (P = 0.022) (f). (e) Pairwise PERMANOVA tests were performed on Bray-Curtis distance matrices between groups defined by SARS-CoV-2 positivity by RT-PCR and sequencing. RT-PCR+/sequencing+ (PCR+/Seq+) versus RT-PCR–/sequencing– (PCR–/Seq–) (P = 0.007); PC1 and -2, principal components 1 and 2. The severity index was defined on a scale of 1 to 4, as follows: 4, not admitted; 3, admitted; 2, intensive care unit; and 1, required ventilator. In order to further compare the microbial community compositions for these patients at the family, genus, and species levels, we utilized Bray-Curtis principal-coordinate analysis (PCoA). When comparing the communities in SARS-CoV-2-positive and SARS-CoV-2-negative patients, we saw a significant difference (P = 0.027) between the two groups at the species level (Fig. 2d) but not at the genus and family levels. We also analyzed whether there was a difference between samples grouped by RT-PCR and sequencing positivity (Fig. 2e). There was no difference between RT-PCR-positive/sequencing-negative (RT-PCR+/sequencing–) samples and RT-PCR+/sequencing+ samples (P = 0.372) or between RT-PCR+/sequencing– samples and RT-PCR–/sequencing– samples (P = 0.064) at the family, genus, or species level. There was a significant difference between the RT-PCR+/sequencing+ samples and RT-PCR–/sequencing– samples (P = 0.007) at the species level. In addition, we also observed a difference at the species level when we compared patients’ samples grouped by severity index (P = 0.022) (Fig. 2f). As was the case in the previous PCoA analysis, there was no significant difference at the genus and family levels. Finally, we visualized the microbial community composition at the species level by comparing the overall relative abundances of species detected in SARS-CoV-2-positive and SARS-CoV-2-negative samples. The family Propionibacteriaceae revealed the greatest difference in abundance between these two groups, with Propionibacteriaceae proportionately more abundant in SARS-CoV-2 patients by ca. 30%, representing the most abundant organism group detected in these samples (P = 0.028) (Fig. 3a). Additionally, there was a significant decrease in the incidence of Corynebacterium accolens in COVID-19-positive patients (P = 0.025) (Fig. 3a). With respect to individual samples, we observed that, in many cases, a single organism comprised the vast majority of sequencing reads detected among the SARS-CoV-2-positive patients (Fig. 3b).
FIG 3

Changes in bacterial microbiome composition in COVID-19 patients. (a, b) Relative abundances of bacteria at the species level in COVID-19-positive and COVID-19-negative samples. (a) Overall relative abundances of bacterial species in COVID-19-positive and COVID-19-negative patients. (b) Relative abundances of bacterial species in individual COVID-19-positive and COVID-19-negative patients, with samples ordered by Ct value determined by LDT-RT-PCR. A Wilcoxon rank sum test was performed on overall relative abundance values for Propionibacteriaceae (*, P = 0.028) and for Corynebacterium accolens (*, P = 0.025).

Changes in bacterial microbiome composition in COVID-19 patients. (a, b) Relative abundances of bacteria at the species level in COVID-19-positive and COVID-19-negative samples. (a) Overall relative abundances of bacterial species in COVID-19-positive and COVID-19-negative patients. (b) Relative abundances of bacterial species in individual COVID-19-positive and COVID-19-negative patients, with samples ordered by Ct value determined by LDT-RT-PCR. A Wilcoxon rank sum test was performed on overall relative abundance values for Propionibacteriaceae (*, P = 0.028) and for Corynebacterium accolens (*, P = 0.025).

DISCUSSION

In this study, we demonstrated that it is possible to detect SARS-CoV-2 in NP swab specimens by direct long-read metatranscriptomic sequencing. Due to the relatively low cycle threshold (Ct) values (i.e., high viral burdens) observed in these samples, it was possible to obtain sufficient sequencing reads without amplification or enrichment of viral targets during preparation of the cDNA library. These steps are often necessary to obtain sufficient sequencing depth or to overcome background reads, particularly in low-titer samples (24–26). However, eliminating the amplification or enrichment step prior to sequencing introduces less bias in sequences that are ultimately generated by this method. Further, forgoing PCR amplification preserves the long sequencing reads that maximize the utility of this platform. Our study also corroborates growing evidence that the viral load in the nasopharynx is highest early in the disease course and wanes as disease progresses (27, 28). From our metagenomic analysis, we were able to observe a reduction in the microbial diversity of SARS-CoV-2-positive patients, which may be driven in part by several samples overwhelmingly dominated by a single species. Interestingly, single-species predominance was observed in samples throughout the range of Ct values. Using PCoA analysis, we also detected a significant difference in the microbial communities when grouped by severity index. However, it should be noted that there were larger numbers of samples in the less severe groups. A recent study of critically ill patients found that an increased abundance of Enterobacterales in respiratory samples increased the risk of developing ARDS (21). This observation suggests that gut-colonizing bacteria may have a deleterious impact on outcomes when present in the respiratory tract. However, in our study, we did not observe increased gut-colonizing organisms. Propionibacteriaceae were more abundant in our SARS-CoV-2-positive patients. However, Propionibacterium and Cutibacterium (formerly Propionibacterium) species may represent sampling contaminants since they are a component of normal skin microbiota (29–31). Previous reports have associated Propionibacteriaceae with the respiratory tract in cystic fibrosis patients, but thus far, no linkage with disease severity has been established (29, 32). We also observed a significant reduction in the relative abundance of Corynebacterium accolens in our COVID-19 patients. This organism is considered a commensal organism, and there is evidence that it has a negative association with colonization by Streptococcus pneumoniae (30, 31, 33). Further studies would be required to conclude the role of these associations in patients with COVID-19. At the time of sampling, we did observe several clinically relevant organisms that may be potential causes of coinfection. Haemophilus influenzae (n = 2) and Moraxella catarrhalis (n = 1) were identified in COVID-19-positive samples with high relative abundances. These organisms frequently colonize the respiratory tract preceding infection, have been associated with exacerbations of chronic bronchitis and pneumonia (34, 35), and have previously been detected in COVID-19 patients (11, 36). Interestingly, there is also evidence that viral infections of the respiratory tract can increase the ability of H. influenzae to establish infection (34, 37, 38). We identified the viral pathogen human metapneumovirus (hMPV) in a COVID-19-positive sample. Although typically associated with the common cold, this virus has also been associated with exacerbations of pulmonary disease, particularly in the very young and the elderly (34, 39). In addition, we identified human alphaherpesvirus 1 (HSV1) in one patient, which likely represents reactivation in the setting of COVID-19. In our COVID-19-positive samples, we observed that the SARS-CoV-2 reads were overwhelmingly abundant and that this virus frequently was the only virus identified in the metatranscriptomic sequencing results. There are limitations to our study. First is the inability to draw larger conclusions regarding the composition of the microbiome and associations with COVID-19, including its impact on the severity of disease due to the relatively low number of samples, particularly from the more severe categories of the severity index. Having a randomized sample selection approach was beneficial to demonstrate that it is possible to detect SARS-CoV-2 from samples at various Ct values seen in our patient population. However, this ultimately led to relatively few patients with severe disease being included in our study. More broadly, a larger sample size would be necessary for future studies in order draw more definitive conclusions about microbial diversity in COVID-19 patients. In order to assess the utility of direct long-read metatranscriptomic sequencing as a potential sentinel method for assessing the continued sensitivity of RT-PCR assays, future studies should also include greater numbers of samples from suspected COVID-19 patients that are RT-PCR negative. Our study did not detect SARS-CoV-2 in any of the 10 samples from patients suspected of having COVID-19 that were negative by RT-PCR despite both our study groups having no difference in mean number of days from symptom onset, which suggests that this observation was not due to the reduced clinical sensitivity reported for NP swabs from later stages of disease. Single-nucleotide polymorphism analysis to study genomic diversity for the majority of our samples was not pursued due to insufficient depth or incomplete viral genome coverage. In summary, this study serves as a proof of concept that it is possible to detect SARS-CoV-2 and characterize simultaneous coinfections and the respiratory microbiome of patients under investigation for COVID-19 using direct long-read metatranscriptomic and metagenomic sequencing.

MATERIALS AND METHODS

Ethics statement.

This study was approved by the Johns Hopkins University institutional review board, with a waiver of informed consent.

Sample collection and severity index assignment.

In this study, 50 randomly selected nasopharyngeal (NP) swab specimens were obtained from 40 COVID-19 patients and 10 patients suspected of having COVID-19 but negative by a diagnostic laboratory-developed test (LDT), RT-PCR (U.S. Food and Drug Administration Emergency Use Authorization pending), on gene targets E and S (Table 2) (40). Remnant NP swabs were collected for the study at the completion of standard-of-care testing prior to disposal. We assigned a four-point severity index value to the samples based on the following criteria: 4, the patient was not admitted to the hospital; 3, the patient was admitted to the hospital but not the intensive care unit (ICU); 2, the patient was admitted to the ICU; and 1, the patient was placed on a ventilator.
TABLE 2

Patient demographics and clinical characteristics

Demographic information or clinical characteristicValue(s)
Total no. of patients50
    SARS-CoV-2 positive40
    SARS-CoV-2 negative10
 
Median age in yr (IQR)d [range]50.5 (36–63) [18–78]
 
No. (%) of sex:
    Male26 (52)
    Female24 (48)
 
No. (%) of racea :
    Total38
    African American17 (44.7)
    Asian2 (5.3)
    White19 (50)
 
No. (%) with the following comorbidities:
    Total19 (38)
    Cardiovascular disease
    Hypertension14 (28)
    Metabolic disease
    Obesity2 (4)
    Diabetes mellitus3 (6)
 
No. (%) of days from onset of symptomsb
    1–29 (18)
    3–719 (38)
    >77 (14)
    Unknown15 (30)
 
No. (%) of patients with severity indexc :
    14 (8)
    21 (2)
    36 (12)
    433 (66)
    Unknown6 (12)

Self-reported in prespecified fixed categories.

Number of days from onset of symptoms until specimen collection.

The severity index was defined by a 4-point scale: 4, not admitted to hospital; 3, admitted to hospital; 2, admitted to ICU; and 1, required ventilator.

IQR, interquartile range.

Patient demographics and clinical characteristics Self-reported in prespecified fixed categories. Number of days from onset of symptoms until specimen collection. The severity index was defined by a 4-point scale: 4, not admitted to hospital; 3, admitted to hospital; 2, admitted to ICU; and 1, required ventilator. IQR, interquartile range.

Nucleic acid extraction.

Automated total nucleic acid extraction was performed using the NucliSENS easyMag (bioMérieux, Marcy-l'Étoile, France) software, version 2.1.0.1. The input volume was 500 μl, and the elution volume was 50 μl. Extraction was performed by following the manufacturer’s protocol. Extracts were stored at –80°C prior to mNGS sequencing.

mNGS GridION sequencing.

Long-read metagenomic sequencing was performed using the Nanopore GridION X5 (Oxford, England) sequencing instrument. Each Nanopore sequencing library was prepared using total nucleic acid extract from NP swabs. Two different libraries were generated for each specimen using two different kits. A direct cDNA sequencing kit (SQK-DCS 109; Oxford Nanopore Technologies) was used for sequencing poly(A)+ RNA full-length transcripts to allow for sequencing SARS-CoV-2 in a nonbiased way. In addition, we used a PCR barcoding kit (SQK-PBK004; Oxford Nanopore Technologies) for capturing DNA targets for untargeted metagenomic analysis and to complement the metatranscriptomic analysis. For cDNA libraries, complementary cDNA strand synthesis, as well as strand switching, was performed using kit-supplied reagents and oligonucleotides. This was followed by RNA strand degradation and synthesis of complementary strands and then ligation of sequencing adaptors. Multiplexing was performed using the Native barcoding expansion kit (EXP-NBD104). For PCR barcoding kit libraries, briefly, total nucleic acid was fragmented using the Covaris g-TUBE. Repair of sheared ends was performed using the NEBNext end repair/dA-tailing, followed by ligation of adaptors containing primer binding sites for sample amplification as well as barcoding. Primers used, in addition to incorporating barcodes, added 5′ tags to allow for sample attachment to one-dimensional rapid sequencing adaptors. Specimens were sequenced using R9.4.1 flow cells (FLO-MIN106). Five samples were multiplexed per flow cell, except for libraries for samples 1 to 5, which were prepared with the Direct cDNA sequencing kit and sequenced one sample per flow cell. Samples 1 to 10 have only metatranscriptomic data, as there was insufficient remaining specimens for subsequent metagenomic sequencing. MinKNOW software was used to collect and base call the sequencing data.

CosmosID analysis and bioinformatics pipeline.

Raw nanopore fastq files that passed filter based on nanopore default parameters were concatenated into one file per sample, with all reads in order of sequencing start time. Concatenated, unassembled sequencing reads were directly analyzed by the CosmosID bioinformatics platform (CosmosID Inc., Rockville, MD) as described elsewhere (41–44) for multi-kingdom microbiome analysis and quantification of organism relative abundance. Briefly, the system utilizes curated genome databases and a high-performance data-mining algorithm that rapidly disambiguates hundreds of millions of metagenomic sequence reads into the discrete microorganisms engendering particular sequences. Four variables are generated for each organism detected: unique match frequency, unique match percentage, total match percentage, and relative abundance, as previously defined (45). The CosmosID % Total Matches statistic was used as an approximation of the percent coverage of the sample against SARS-CoV-2 strain Hu-1. Sequencing reads were mapped against SARS-CoV-2 strain Hu-1 using the tool Minimap2, with default parameters. Output sam/bam files were sorted and indexed using SAMtools, and summary statistics were generated using Qualimap (46–48). To determine the detection, depth, and coverage of SARS-CoV-2 strain Hu-1 at different times, multiple cumulative read files were generated per sample. Depending on the number of reads in a sample, subsamples were created using the first 1,000, 2,000, 4,000, and 8,000 reads, doubling until a maximum of 256,000 reads was reached. These files were used for both CosmosID and Minimap analysis. The number of the first read identified was the first read mapped to SARS-CoV-2 strain Hu-1 via the CosmosID Metagenomics software. Approximations of the amounts of reads belonging to various kingdoms were determined by the number of reads in each sample that mapped to the CosmosID database for each kingdom. Relative abundance stacked bars were generated from the family-, genus-, and species-level relative abundance matrices from CosmosID taxonomic analysis. Bars are separated based on the comparative cohort and were visualized using the R package ggplot2 (49). Putative pathogens causing coinfections are defined as organisms of high abundance having >50% relative abundance compared to that in the normal microbiota (i.e., organisms expected to be found in the respiratory tracts of healthy individuals). Orthogonal validation of coinfection calls were completed by running raw nanopore output files against reference genomes of interest using minimap-2, with default parameters, in the same way as was used in the mapping to SARS-CoV-2 strain Hu-1. Mapping statistics and coverage plots were generated from Minimap’s sam/bam files using Qualimap’s bamqc function.

Statistical analysis.

Alpha diversity boxplots were calculated from the family-, genus-, and species-level abundance score matrices employing the CosmosID taxonomic analysis. Chao, Simpson, and Shannon alpha diversity metrics were calculated in R using the R package vegan. Wilcoxon rank sum tests were performed between positive and negative COVID-19 groups using the R package ggsignif. Boxplots with overlaid significance in P value format were generated using the R package ggplot2 (49–51). Beta diversity principal-coordinate analyses (PCoA) were calculated from the family-, genus-, and species-level relative abundance matrices from CosmosID taxonomic analysis. Bray-Curtis and Jaccard diversities were calculated in R using the R package vegan with the function vegdist, and PCoA tables were generated using vegan’s function pcoa. Permutational multivariate analysis of variance (PERMANOVA) tests for each distance matrix were generated using vegan’s function adonis2. Group pairwise PERMANOVAs were generated using the pairwise Adonis function pairwise.adonis2. Plots were visualized using the R package ggpubr (50, 52, 53).
  40 in total

1.  Presenting Characteristics, Comorbidities, and Outcomes Among 5700 Patients Hospitalized With COVID-19 in the New York City Area.

Authors:  Safiya Richardson; Jamie S Hirsch; Mangala Narasimhan; James M Crawford; Thomas McGinn; Karina W Davidson; Douglas P Barnaby; Lance B Becker; John D Chelico; Stuart L Cohen; Jennifer Cookingham; Kevin Coppa; Michael A Diefenbach; Andrew J Dominello; Joan Duer-Hefele; Louise Falzon; Jordan Gitlin; Negin Hajizadeh; Tiffany G Harvin; David A Hirschwerk; Eun Ji Kim; Zachary M Kozel; Lyndonna M Marrast; Jazmin N Mogavero; Gabrielle A Osorio; Michael Qiu; Theodoros P Zanos
Journal:  JAMA       Date:  2020-05-26       Impact factor: 56.272

2.  Rates of Co-infection Between SARS-CoV-2 and Other Respiratory Pathogens.

Authors:  David Kim; James Quinn; Benjamin Pinsky; Nigam H Shah; Ian Brown
Journal:  JAMA       Date:  2020-05-26       Impact factor: 56.272

3.  Upper respiratory tract microbial communities, acute otitis media pathogens, and antibiotic use in healthy and sick children.

Authors:  Melinda M Pettigrew; Alison S Laufer; Janneane F Gent; Yong Kong; Kristopher P Fennie; Joshua P Metlay
Journal:  Appl Environ Microbiol       Date:  2012-06-29       Impact factor: 4.792

4.  Cross-talk among flesh-eating Aeromonas hydrophila strains in mixed infection leading to necrotizing fasciitis.

Authors:  Duraisamy Ponnusamy; Elena V Kozlova; Jian Sha; Tatiana E Erova; Sasha R Azar; Eric C Fitts; Michelle L Kirtley; Bethany L Tiner; Jourdan A Andersson; Christopher J Grim; Richard P Isom; Nur A Hasan; Rita R Colwell; Ashok K Chopra
Journal:  Proc Natl Acad Sci U S A       Date:  2016-01-05       Impact factor: 11.205

5.  Lung Microbiota Predict Clinical Outcomes in Critically Ill Patients.

Authors:  Robert P Dickson; Marcus J Schultz; Tom van der Poll; Laura R Schouten; Nicole R Falkowski; Jenna E Luth; Michael W Sjoding; Christopher A Brown; Rishi Chanderraj; Gary B Huffnagle; Lieuwe D J Bos
Journal:  Am J Respir Crit Care Med       Date:  2020-03-01       Impact factor: 21.405

6.  Qualimap 2: advanced multi-sample quality control for high-throughput sequencing data.

Authors:  Konstantin Okonechnikov; Ana Conesa; Fernando García-Alcalde
Journal:  Bioinformatics       Date:  2015-10-01       Impact factor: 6.937

Review 7.  A review of the role of Haemophilus influenzae in community-acquired pneumonia.

Authors:  Mary P E Slack
Journal:  Pneumonia (Nathan)       Date:  2015-12-01

8.  Co-infections in people with COVID-19: a systematic review and meta-analysis.

Authors:  Louise Lansbury; Benjamin Lim; Vadsala Baskaran; Wei Shen Lim
Journal:  J Infect       Date:  2020-05-27       Impact factor: 6.072

9.  Genomic Diversity of Severe Acute Respiratory Syndrome-Coronavirus 2 in Patients With Coronavirus Disease 2019.

Authors:  Zijie Shen; Yan Xiao; Lu Kang; Wentai Ma; Leisheng Shi; Li Zhang; Zhuo Zhou; Jing Yang; Jiaxin Zhong; Donghong Yang; Li Guo; Guoliang Zhang; Hongru Li; Yu Xu; Mingwei Chen; Zhancheng Gao; Jianwei Wang; Lili Ren; Mingkun Li
Journal:  Clin Infect Dis       Date:  2020-07-28       Impact factor: 9.079

10.  Metagenomic sequencing with spiked primer enrichment for viral diagnostics and genomic surveillance.

Authors:  Xianding Deng; Asmeeta Achari; Scot Federman; Guixia Yu; Sneha Somasekar; Inês Bártolo; Shigeo Yagi; Placide Mbala-Kingebeni; Jimmy Kapetshi; Steve Ahuka-Mundeke; Jean-Jacques Muyembe-Tamfum; Asim A Ahmed; Vijay Ganesh; Manasi Tamhankar; Jean L Patterson; Nicaise Ndembi; Dora Mbanya; Lazare Kaptue; Carole McArthur; José E Muñoz-Medina; Cesar R Gonzalez-Bonilla; Susana López; Carlos F Arias; Shaun Arevalo; Steve Miller; Mars Stone; Michael Busch; Kristina Hsieh; Sharon Messenger; Debra A Wadford; Mary Rodgers; Gavin Cloherty; Nuno R Faria; Julien Thézé; Oliver G Pybus; Zoraima Neto; Joana Morais; Nuno Taveira; John R Hackett; Charles Y Chiu
Journal:  Nat Microbiol       Date:  2020-01-13       Impact factor: 17.745

View more
  47 in total

Review 1.  Role of the Microbiome in the Pathogenesis of COVID-19.

Authors:  Rituparna De; Shanta Dutta
Journal:  Front Cell Infect Microbiol       Date:  2022-03-31       Impact factor: 5.293

2.  The Brief Case: Ventilator-Associated Corynebacterium accolens Pneumonia in a Patient with Respiratory Failure Due to COVID-19.

Authors:  Benjamin M Liu; Emily M Beck; Mark A Fisher
Journal:  J Clin Microbiol       Date:  2021-08-18       Impact factor: 5.948

3.  Nasopharyngeal Bacterial Microbiota Composition and SARS-CoV-2 IgG Antibody Maintenance in Asymptomatic/Paucisymptomatic Subjects.

Authors:  Luca Ferrari; Chiara Favero; Giulia Solazzo; Jacopo Mariani; Anna Luganini; Monica Ferraroni; Emanuele Montomoli; Gregorio Paolo Milani; Valentina Bollati
Journal:  Front Cell Infect Microbiol       Date:  2022-07-06       Impact factor: 6.073

Review 4.  The role of respiratory microbiota in the protection against viral diseases: respiratory commensal bacteria as next-generation probiotics for COVID-19.

Authors:  Bruno G N Andrade; Rafael R C Cuadrat; Fernanda Raya Tonetti; Haruki Kitazawa; Julio Villena
Journal:  Biosci Microbiota Food Health       Date:  2022-03-29

Review 5.  Hallmarks of Severe COVID-19 Pathogenesis: A Pas de Deux Between Viral and Host Factors.

Authors:  Roberta Rovito; Matteo Augello; Assaf Ben-Haim; Valeria Bono; Antonella d'Arminio Monforte; Giulia Marchetti
Journal:  Front Immunol       Date:  2022-06-10       Impact factor: 8.786

Review 6.  Respiratory and Intestinal Microbiota in Pediatric Lung Diseases-Current Evidence of the Gut-Lung Axis.

Authors:  Sebastian Stricker; Torsten Hain; Cho-Ming Chao; Silvia Rudloff
Journal:  Int J Mol Sci       Date:  2022-06-18       Impact factor: 6.208

7.  Metagenomic analysis reveals differences in the co-occurrence and abundance of viral species in SARS-CoV-2 patients with different severity of disease.

Authors:  Pavel Iša; Blanca Taboada; Rodrigo García-López; Celia Boukadida; José Ernesto Ramírez-González; Joel Armando Vázquez-Pérez; Alejandra Hernández-Terán; José Ángel Romero-Espinoza; José Esteban Muñoz-Medina; Concepción Grajales-Muñiz; Alma Rincón-Rubio; Margarita Matías-Florentino; Alejandro Sanchez-Flores; Edgar Mendieta-Condado; Gisela Barrera-Badillo; Susana López; Lucía Hernández-Rivas; Irma López-Martínez; Santiago Ávila-Ríos; Carlos F Arias
Journal:  BMC Infect Dis       Date:  2022-10-19       Impact factor: 3.667

8.  Biographical Feature: Karen C. Carroll, M.D.

Authors:  Erik Munson
Journal:  J Clin Microbiol       Date:  2021-07-19       Impact factor: 5.948

9.  Unusual source of recurrent Corynebacterium bacteraemia in an immunocompromised patient.

Authors:  Abdul Rahim Ali Bakhsh; Keir Edward Lewis
Journal:  BMJ Case Rep       Date:  2021-06-11

10.  Association between the nasopharyngeal microbiome and metabolome in patients with COVID-19.

Authors:  Jing Liu; Sheng Liu; Zhao Zhang; Xuejun Lee; Wenxuan Wu; Zhanlian Huang; Ziying Lei; Wenxiong Xu; Dabiao Chen; Xing Wu; Yang Guo; Liang Peng; Bingliang Lin; Yutian Chong; Xiangyu Mou; Mang Shi; Ping Lan; Tao Chen; Wenjing Zhao; Zhiliang Gao
Journal:  Synth Syst Biotechnol       Date:  2021-06-14
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.