| Literature DB >> 33203890 |
Eran Mick1,2,3, Jack Kamm3, Angela Oliveira Pisco3, Kalani Ratnasiri3, Jennifer M Babik1, Gloria Castañeda3, Joseph L DeRisi3,4, Angela M Detweiler3, Samantha L Hao3, Kirsten N Kangelaris5, G Renuka Kumar3, Lucy M Li3, Sabrina A Mann3,4, Norma Neff3, Priya A Prasad5, Paula Hayakawa Serpa1,3, Sachin J Shah5, Natasha Spottiswoode5, Michelle Tan3, Carolyn S Calfee2, Stephanie A Christenson2, Amy Kistler3, Charles Langelier6,7.
Abstract
SARS-CoV-2 infection is characterized by peak viral load in the upper airway prior to or at the time of symptom onset, an unusual feature that has enabled widespread transmission of the virus and precipitated a global pandemic. How SARS-CoV-2 is able to achieve high titer in the absence of symptoms remains unclear. Here, we examine the upper airway host transcriptional response in patients with COVID-19 (n = 93), other viral (n = 41) or non-viral (n = 100) acute respiratory illnesses (ARIs). Compared with other viral ARIs, COVID-19 is characterized by a pronounced interferon response but attenuated activation of other innate immune pathways, including toll-like receptor, interleukin and chemokine signaling. The IL-1 and NLRP3 inflammasome pathways are markedly less responsive to SARS-CoV-2, commensurate with a signature of diminished neutrophil and macrophage recruitment. This pattern resembles previously described distinctions between symptomatic and asymptomatic viral infections and may partly explain the propensity for pre-symptomatic transmission in COVID-19. We further use machine learning to build 27-, 10- and 3-gene classifiers that differentiate COVID-19 from other ARIs with AUROCs of 0.981, 0.954 and 0.885, respectively. Classifier performance is stable across a wide range of viral load, suggesting utility in mitigating false positive or false negative results of direct SARS-CoV-2 tests.Entities:
Mesh:
Year: 2020 PMID: 33203890 PMCID: PMC7673985 DOI: 10.1038/s41467-020-19587-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Host Transcriptional Signatures of SARS-CoV-2 Infection as Compared to Other Respiratory Viruses.
a Hierarchical clustering of 121 genes comprising the union of the top 50 differentially expressed (DE) genes by significance in each of the pairwise comparisons between patients with COVID-19 (SARS-CoV-2; n = 93), other viral ARIs (n = 41) and non-viral ARIs (n = 100). Gene expression values were clustered after variance-stabilizing transformation and row normalization. Group labels and viral load of SARS-CoV-2 are shown in the annotation bars. rpM, reads-per-million. b Normalized enrichment scores of selected REACTOME pathways that achieved statistical significance in at least one of the gene set enrichment analyses, using either DE genes between SARS-CoV-2 and non-viral ARIs or between other viral ARIs and non-viral ARIs. If a pathway could not be tested in one of the comparisons since it had <10 members in the input gene set, the enrichment score was set to 0. Pathway p-values were calculated using an adaptive, multilevel splitting Monte Carlo approach and Benjamini–Hochberg adjusted. c In silico estimation of cell-type proportions in the bulk RNA sequencing using single-cell signatures. Black lines denote the median. The y-axis in each panel was trimmed at the maximum value among the three patient groups of 1.5*IQR above the third quartile, where IQR is the inter-quartile range. Pairwise comparisons between patient groups were performed with a two-sided Mann–Whitney–Wilcoxon test followed by Bonferroni’s correction. Sample sizes as in (a). d Scatter plots of normalized gene counts (log2 scale, y-axis) as a function of SARS-CoV-2 viral load (log10(rpM), x-axis). Shown are inflammasome-related genes selected from among the genes most depressed in expression in SARS-CoV-2 compared to other viral ARIs. Robust regression was performed on SARS-CoV-2 positive patients with log10(rpM) ≥ 0 (n = 82) to characterize the relationship to viral load. Shaded bands represent 95% confidence intervals. Statistical results listed for each gene refer to, from top to bottom: the regression analysis (p-values for difference of the slope from 0 derived from a t-statistic and Benjamini–Hochberg adjusted; R2 is the adjusted robust coefficient of determination), the DE analysis between SARS-CoV-2 and non-viral ARIs (p-values derived from a moderated t-statistic and Benjamini–Hochberg adjusted), and the DE analysis between SARS-CoV-2 and other viral ARIs (p-values derived from a moderated t-statistic and Benjamini–Hochberg adjusted). Sample sizes for DE analyses as in (a). FC, fold-change.
Fig. 2Performance of COVID-19 diagnostic classifiers based on patient gene expression.
a Receiver operating characteristic (ROC) curve for a 27-gene classifier that differentiates COVID-19 from other acute respiratory illnesses (viral and non-viral). The mean and range of the area under the curve (AUC) are indicated. b Accuracy of the 27-gene classifier within each patient group using a cut-off of 40% out-of-fold predicted probability for COVID-19. c ROC curve for a 10-gene classifier. d ROC curve for a 3-gene classifier. e Out-of-fold predicted probability of COVID-19 derived from the 27-gene classifier plotted as a function of SARS-CoV-2 viral load, log10(rpM). Dashed lines indicate 40% (our chosen cut-off) and 50%.