| Literature DB >> 32511476 |
Eran Mick1,2,3, Jack Kamm3, Angela Oliveira Pisco3, Kalani Ratnasiri3, Jennifer M Babik1, Carolyn S Calfee2, Gloria Castañeda3, Joseph L DeRisi3,4, Angela M Detweiler3, Samantha Hao3, Kirsten N Kangelaris5, G Renuka Kumar3, Lucy M Li3, Sabrina A Mann3,4, Norma Neff3, Priya A Prasad5, Paula Hayakawa Serpa1,3, Sachin J Shah5, Natasha Spottiswoode5, Michelle Tan3, Stephanie A Christenson2, Amy Kistler3, Charles Langelier1,3.
Abstract
We studied the host transcriptional response to SARS-CoV-2 by performing metagenomic sequencing of upper airway samples in 238 patients with COVID-19, other viral or non-viral acute respiratory illnesses (ARIs). Compared to other viral ARIs, COVID-19 was characterized by a diminished innate immune response, with reduced expression of genes involved in toll-like receptor and interleukin signaling, chemokine binding, neutrophil degranulation and interactions with lymphoid cells. Patients with COVID-19 also exhibited significantly reduced proportions of neutrophils and macrophages, and increased proportions of goblet, dendritic and B-cells, compared to other viral ARIs. Using machine learning, we built 26-, 10- and 3-gene classifiers that differentiated COVID-19 from other acute respiratory illnesses with AUCs of 0.980, 0.950 and 0.871, respectively. Classifier performance was stable at low viral loads, suggesting utility in settings where direct detection of viral nucleic acid may be unsuccessful. Taken together, our results illuminate unique aspects of the host transcriptional response to SARS-CoV-2 in comparison to other respiratory viruses and demonstrate the feasibility of COVID-19 diagnostics based on patient gene expression.Entities:
Year: 2020 PMID: 32511476 PMCID: PMC7273244 DOI: 10.1101/2020.05.18.20105171
Source DB: PubMed Journal: medRxiv
Figure 1.Host Transcriptional Signatures of SARS-CoV-2 Infection as Compared to Other Respiratory Viruses.
A. Hierarchical clustering of 120 genes comprising the union of the top 50 DE genes by significance in each of the pairwise comparisons between patients with COVID-19 (SARS-CoV-2), other viral ARIs and non-viral ARIs. Group labels and viral load of SARS-CoV-2 are shown in the annotation bars. rpM, reads-per-million. B. Normalized enrichment scores of selected REACTOME pathways that achieved statistical significance (Benjamini-Hochberg adjusted p-value < 0.05) in at least one of the gene set enrichment analyses, using either DE genes between SARS-CoV-2 and non-viral ARIs or between other viruses and non-viral ARIs. If a pathway could not be tested in one of the comparisons since it had less than 10 members in the input gene set, the enrichment score was set to 0. C. in silico estimation of cell type fractions in the bulk RNA-seq using lung single cell signatures. Black lines denote the median. The y-axis in each panel was trimmed at the maximum value among the three patient groups of 1.5*IQR above the third quartile. All pairwise comparisons were performed with a two-sided Mann-Whitney-Wilcoxon test followed by Bonferroni’s correction. D. Scatter plots of normalized gene counts (log2 scale) as a function of SARS-CoV-2 viral load, log10(rpM). Robust regression was performed on SARS-CoV-2 positive patients with log10(rpM) > 0 to highlight the relationship to viral load. Shown are inflammasome-related genes selected from among the genes most depressed in expression in SARS-CoV-2 compared to other viral ARIs. Statistical results for each gene refer to (from top to bottom): the regression analysis, the DE analysis between SARS-CoV-2 and non-viral ARIs, and the DE analysis between SARS-CoV-2 and other viral ARIs.
Figure 2.Performance of COVID-19 Diagnostic Classifiers Based on Patient Gene Expression.
A. Receiver operating characteristic (ROC) curve for a 26-gene classifier that differentiates COVID-19 from other acute respiratory illnesses (viral and non-viral). B. Accuracy of the 26-gene classifier within each patient group, using a cut-off of 40% out-of-fold predicted probability for COVID-19. C. ROC curve for a 10-gene classifier. D. ROC curve for a 3-gene classifier. E. Out-of-fold predicted probability of COVID-19 derived from the 26-gene classifier plotted as a function of SARS-CoV-2 viral load, log10(rpM). Dashed lines indicate 40% (our chosen cut-off) and 50%.