| Literature DB >> 33923155 |
Hyun-Hwan Jeong1, Johnathan Jia1,2, Yulin Dai1, Lukas M Simon1,3, Zhongming Zhao1,2,4.
Abstract
Single-cell RNA sequencing of the bronchoalveolar lavage fluid (BALF) samples from COVID-19 patients has enabled us to examine gene expression changes of human tissue in response to the SARS-CoV-2 virus infection. However, the underlying mechanisms of COVID-19 pathogenesis at single-cell resolution, its transcriptional drivers, and dynamics require further investigation. In this study, we applied machine learning algorithms to infer the trajectories of cellular changes and identify their transcriptional programs. Our study generated cellular trajectories that show the COVID-19 pathogenesis of healthy-to-moderate and healthy-to-severe on macrophages and T cells, and we observed more diverse trajectories in macrophages compared to T cells. Furthermore, our deep-learning algorithm DrivAER identified several pathways (e.g., xenobiotic pathway and complement pathway) and transcription factors (e.g., MITF and GATA3) that could be potential drivers of the transcriptomic changes for COVID-19 pathogenesis and the markers of the COVID-19 severity. Moreover, macrophages-related functions corresponded more to the disease severity compared to T cells-related functions. Our findings more proficiently dissected the transcriptomic changes leading to the severity of a COVID-19 infection.Entities:
Keywords: COVID-19; bronchoalveolar lavage fluid; deep learning; machine learning; single cell RNA-seq; trajectory inference
Mesh:
Year: 2021 PMID: 33923155 PMCID: PMC8145325 DOI: 10.3390/genes12050635
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Cellular trajectories inferred by Slingshot in macrophages and T cells using Liao et al.’s BALF scRNA-seq data [9]. The illustrations of UMAP embedded cells (colored by the disease status) and inferred cellular trajectories (displayed as bold lines) (top). Violin plots show the pseudo time distributions of the inferred cellular trajectories for each disease group (healthy vs. moderate/severe) (bottom). (A,B) The cellular trajectories from healthy control cells to moderate or severe cells in macrophages. (C,D) The cellular trajectories from healthy control cells to moderate or severe cells in T cells. UMAP: Uniform Manifold Approximation and Projection. H, M, and S denote healthy, moderate, and severe samples, respectively.
The summary of top inferred cellular trajectories of macrophages and T cells.
| Cell Type | Trajectory | Number of Healthy Cells | Number of Moderate/Severe Cells | Fold-Change | |
|---|---|---|---|---|---|
| Macrophage |
| 6370 | 2168 | 0.37 |
|
|
| 10,492 | 19,798 | 0.94 |
| |
| T cell |
| 414 | 1524 | 0.47 |
|
|
| 237 | 975 | 0.56 |
|
The statistics of the SARS-nCoV-2 gene expression across the pseudotime.
| Cell Type | Trajectory | Gene | Number of Infected Cells | Average (Pseudotime) |
|---|---|---|---|---|
| Macrophage |
| S | 104 | 0.46 |
| ORF8 | 41 | 0.46 | ||
| N | 258 | 0.48 | ||
| ORF10 | 67 | 0.49 | ||
| ORF3a | 24 | 0.49 | ||
| M | 34 | 0.50 | ||
| ORF1ab | 467 | 0.52 | ||
| ORF7a | 403 | 0.57 | ||
| T cell |
| ORF1ab | 38 | 0.61 |
Figure 2Changes of sub-cell type populations for each inferred cellular trajectory. (A) T cell sub-cell type changes of healthy-to-moderate (top) and healthy-to-severe (bottom). (B) Macrophage sub-cell type changes of healthy-to-moderate (top) and healthy-to-severe (bottom). The x-axis indicates the inferred pseudotime, and the y-axis indicates the height of density estimated and visualized by the geom_density function of ggplot2 R package [25]. A percentage next to a cell type name indicates the proportion of the cell type in the trajectory, and it is rounded to the ones place. Treg: regulatory T cells. CD8: cluster of differentiation 8. CCR7: C-C motif chemokine receptor 7. SPP1: secreted phosphoprotein 1. FCN1: Ficolin-1. FABP4: Fatty acid-binding protein 4.
Figure 3DrivAER analysis identified hallmark pathways and transcription factors differentially activated in macrophages and T cells. (A) The DrivAER pathway analysis results in macrophages. (B) The DrivAER transcription factor analysis results in macrophages. (C) The DrivAER pathway analysis results in T cells. (D) The DrivAER transcription factor analysis results in macrophages. Each bar indicates the score (deltaRS) of the corresponding TP. The top five TPs (red bars) are the transcriptional programs that were highly expressed in the severe trajectory. The bottom five TPs (blue bars) are the top five transcriptional programs that were highly expressed in the moderate trajectory.
Figure 4Visualization of both manifold and gene expression change of top transcriptional programs of the severe trajectory in macrophages. (A,B) A cell manifold of xenobiotic metabolism pathway and MITF transcription factor performed by DCA, respectively. X- and y-axes indicate the first and second dimensions of the gene manifold. Each point indicates a cell and is colored by its pseudotime. (C,D) Gene expression heatmaps for the top transcriptional programs (xenobiotic metabolism pathway and MITF). The transcriptional program genes expressed in less than 20% of cells, or the cells showing less than 40% of expressions of the transcriptional program’s genes were excluded during the heatmap visualization.