| Literature DB >> 32849601 |
Aline Fastrès1, Dimitri Pirottin2, Laurence Fievez2, Thomas Marichal2, Christophe J Desmet2, Fabrice Bureau2, Cécile Clercx1.
Abstract
Single-cell mRNA-sequencing (scRNA-seq) is a technique which enables unbiased, high throughput and high-resolution transcriptomic analysis of the heterogeneity of cells within a population. This recent technique has been described in humans, mice and other species in various conditions to cluster cells in populations and identify new subpopulations, as well as to study the gene expression of cells in various tissues, conditions and origins. In dogs, a species for which markers of cell populations are often limiting, scRNA-seq presents with elevated yet untested potential for the study of tissue composition. As a proof of principle, we used scRNA-seq to identify cellular populations of the bronchoalveolar lavage fluid (BALF) in healthy dogs (n = 4). A total of 5,710 cells were obtained and analyzed by scRNA-seq. Fourteen distinct clusters of cells were identified, further identified as macrophages/monocytes (4 clusters), T cells (2 clusters) and B cells (1 cluster), neutrophils (1 cluster), mast cells (1 cluster), mature or immature dendritic cells (1 cluster each), ciliated or non-ciliated epithelial cells (1 cluster each) and cycling cells (1 cluster). We used for the first time in dogs the scRNA-seq to investigate cellular subpopulations of the BALF of dog. This study hence expands our knowledge on dog lung immune cell populations, paves the way for the investigation at single-cell level of lower respiratory diseases in dogs, and establishes that scRNA-seq is a powerful tool for the study of dog tissue composition.Entities:
Keywords: bronchoalveolar lavage fluid; cell; dog; lung; single-cell RNA-sequencing
Mesh:
Substances:
Year: 2020 PMID: 32849601 PMCID: PMC7406785 DOI: 10.3389/fimmu.2020.01707
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Total and differential cell count in each bronchoalveolar lavage fluid.
| TCC, cells/μL | 440 | 880 | 570 | 180 | |
| DCC, % | Macrophages | 70 | 80 | 91 | 71 |
| Neutrophils | 10 | 12 | 3 | 12 | |
| Lymphocytes | 18 | 5 | 3 | 10 | |
| Eosinophils | 1 | 3 | 1 | 2 | |
| Mast cells | 0 | 0 | 0 | 0 | |
| Epithelial cells | 1 | 0 | 2 | 5 | |
BALF 1, female Yorkshire terrier of 11-year-old; BALF 2, female French bulldog of 4-year-old; BALF 3, female West Highland white terrier of 9-year-old; BALF 4, female Australian shepherd of 6-year-old; TCC, total cell count; DCC, differential cell count.
Metrics about mapping and characteristics of the detected cells of each BALF sample.
| Number of cells passing quality control | 1,309 | 1,072 | 1,298 | 2,031 |
| Reads mapped confidently to genome, % | 68.1 | 68.6 | 59.8 | 72.4 |
| Reads mapped confidently to transcriptome, % | 26.5 | 30.4 | 23.9 | 30.7 |
| Median genes/cell | 485 (229–2,480) | 780 (350–1,313) | 834 (376–1,046) | 407 (215.25–953) |
| Median UMIs/cell | 1020 (321–3,351) | 1942 (720–4,003) | 1888.5 (678–2,671) | 837 (430–2,669) |
| Total genes detected | 11,343 | 11,133 | 10.839 | 11,543 |
Data were generated after passing quality control including the exclusion of cells with <100 and >2,500 genes. Reads mapped confidently to genome are the number of reads that mapped only to the genome. Reads mapped confidently to transcriptome are the fraction of the reads mapped to a unique gene in the transcriptome and are considered for UMI counting. Median genes per cell correspond to the median number of genes with at least one UMI count. Total genes detected is the detected number of genes with at least one UMI count in any cell. BALF 1, female Yorkshire terrier of 11-year-old; BALF 2, female French bulldog of 4-year-old; BALF 3, female West Highland white terrier of 9-year-old; BALF 4, female Australian shepherd of 6-year-old; UMI, unique molecular identifier.
Figure 1Compiled t-SNE plot of the cell clusters. (A) t-SNE plot of all cells (n = 5,710) representing the cell clusters analyzed by scRNA-seq. Each color corresponds to one cluster assigned via the graph-based clustering method with a resolution of 0.3. (B) Batch alignment across bronchoalveolar lavage fluid (BALF) specimens, each color representing the cells coming from one sample. (C) Bar plot showing the relative proportion of the cell from each BALF sample into each cluster. BALF 1, female Yorkshire terrier of 11-year-old; BALF 2, female French bulldog of 4-year-old; BALF 3, female West Highland white terrier of 9-year-old; BALF 4, female Australian shepherd of 6-year-old.
Selection of significant DEGs able to differentiate cell type in each cluster based on literature.
| Macro-phages/Mono-cytes markers | 83 | −1 | 48 | −0.99 | 60 | 25 | −0.97 | 0 | 94 | −0.97 | 51 | −0.95 | 38 | −0.91 | 20 | −0.96 | 0 | −0.94 | 24 | −0.96 | 27 | ||||||||
| 88 | −1.23 | 46 | −1.2 | 59 | 35 | −1.2 | 16 | −1.24 | 17 | −1.22 | 20 | −1.22 | 27 | ||||||||||||||||
| 100 | −1.18 | 84 | −1.5 | 87 | −3.27 | 26 | −2.47 | 76 | 100 | 100 | −3.25 | 19 | −2.23 | 22 | −2.1 | 73 | |||||||||||||
| 87 | −1.31 | 24 | −1.28 | 20 | 45 | −1.25 | 14 | −1.13 | 39 | −1.22 | 16 | ||||||||||||||||||
| 86 | −1.27 | 52 | −1.16 | 59 | 57 | −1.31 | 0 | −1.03 | 52 | 91 | 79 | −0.80 | 50 | −1.25 | 76 | −1.22 | 16 | ||||||||||||
| 91 | −1.68 | 44 | −1.69 | 54 | −1.71 | 12 | −0.52 | 92 | −1.68 | 65 | 83 | −1.51 | 43 | −1.67 | 26 | −1.6 | 20 | ||||||||||||
| 86 | −1.19 | 36 | −1.16 | 54 | −1.18 | 0 | 96 | −1.13 | 27 | −0.79 | 3 | −1.01 | 18 | ||||||||||||||||
| 99 | −0.98 | 64 | −1.17 | 68 | −0.26 | 49 | −1 | 30 | 100 | −0.5 | 43 | 97 | −1.12 | 72 | 97 | −0.51 | 33 | ||||||||||||
| DC markers | 79 | 55 | |||||||||||||||||||||||||||
| 71 | −1.06 | 7 | −1.08 | 16 | −0.78 | 17 | 83 | −0.57 | 27 | −1.04 | 8 | 100 | |||||||||||||||||
| 93 | |||||||||||||||||||||||||||||
| T cells markers | −1.75 | 58 | 77 | 87 | −1.56 | 4 | −1.57 | 55 | −1.43 | 25 | 73 | ||||||||||||||||||
| −1.6 | 56 | 72 | 83 | −1.32 | 2 | −1.39 | 55 | −1.27 | 39 | −1.39 | 22 | ||||||||||||||||||
| Epithe-lial cells markers | −3.92 | 61 | −4.15 | 67 | −3.29 | 79 | 99 | −4.17 | 93 | −3.5 | 54 | −3.24 | 77 | −4.46 | 67 | ||||||||||||||
| −3.31 | 58 | −3.42 | 64 | −2.31 | 78 | 95 | −3.3 | 93 | −2.66 | 77 | −3.82 | 81 | −3.39 | 28 | |||||||||||||||
| −1.84 | 60 | −1.15 | 67 | 81 | −1.66 | 52 | −1.63 | 37 | −1.64 | 60 | −1.65 | 10 | 69 | ||||||||||||||||
| Neutro-phils markers | −0.25 | 72 | −0.8 | 57 | −0.91 | 48 | −0.63 | 22 | −1.18 | 7 | −0.56 | 89 | 77 | −0.6 | 82 | −1.19 | 14 | ||||||||||||
| 71 | |||||||||||||||||||||||||||||
| Cycling cells markers | 96 | ||||||||||||||||||||||||||||
| 82 | |||||||||||||||||||||||||||||
| 45 | |||||||||||||||||||||||||||||
| B cells markers | −1.51 | 64 | −1.19 | 47 | −2.02 | 15 | −1.99 | 75 | −1.91 | 81 | 87 | −2.29 | 39 | ||||||||||||||||
| 72 | |||||||||||||||||||||||||||||
| 59 | |||||||||||||||||||||||||||||
| Baso-phils markers | −0.6 | 33 | −0.51 | 39 | −0.45 | 56 | −0.39 | 75 | −0.49 | 2 | 44 | −0.33 | 11 | 78 | |||||||||||||||
| −0.30 | 1 | 81 | 100 | ||||||||||||||||||||||||||
| 98 | −1.89 | 21 | −2.17 | 49 | −2.19 | 4 | 83 | 94 | −2.08 | 58 | 92 | −2.09 | 26 | ||||||||||||||||
The avg logFC was calculated by comparing each cluster to all other clusters, overexpressed transcripts are displayed in bold. Only significant data were reported in the table (P < 0.05). DEGs, differentially expressed genes; Avg logFC, average log2 fold; pct, percentage of cells in the cluster expressing the gene; DC, dendritic cells.
Figure 2Identification of cell identity corresponding to the clusters. t-SNE plot showing the cells identity based on the expression of differentially expressed genes representative of each cells type including genes coding for the macrophage receptor with collagenous structure (MARCO), the macrophage mannose receptor (MRC1, encoding CD206), the T-cell surface glycoprotein CD3 epsilon chain (CD3E), the cytokeratin 19 (KRT19), the selectin (SELL, encoding CD62L), the integrin alpha M (ITGAM), the T-cell surface glycoprotein CD1e (CD1E), the CD83 molecule, the Fc receptor like A (FCRLA), the CD19 molecule, the DNA topoisomerase II alpha (TOP2A), the proliferating cell nuclear antigen clamp associated factor (ENSCAFG00000030087, encoding PCLAF), the membrane spanning 4-domains A2 (MS4A2) and the mast/stem cell growth factor receptor (KIT). DC, dendritic cell.
Figure 3Single-cell mRNA-sequencing based identification of 4 distinct subpopulations of macrophages/monocytes in the bronchoalveolar lavage fluid of dogs. Dot plots showing the average expression of the indicated genes as well as the percentage of cells expressing the genes within each cluster of macrophages/monocytes. An example of transcripts significantly (P-value adjusted < 0.05) differentially upregulated (average log2 fold change > 0.5) between the clusters 0, 3, 5, and 8 are depicted.
Top 10 gene set overlap between significantly upregulated genes in cluster 0, 3, 5 and 8 compared to each other and the gene ontology (GO) biological process gene set.
| Cluster 0 vs. 3, 5 and 8 | Myeloid leukocyte activation | 650 | 88 | 28 | 0.0431 | 1.23E-24 |
| Leukocyte mediated immunity | 867 | 88 | 30 | 0.0346 | 3.06E-24 | |
| Cell activation involved in immune response | 705 | 88 | 28 | 0.0397 | 3.36E-24 | |
| Myeloid leukocyte mediated immunity | 550 | 88 | 26 | 0.0473 | 3.36E-24 | |
| Exocytosis | 899 | 88 | 29 | 0.0323 | 8.10E-23 | |
| Immune effector process | 1,253 | 88 | 32 | 0.0255 | 1.42E-22 | |
| Cell activation | 1,424 | 88 | 32 | 0.0225 | 6.03E-21 | |
| Secretion | 1,638 | 88 | 31 | 0.0189 | 4.86E-18 | |
| Defense response | 1,709 | 88 | 26 | 0.0152 | 2.40E-12 | |
| Innate immune response | 984 | 88 | 21 | 0.0213 | 2.50E-12 | |
| Cluster 3 vs. 0, 5 and 8 | Response to cytokine | 1,192 | 251 | 45 | 0.0378 | 4.29E-18 |
| Regulation of immune system process | 1,631 | 251 | 47 | 0.0288 | 1.22E-14 | |
| Positive regulation of protein metabolic process | 1,633 | 251 | 46 | 0.0282 | 3.81E-14 | |
| Cell activation | 1,424 | 251 | 43 | 0.0302 | 3.81E-14 | |
| Cell motility | 1,719 | 251 | 46 | 0.0268 | 1.95E-13 | |
| Response to oxygen containing compound | 1,616 | 251 | 44 | 0.0272 | 4.45E-13 | |
| Locomotion | 1,943 | 251 | 48 | 0.0247 | 6.01E-13 | |
| Defense response | 1,709 | 251 | 44 | 0.0257 | 2.46E-12 | |
| Regulation of cell activation | 608 | 251 | 27 | 0.0444 | 3.06E-12 | |
| Interspecies interaction between organisms | 927 | 251 | 32 | 0.0345 | 7.69E-12 | |
| Cluster 5 vs. 0, 3 and 8 | Myeloid leukocyte activation | 650 | 62 | 16 | 0.0246 | 1.70E-11 |
| Myeloid leukocyte mediated immunity | 550 | 62 | 15 | 0.0273 | 1.70E-11 | |
| Exocytosis | 899 | 62 | 17 | 0.0189 | 5.14E-11 | |
| Cell activation involved in immune response | 705 | 62 | 15 | 0.0213 | 2.91E-10 | |
| Leukocyte mediated immunity | 867 | 62 | 16 | 0.0185 | 2.91E-10 | |
| Secretion | 1638 | 62 | 20 | 0.0122 | 2.91E-10 | |
| Cell activation | 1,424 | 62 | 18 | 0.0126 | 2.95E-09 | |
| Immune effector process | 1253 | 62 | 17 | 0.0136 | 3.90E-09 | |
| Cellular homeostasis | 971 | 62 | 14 | 0.0144 | 1.69E-07 | |
| Homeostatic process | 1,913 | 62 | 18 | 0.0094 | 2.57E-07 | |
| Cluster 8 vs. 0, 3 and 5 | Defense response | 1,709 | 59 | 20 | 0.0117 | 1.22E-09 |
| Cell motility | 1,719 | 59 | 19 | 0.0111 | 7.91E-09 | |
| Cytokine mediated signaling pathway | 787 | 59 | 14 | 0.0178 | 1.49E-08 | |
| Locomotion | 1,943 | 59 | 19 | 0.0098 | 3.31E-08 | |
| Inflammatory response | 722 | 59 | 13 | 0.018 | 4.87E-08 | |
| Leukocyte migration | 488 | 59 | 11 | 0.0225 | 1.29E-07 | |
| Response to cytokine | 1,192 | 59 | 15 | 0.0126 | 1.29E-07 | |
| Response to bacterium | 681 | 59 | 12 | 0.0176 | 2.45E-07 | |
| Response to biotic stimulus | 1,023 | 59 | 13 | 0.0127 | 1.88E-06 | |
| Regulation of immune system process | 1,631 | 59 | 15 | 0.0092 | 6.45E-06 |
Gene set enrichment analysis carried on by computing overlaps between significantly upregulated genes (P < 0.05, avg_logFC > 0.25) and the gene ontology biological process gene set. “Genes in Gene Set” refers to the number of genes in the gene set, “Positive DEGs included” corresponds to the number of positive differentially expressed genes in the cluster of interest compared to the others and “Genes in Overlap” to the number of genes upregulated in the cluster and involved in the biological process. Avg_logFC, average log2 fold change.
Top 10 gene set overlap between significantly upregulated genes in cluster 1 and 2 compared to each other and the gene ontology (GO) biological process gene set.
| Cluster 1 vs. 2 | Regulation of immune system process | 1,631 | 24 | 13 | 0.008 | 1.85E-08 |
| Innate immune response | 984 | 24 | 11 | 0.0112 | 2.20E-08 | |
| Regulation of immune response | 1,094 | 24 | 11 | 0.0101 | 4.58E-08 | |
| Natural killer cell mediated immunity | 65 | 24 | 5 | 0.0769 | 9.46E-07 | |
| Defense response | 1,709 | 24 | 11 | 0.0064 | 3.10E-06 | |
| Regulation of natural killer cell chemotaxis | 9 | 24 | 3 | 0.3333 | 2.26E-05 | |
| Natural killer cell chemotaxis | 11 | 24 | 3 | 0.2727 | 3.81E-05 | |
| Lymphocyte mediated immunity | 344 | 24 | 6 | 0.0174 | 5.41E-05 | |
| Lymphocyte chemotaxis | 62 | 24 | 4 | 0.0645 | 5.41E-05 | |
| Cell activation | 1,424 | 24 | 9 | 0.0063 | 8.11E-05 | |
| Cluster 2 vs. 1 | Regulation of lymphocyte activation | 478 | 37 | 12 | 0.0251 | 1.38E-10 |
| Regulation of cell activation | 608 | 37 | 12 | 0.0197 | 8.27E-10 | |
| Regulation of T cell activation | 313 | 37 | 10 | 0.0319 | 8.27E-10 | |
| T cell activation | 459 | 37 | 11 | 0.024 | 8.27E-10 | |
| Regulation of cell death | 1,723 | 37 | 16 | 0.0093 | 2.23E-09 | |
| Lymphocyte activation | 721 | 37 | 12 | 0.0166 | 2.88E-09 | |
| Apoptotic process | 1,980 | 37 | 16 | 0.0081 | 1.30E-08 | |
| Biological adhesion | 1,417 | 37 | 14 | 0.0099 | 2.16E-08 | |
| Cell activation | 1,424 | 37 | 14 | 0.0098 | 2.16E-08 | |
| Leukocyte cell-cell adhesion | 336 | 37 | 9 | 0.0268 | 2.16E-08 |
Gene set enrichment analysis carried on by computing overlaps between significantly upregulated genes (P < 0.05, avg_logFC > 0.25) and the gene ontology biological process gene set. “Genes in Gene Set” refers to the number of genes in the gene set, “Positive DEGs included” corresponds to the number of positive differentially expressed genes in the cluster of interest compared to the other and “Genes in Overlap” to the number of genes upregulated in the cluster and involved in the biological process. Avg_logFC, average log2 fold change.
Top 10 gene set overlap between significantly upregulated genes in cluster 4 and 12 compared to each other and the gene ontology (GO) biological process gene set.
| Cluster 12 vs. 4 | Microtubule based process | 734 | 93 | 14 | 0.0191 | 7.90E-06 |
| Epithelial cilium movement | 23 | 93 | 5 | 0.2174 | 7.90E-06 | |
| Cilium movement | 65 | 93 | 6 | 0.0923 | 2.00E-05 | |
| Cytoskeleton organization | 1298 | 93 | 17 | 0.0131 | 2.00E-05 | |
| Microtubule based movement | 277 | 93 | 9 | 0.0325 | 3.13E-05 | |
| Regulation response to stress | 1,497 | 93 | 17 | 0.0114 | 8.91E-05 | |
| Cell projection organization | 1,512 | 93 | 16 | 0.0106 | 5.00E-04 | |
| Reproduction | 1,459 | 93 | 15 | 0.0103 | 1.49E-03 | |
| Actin filament bundle organization | 155 | 93 | 6 | 0.0387 | 1.59E-03 | |
| Central nervous system development | 980 | 93 | 12 | 0.0122 | 2.54E-03 |
Gene set enrichment analysis carried on by computing overlaps between significantly upregulated genes (P < 0.05, avg_logFC > 0.25) and the gene ontology biological process gene set. No overlap was found between the gene set and upregulated genes of the cluster 4 compared to the cluster 12. “Genes in Gene Set” refers to the number of genes in the gene set, “Positive DEGs included” corresponds to the number of positive differentially expressed genes in the cluster of interest compared to the other and “Genes in Overlap” to the number of genes upregulated in the cluster and involved in the biological process. Avg_logFC, average log2 fold change.
Top 10 gene set overlap between significantly upregulated genes in clusters 7 and 13 compared to each other and the gene ontology (GO) biological process gene set.
| Cluster 7 vs. 13 | Cell activation | 1424 | 218 | 59 | 0.0414 | 3.15E-31 |
| Myeloid leukocyte activation | 650 | 218 | 43 | 0.0662 | 4.95E-30 | |
| Myeloid leukocyte mediated immunity | 550 | 218 | 38 | 0.0691 | 5.41E-27 | |
| Cell activation involved in immune response | 705 | 218 | 41 | 0.0582 | 1.26E-26 | |
| Immune effector process | 1,253 | 218 | 51 | 0.0407 | 1.26E-26 | |
| Exocytosis | 899 | 218 | 44 | 0.0489 | 8.25E-26 | |
| Leukocyte mediated immunity | 867 | 218 | 43 | 0.0496 | 1.90E-25 | |
| Secretion | 1,638 | 218 | 54 | 0.033 | 3.62E-24 | |
| Defense response | 1,709 | 218 | 51 | 0.0298 | 1.02E-20 | |
| Regulation of immune system process | 1,631 | 218 | 46 | 0.0282 | 1.96E-17 | |
| Cluster 13 vs. 7 | Cell activation | 1,424 | 85 | 23 | 0.0162 | 2.66E-10 |
| Regulation of lymphocyte activation | 478 | 85 | 14 | 0.0293 | 6.34E-09 | |
| Lymphocyte activation | 721 | 85 | 16 | 0.0222 | 6.34E-09 | |
| Regulation of immune system process | 1,631 | 85 | 22 | 0.0135 | 6.34E-09 | |
| Regulation of cell activation | 608 | 85 | 15 | 0.0247 | 6.34E-09 | |
| Regulation of T cell activation | 313 | 85 | 12 | 0.0383 | 6.34E-09 | |
| Response to biotic stimulus | 1,023 | 85 | 18 | 0.0176 | 8.72E-09 | |
| T cell activation | 459 | 85 | 13 | 0.0283 | 2.55E-08 | |
| Cytokine production | 759 | 85 | 15 | 0.0198 | 9.27E-08 | |
| Response to cytokine | 1,192 | 85 | 17 | 0.0143 | 6.25E-07 |
Gene set enrichment analysis carried on by computing overlaps between significantly upregulated genes (P < 0.05, avg_logFC > 0.25) and the gene ontology biological process gene set. “Genes in Gene Set” refers to the number of genes in the gene set, “Positive DEGs included” corresponds to the number of positive differentially expressed genes in the cluster of interest compared to the other and “Genes in Overlap” to the number of genes upregulated in the cluster and involved in the biological process. Avg_logFC, average log2 fold change.