| Literature DB >> 33168968 |
Erick Armingol1,2,3, Adam Officer3,4, Olivier Harismendy5,6, Nathan E Lewis7,8,9.
Abstract
Cell-cell interactions orchestrate organismal development, homeostasis and single-cell functions. When cells do not properly interact or improperly decode molecular messages, disease ensues. Thus, the identification and quantification of intercellular signalling pathways has become a common analysis performed across diverse disciplines. The expansion of protein-protein interaction databases and recent advances in RNA sequencing technologies have enabled routine analyses of intercellular signalling from gene expression measurements of bulk and single-cell data sets. In particular, ligand-receptor pairs can be used to infer intercellular communication from the coordinated expression of their cognate genes. In this Review, we highlight discoveries enabled by analyses of cell-cell interactions from transcriptomic data and review the methods and tools used in this context.Entities:
Mesh:
Year: 2020 PMID: 33168968 PMCID: PMC7649713 DOI: 10.1038/s41576-020-00292-x
Source DB: PubMed Journal: Nat Rev Genet ISSN: 1471-0056 Impact factor: 59.581
Fig. 1Types and applications of cell–cell interactions and communication.
a | ‘Autocrine signalling’ refers to intracellular communication whereby cells secrete ligands that are used to induce a cellular response through cognate receptors for those molecules expressed on the same cell. Paracrine cell–cell communication does not require cell–cell contact, rather depending on the diffusion of signalling molecules from one cell to another after secretion. Juxtacrine, that is, contact-dependent, cell–cell communication relies on gap junctions or other structures such as membrane nanotubes to pass signalling molecules directly between cells, without secretion into the extracellular space. Endocrine cell–cell communication represents intercellular communication whereby signalling molecules are secreted and travel long distances through extracellular fluids such as the blood plasma; typical mediators of this communication are hormones. b | Overview of the main applications of cell–cell interaction methods: cell development, tissue and organ homeostasis, and immune interactions in disease (for more details on each study type, see Supplementary Table 1).
Illustrative studies and their strategies for deciphering cell–cell interactions and communication
| Sample or organ | Key input | Scoring | CS value | CCC score | Validation | Study focus | Ref |
|---|---|---|---|---|---|---|---|
| Haematopoietic cells (human) | Microarray; LRIs | Expression thresholding | Binary | No score | Functional validation | Role of CCC between differentiated haematopoietic cells and HSCs in fate decisions | [ |
| Brain (mouse embryonic cortex) | Microarray; LRIs | Expression thresholding | Binary | No score | Functional validation | Role of microenvironment in self-renewal versus differentiation decision of neural precursor cells during neurogenesis | [ |
| Liver and iPS cells (human) | scRNA-seq; LRIs | Expression thresholding | Binary | Normalized sum of CS | Functional validation | 3D liver bud organoid from iPS cells to characterize CCC shaping hepatogenesis | [ |
| Placenta (human) | scRNA-seq; LRIs | Expression thresholding | Binary | No score | Functional validation | CCI in the fetus–placenta interface before and after decidualization | [ |
| Brain (mouse) | Bulk RNA-seq; LRIs | Expression thresholding | Binary | Sum of CS | Colocalization | Ligand–receptor pathways active during neural development; CCC between neural, vascular and microglial cells | [ |
| iPS cells (mouse) | scRNA-seq; LRIs | Expression product | Continuous | Sum of CS | Functional validation | CCC at the beginning of differentiation | [ |
| Bone marrow (mouse) | scRNA-seq; LRIs | RNA-Magnet | Continuous | No score | Colocalization | CCC and interactions between bone marrow cells | [ |
| Multiple lineages (human) | scRNA-seq; LRIs | Expression thresholding | Binary | Sum of CS | None | CCC between multiple cell lineages | [ |
| Lungs (human) | popRNA-seq; LRIs | Expression thresholding | Binary | Sum of CS | Colocalization functional validation | Signals sent by mesenchymal cells in lungs that are key for self-renewal of epithelial progenitors after tissue injury | [ |
| Heart (mouse) | scRNA-seq; LRIs | Expression thresholding | Binary | Sum of CS | Functional validation | Transcriptional profiles of non-myocyte cells in heart and their CCC | [ |
| Lungs (mouse) | scRNA-seq; LRIs | Expression correlation (Spearman) | Continuous | No score | Expression; colocalization functional validation | CCC between and within immune and non-immune cells during development | [ |
| Immune system and structural cells (mouse) | Low-input RNA-seq; LRIs | Differential combinations | Binary | Odds ratios | Functional validation | Role of structural cells in immune responses | [ |
| Heart (mouse) | scRNA-seq; LRIs | Differential combinations | Binary | Sum of CS | Expression; colocalization functional validation | CCC of cardiomyocytes and non-cardiomyocytes in human heart in health and under failure | [ |
| Placenta (human) | scRNA-seq; LRIs | CellPhoneDB | Continuous | No score | Colocalization | Key ligand–receptor pairs based on subunit architecture; CCC at maternal–fetal interface | [ |
| Melanoma (human) | scRNA-seq; LRIs | Expression thresholding | Binary | Sum of CS | None | CCI network of isolated cells | [ |
| HNSCC (human) | scRNA-seq; LRIs | Expression thresholding | Binary | No score | Colocalization | CCC in patients with HNSCC generated by HPV or environmental carcinogens (HPV negative) | [ |
| Five cancer types (mouse) | scRNA-seq; LRIs | Expression product | Continuous | No score | None | CCC within a tumoural microenvironment | [ |
| Nine cancer types (human) | Microarray; LRIs | Expression correlation (Pearson) | Continuous | No score | None | Correlation between autocrine signalling pathways and mRNA levels of ligands and receptors | [ |
| Lungs (human) | scRNA-seq; LRIs | Differential combination; expression thresholding | Binary | No score | Functional validation | Tumour–stroma CCC in lung cancer; introduced CCCExplorer | [ |
| Ovary (human) | Microarray; LRIs; downstream target genes | Differential combination; expression thresholding | Binary | No score | Expression; functional validation | CCC between stromal and ovarian cancer cells | [ |
| Head and neck and immune system (human) | scRNA-seq; LRIs; downstream target genes | NicheNet | Continuous | No score | None | Prediction of ligand–target links between interacting cells; tested on HNSCC data set | [ |
CCC, cell–cell communication; CCI, cell–cell interaction; CS, communication score; HNSCC, head and neck squamous cell carcinoma; HPV, human papillomavirus; HSC, haematopoietic stem cell; iPS cell, induced pluripotent stem cell; LRI, ligand–receptor interaction; popRNA-seq, population RNA sequencing; RNA-seq; RNA sequencing; scRNA-seq, single-cell RNA sequencing. For additional studies and details, see Supplementary Table 1.
Fig. 2Analysis workflow for inferring cell–cell interactions and communication from gene expression.
a | Samples or cells are analysed by transcriptomics to measure the expression of genes (step 1). Then the data generated are preprocessed to build a gene expression matrix, which contains the transcript levels of each gene across different samples or cells (step 2). A list of interacting proteins that are involved in intercellular communication is generated or obtained from other sources (step 3), often including interactions between secreted and membrane-bound proteins (commonly ligands and receptors, respectively). Only the genes associated with the interacting proteins are held in the gene expression matrix (step 4). Their expression levels are used as inputs to compute a communication score for each ligand–receptor pair using a scoring function (function f(L, R), where L and R are the expression values of the ligand and the receptor, respectively). These communication scores may be aggregated to compute an overall state of interaction between the respective samples or cells using an aggregation function (function g(Cell 1, Cell 2), where Cell 1 and Cell 2 are all communication scores of those cells or corresponding samples) (step 5). Finally, communication and aggregated scores can be represented by, for instance, Circos plots and network visualizations to facilitate the interpretation of the results (step 6). b | Main scoring functions of communication pathways based on the expression of their components. Recommended data to use with these functions and the type of their resulting communication score are indicated.
Fig. 3Toy examples of using core functions to compute communication scores.
Two primary inputs are used for quantifying communication scores: a preprocessed gene expression matrix (part a) and a list of interacting proteins to supervise the analysis (for example, ligand–receptor pairs) (part b). Then a communication score (CS) can be computed for every ligand–receptor pair in a given pair of cells. Here, we show how to perform these calculations for four core functions (parts c–f). These are applied to elucidate paracrine (parts c,d) and autocrine (parts e,f) communication. To assess cell–cell communication, a CS can be computed for each ligand–receptor pair by accounting for the presence of both partners if their expression is greater than a given threshold, which for demonstrative purposes was set arbitrarily to a value of 3.3 (part c), or by multiplying their expression values (part d). Similarly, the CS for each ligand–receptor pair can be the correlation score obtained from their expression across all cell types for autocrine communication (part e). To reveal non-autocrine interactions, the correlation can be computed across pairs of different cells. Particular signatures of each cell type can be extracted through analysing differentially expressed ligands and receptors. Using the cell type-specific differentially expressed genes, we can assign a binary CS and study the ligand–receptors used for autocrine communication (part f). In this example, autocrine communication is evaluated for cell type A by using its differentially expressed genes with respect to cell type B (cell type A-specific genes are located in the coloured quadrant). Analogously to the correlation score, for non-autocrine communication we would need to consider differentially expressed genes in each of the cell types or samples. For a given pair of cells, we can say that a communication pathway is active when the ligand is differentially expressed in one cell and its cognate receptor is differentially expressed in the other. FC, fold change.
Existing tools for measuring cell–cell communication
| Tool | Method overview | Output | Visualization | Available in | URL | Refs |
|---|---|---|---|---|---|---|
| CellTalker | Uses differentially expressed ligands and receptors in each cluster to identify unique interactions between clusters | Upregulated and downregulated interactions between all clusters | Circos plot of differential interactions between clusters | R | [ | |
| iTALK | Enumerates differentially expressed ligand and receptor values to identify LRIs between different clusters | Upregulated and downregulated interactions between all clusters | CCI networks, Circos plots and boxplots | R | [ | |
| PyMINEr | Uses differentially expressed ligand and receptor pairs to identify altered signalling pathways. Detects both activation and inhibition | Upregulated and downregulated interactions between all clusters | Network visualization and Circos plots | Python and standalone application | [ | |
| CellChat | Modelling of LRI is generalized from the Hill equation, including expression of both agonists and antagonists. Significance is computed by permutation | Likelihood of CCC between all clusters for all interactions | Alluvial and Circos plots of communication pathways, dot plots of interactions between clusters | R and Web interface | [ | |
| CellPhoneDB | Randomly permutes cluster labels to generate a null distribution of LRI scores using protein complex subunit architecture to identify significant interactions | Upregulated and downregulated interactions between all clusters | Heatmap of significant interaction counts, dot plot of LRIs and cluster combinations | Python and Web interface | [ | |
| Giotto | Randomly permutes cluster labels to generate a null distribution of LRI scores using spatial information to identify significant interactions | Upregulated and downregulated interactions between all clusters | Heatmap of significant interaction counts, dot plot of LRIs and cluster combinations | R | [ | |
| ICELLNET | Sums the product of all LRI scores between two clusters to compute an overall CCI score | Intergroup communication scores, | Stacked bar plot of LRIs, network visualization of interacting groups and pathway-level analysis | R | [ | |
| SingleCellSignalR | Uses a regularized ligand–receptor expression product to measure extent of CCC | Interaction scores for each LRI between all clusters in the dataset | Circos plot, tables and graph visualizations of interactions between clusters | R | [ | |
| CCCExplorer | A graph of all signalling pathways is built, then a Fisher-like statistic is computed using ligand, receptor and downstream TF expression to identify significant interactions | Graph visualizations of all interactions | Interactive directed graphs | Standalone application | [ | |
| NicheNet | A network of ligand–receptor pathway interactions is used to measure the predictive power of the ligand for its downstream pathway targets as an interaction score, based on a personalized PageRank algorithm | Ligand interaction scores and expressing cell types for provided target pathway | Circos plot of interactions between cells or clusters | R | [ | |
| SoptSC | Integrates downstream signalling measurements into an LRI scoring function | Upregulated and downregulated interactions between all clusters | Circos plot of interactions between cells | MATLAB and R | [ | |
| SpaOTsc | An optimal transport model is used to infer CCC from ligand, receptor and downstream component expression | Likelihood of CCC between all clusters for all LRIs | Not implemented | Python | [ | |
| scTensor | Tucker decomposition on a tensor of order three to identify key LRIs present in certain cell types | HTML file with summaries of clustering, decomposition and interaction components | Many options for interaction, expression and pattern visualization | R | [ | |
CCC, cell–cell communication; CCI, cell–cell interaction; LRI, ligand–receptor interaction; TF, transcription factor.
Fig. 4Common visualization techniques for cell–cell interactions and communication.
a | A Sankey diagram for connecting key ligands from a sender cell to cognate receptors in the receiver cell. Node colour (ligand or receptor) indicates the expression level. b | Heatmap to represent the communication scores for each ligand–receptor interaction in each cell pair. c | Dot plot to show the communication score (colour of dots) and at the same time its significance (size), often obtained from a statistical model or permutation analysis. d | Circos plot or chord diagram to show key communication pathways used by different cell types to communicate. The links start from a ligand (red) and end in a receptor (blue), which are grouped for each cell type (coloured outer arcs). e | Bipartite network where nodes can be either cells or ligands. Edges can be directed only from a cell to a ligand it produces or from a ligand to a cell that expresses its cognate receptor. f | Cell–cell interaction network to represent the potential of cells to interact. Nodes correspond to cells and edges correspond to their interactions. These are directed from a sender cell to a receiver cell, and their thicknesses are proportional to the respective global cell–cell communication scores (for example, number of active ligand–receptor pairs).
| Resource | URL | Ref. |
|---|---|---|
| Gene Ontology | [ | |
| UniProt | [ | |
| KEGG | [ | |
| OrthoDB | [ | |
| gProfiler | [ | |
| The Human Protein Atlas | [ | |
| STRING | [ | |
| BioGRID | [ | |
| PICKLE | [ | |
| APID | [ | |
| IntAct | [ | |
| Pathway Commons | [ | |
| IUPHAR/BPS Guide to Pharmacology | [ | |
| Resources for cell–cell interactions from the Bader laboratory | [ | |
| Compendium of ligand–receptor pairs in the literature | 10.1038/s41576-020-00292-x | |