| Literature DB >> 35493464 |
Dongmei Ai1, Yonglian Xing2, Qingchuan Zhang3, Yishu Wang2, Xiuqin Liu2, Gang Liu2, Li C Xia4.
Abstract
Recent transcriptomics and metagenomics studies showed that tissue-infiltrating immune cells and bacteria interact with cancer cells to shape oncogenesis. This interaction and its effects remain to be elucidated. However, it is technically difficult to co-quantify immune cells and bacteria in their respective microenvironments. To address this challenge, we herein report the development of a complete a bioinformatics pipeline, which accurately estimates the number of infiltrating immune cells using a novel Particle Swarming Optimized Support Vector Regression (PSO-SVR) algorithm, and the number of infiltrating bacterial using foreign read remapping and the GRAMMy algorithm. It also performs systematic differential abundance analyses between tumor-normal pairs. We applied the pipeline to a collection of paired liver cancer tumor and normal samples, and we identified bacteria and immune cell species that were significantly different between tissues in terms of health status. Our analysis showed that this dual model of microbial and immune cell abundance had a better differentiation (84%) between healthy and diseased tissue. Caldatribacterium sp., Acidaminococcaceae sp., Planctopirus sp., Desulfobulbaceae sp.,Nocardia farcinica as well as regulatory T cells (Tregs), resting mast cells, monocytes, M2 macrophases, neutrophils were identified as significantly different (Mann Whitney Test, FDR< 0.05). Our open-source software is freely available from GitHub at https://github.com/gutmicrobes/PSO-SVR.git.Entities:
Keywords: RNA-seq; gene expression profiling; particle swarm algorithm; support vector regression; tumor microenvironment
Mesh:
Year: 2022 PMID: 35493464 PMCID: PMC9047545 DOI: 10.3389/fimmu.2022.853213
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 8.786
Figure 1Flow chart of RNA-Seq sequencing data processing.
Figure 2The flow chart of PSO-SVR algorithm. SVR model was embedded into the PSO algorithm to calculate the optimal parameters; then SVR model with optimized parameters was applied to the immune cell ratios.
PSO-SVR model parameter.
| Parameter | Meaning | Reference range | Value |
|---|---|---|---|
| k(x,y) | kernel function | linear | k (x,y)=x·y |
| C | Penalty factor | [1,108] | 3 |
| ϵ | sensitivity | [0,0.2] | 1.203564×10-5 |
| Φ | Kernel function coefficient | [0.01,2.0] | 0.04545455 |
Figure 3Comparison between PSO-SVR and orthogonal technologies. Scatter plots of immune cell fractions with regression of (A) PBMC-FC; (B) CRC-IC; and (C) Melanoma-scRNA data; r for Pearson’s correlation, rs for Spearman’s correlation.
Immune cell error table of three data sets.
| Cell type | Root mean square error (RMSE) | Pearson (r) | Spearman (rs) | |
|---|---|---|---|---|
| B cells | 0.028763 | 0.69534 | 0.67669 | |
| Monocytes | 0.056215 | 0.92192 | 0.93684 | |
| CD4 T cells | 0.073478 | 0.84827 | 0.88270 | |
| CD8T cells | 0.110915 | 0.84207 | 0.82105 | |
| B cells | 0.057609 | 0.39181 | 0.51515 | |
| Monocytes | 0.018876 | 0.87587 | 0.78181 | |
| NK cells | 0.116719 | 0.23098 | 0.30909 | |
| T cells | 0.060646 | 0.18626 | 0.16363 | |
| B cells | 0.019590 | 0.99128 | 0.85381 | |
| Macrophages | 0.028086 | 0.70316 | 0.67192 | |
| NK cells | 0.021296 | 0.92560 | 0.86315 | |
| CD4 T cells | 0.034945 | 0.95508 | 0.92847 | |
| CD8 T cells | 0.050518 | 0.97476 | 0.97543 |
Figure 4Differetial analysis of immune cell infiltration in normal and tumor tissues. “**” indicates FDR < 0.01; “***” indicates FDR < 0.001.
Bacteria with significant variability between normal and primary tumor tissues.
| Species | FDR | Difference % | State (↑↓) |
|---|---|---|---|
| 3.15 × 10-11 | 74.94% | ↑ | |
| 1.79 × 10-9 | 73.58% | ↑ | |
| 2.50 × 10-9 | 73.66% | ↑ | |
| 2.50 × 10-9 | 71.10% | ↑ | |
| 8.02 × 10-9 | 71.20% | ↑ |
Figure 5Difference and diversity of bacteria in liver cancer samples. (A) Differentiation of bacteria between liver tumor tissue and normal tissue. “**” indicates FDR < 0.05; “***” indicates FDR < 0.001. (B) Analysis of alpha diversity of bacteria in tumor and normal tissues. The green color on the left indicates normal samples, the red part on the right indicates tumor samples, and the middle BASE part indicates the interquartile range.
Classification effect of liver cancer samples under different input features.
| Case1: Bacteria | Case2: Immune cell | Case3: Bacteria-Cell | |
|---|---|---|---|
| 0.68 | 0.64 | 0.80 | |
| 0.75 | 0.71 | 0.88 | |
| 0.67 | 0.67 | 0.88 | |
| 0.70 | 0.67 | 0.85 | |
| 0.68 | 0.76 | 0.80 | |
| 0.75 | 0.79 | 0.75 | |
| 0.67 | 0.67 | 0.96 | |
| 0.70 | 0.74 | 0.84 |