| Literature DB >> 32760401 |
Olivia Estévez1,2, Luis Anibarro2,3,4, Elina Garet1,2, Ángeles Pallares5, Laura Barcia3, Laura Calviño3, Cremildo Maueia6, Tufária Mussá6,7, Florentino Fdez-Riverola1,2,8, Daniel Glez-Peña1,2,8, Miguel Reboiro-Jato1,2,8, Hugo López-Fernández1,2,8, Nuno A Fonseca9,10, Rajko Reljic11, África González-Fernández1,2.
Abstract
A better understanding of the response against Tuberculosis (TB) infection is required to accurately identify the individuals with an active or a latent TB infection (LTBI) and also those LTBI patients at higher risk of developing active TB. In this work, we have used the information obtained from studying the gene expression profile of active TB patients and their infected -LTBI- or uninfected -NoTBI- contacts, recruited in Spain and Mozambique, to build a class-prediction model that identifies individuals with a TB infection profile. Following this approach, we have identified several genes and metabolic pathways that provide important information of the immune mechanisms triggered against TB infection. As a novelty of our work, a combination of this class-prediction model and the direct measurement of different immunological parameters, was used to identify a subset of LTBI contacts (called TB-like) whose transcriptional and immunological profiles are suggestive of infection with a higher probability of developing active TB. Validation of this novel approach to identifying LTBI individuals with the highest risk of active TB disease merits further longitudinal studies on larger cohorts in TB endemic areas.Entities:
Keywords: RNA-seq; TB progression; latent tuberculosis; machine-learning; tuberculosis
Mesh:
Substances:
Year: 2020 PMID: 32760401 PMCID: PMC7372107 DOI: 10.3389/fimmu.2020.01470
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Exclusion criteria for participants' recruitment.
| All participants | Having received anti-TB treatment before |
| Contacts only | Previous TB diagnosis |
Demographical composition of the Spanish and Mozambican cohorts.
| Total | 41 | 27 | 28 |
| Males (%) | 19 (46.3) | 16 (59.3) | 23 (82.1) |
| Age mean (range) | 39 (19–76) | 48 (19–71) | 41 (21–72) |
| Total | 9 | 16 | 37 |
| Males (%) | 4 (44.4) | 7 (43.8) | 25 (67.6) |
| Age mean (range) | 35 (9–80) | 32 (8–59) | 32 (13–61) |
Figure 1Summary of the differentially expressed (DE) genes in the pairwise contrasts comparing the TB study groups. (A) Total number of DE genes (adjusted p < 0.01 and absolute Log2 Fold Change (L2FC) >1) and number of up- or down-regulated genes. (B) Venn diagram with the overlapping genes between the different signatures. The volcano plots highlight the genes with significant differences between (C) active TB patients vs. uninfected (NoTBI) contacts and (D) active TB patients vs. contacts with latent infection (LTBI). The Top-30 most modulated (genes with the greatest absolute fold change) genes were labeled.
Figure 2Reactome Pathway Enrichment Analysis of genes differentiating Active TB vs. NoTBI (A) and vs. LTBI (B). The Gene count indicates the number of genes from the input list found on each pathway. The adjusted p-value (p.adj) indicates the significance of the enrichment.
Figure 3Hierarchical Clustering analysis of the active TB patients and their contacts from the Spanish cohort based on the expression of the differentially expressed genes. (A) Heatmap based on the 259 DE genes between Active TB and NoTBI contacts. (B) Heatmap based on the 133 genes between active TB and LTBI contacts. Each column of the heatmaps represents one sample and each row represents one gene. Both the samples and the genes have been clustered based on the similarity of their expression pattern. The color of the cells indicates the expression of each gene for the corresponding sample. The main enriched pathways related to each gene cluster are included.
Figure 4Schematic representation of the steps followed to classify the LTBI samples. Differentially Expressed (DE) genes between patients with confirmed infection (28 Active TB) and uninfected individuals (41 NoTBI) were used to create a class-prediction model. A leave-one-out Cross-validation (LOOCV) was used to select the classification algorithm. Random Forest algorithm, the one with the best performance in the LOOCV, was used to build a class-prediction model using the samples from the Spanish cohort (training set) as an input. The model was validated on an independent Mozambican cohort (test set). The validated model was then applied to the LTBI samples from the Spanish cohort to study their similarity to either the active TB patients or the uninfected contacts. Two subgroups were identified within the LTBI contacts, named TB-like and NoTB-like.
Figure 5Genes differentially expressed in LTBI subgroups compared between them and to NoTBI and active TB patients. (A) Table summarizing the number of DE genes on each pair-wise comparison (columns vs. rows). (B) Venn diagram showing the overlapping genes between signatures. (D–G) Volcano plots highlighting the genes with significant (adjusted p < 0.05) fold Change between groups. Up-regulated genes (Log2(Fold Change) >1) in green and down-regulated genes (Log2(Fold Change) < -1) in red. The differential expression analysis was made using the DESeq2 R package, comparing all annotations of the reference genome (34947 annotations). Genes PPP1R11 and AC004556, which showed a log2(Fold Change) < -20, are not represented in the volcano plot (C).
Figure 6Pathway enrichment analysis of the 150 DE genes between TB-like and NoTB-like subgroups. (A) Enriched pathways from the Reactome database. The color of the bars indicates the significance [adjusted p-value (padj)] of the enrichment. (B) String protein association network. Each node (circle) represents a protein (identified by their coding gene) and the edges represent protein interactions. The main enriched pathways have been circled in a matching color with the correspondent Reactome pathway.
Immunological variables in TB-like and NoTB-like patients.
| Leukocyte count (106 cells/mL) | 6058.57 | 7700 | |
| Neutrophils (%) | 57.95 | 59.83 | 1 |
| Lymphocytes (%) | 31 | 26.83 | 0.413 |
| Monocytes (%) | 7.67 | 9.33 | |
| Eosinophils (%) | 2.48 | 3 | 0.809 |
| Basophils (%) | 0.76 | 0.5 | 0.224 |
| IL-6 | 42.26 | 93.36 | |
| IL-7 | 6.4 | 23.58 | |
| IP-10 | 240.12 | 438.99 | |
| TGFα | 9.91 | 25 | |
| TNFα | 34.84 | 32.51 | 0.143 |
| BCA-1 | 23.38 | 24.84 | 0.34 |
| IL-27 | 317.47 | 579.14 | |
Bold values indicate those that were statistically significant.