| Literature DB >> 31781157 |
Eduardo Rocha Fukutani1, Pablo Ivan Pereira Ramos1, José Irahe Kasprzykowski1, Lucas Gentil Azevedo1, Moreno Magalhães de Souza Rodrigues2, João Victor de Oliveira Pimenta Lima1, Helton Fábio Santos de Araújo Junior1, Kiyoshi Ferreira Fukutani1,3,4, Artur Trancoso Lopo de Queiroz1.
Abstract
Human T-lymphotropic virus 1 (HTLV-1) was the first recognized human retrovirus. Infection can lead to two main symptomatologies: adult T-cell lymphoma/leukemia (ATLL) and HTLV-1 associated myelopathy/tropical spastic paraparesis (HAM/TSP). Each manifestation is associated with distinct characteristics, as ATLL presents as a leukemia-like disease, while HAM/TSP presents as severe inflammation in the central nervous system, leading to paraparesis. Previous studies have identified molecules associated with disease development, e.g., the downregulation of Foxp3 in Treg cells was associated with increased risk of HAM/TSP. In addition, elevated levels of CXCL10, CXCL9, and Neopterin in cerebrospinal fluid also present increased risk. However, these molecules were only associated with specific patient groups or viral strains. Furthermore, the majority of studies did not jointly compare all clinical manifestations, and robust analysis entails the inclusion of both ATLL and HAM/TSP. The low numbers of samples also pose difficulties in conducting gene expression analysis to identify specific molecular relationships. To address these limitations and increase the power of manifestation-specific gene associations, meta-analysis was performed using publicly available gene expression data. The application of supervised learning techniques identified alterations in two genes observed to act in tandem as potential biomarkers: GBP2 was associated with HAM/TSP, and CD40LG with ATLL. Together, both molecules demonstrated high sample-classification accuracy (AUC values: 0.88 and 1.0, respectively). Next, other genes with expression correlated to these genes were identified, and we attempted to relate the enriched pathways identified with the characteristic of each clinical manifestation. The present findings contribute to knowledge surrounding viral progression and suggest a potentially powerful new tool for the molecular classification of HTLV-associated diseases.Entities:
Keywords: HTLV-1 associated myelopathy/tropical spastic paraparesis; adult T-cell lymphoma/leukemia; bioinformatics; biomarkers; human T-lymphotropic virus 1; meta-analysis
Year: 2019 PMID: 31781157 PMCID: PMC6857459 DOI: 10.3389/fgene.2019.01056
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Description of the datasets used as the Discovery set.
| Accession number | Reference | Symptomatology | Sample number | Tissue |
|---|---|---|---|---|
| GSE55851 | Control | 3 | PBMCs | |
| Asymptomatic | 6 | PBMCs | ||
| ATLL | 12 | PBMCs | ||
| GSE29312 | Control | 9 | PBMCs | |
| Asymptomatic | 20 | PBMCs | ||
| HAM/TSP | 10 | PBMCs | ||
| GSE29332 | Control | 8 | PBMCs | |
| Asymptomatic | 17 | PBMCs | ||
| HAM/TSP | 10 | PBMCs | ||
| Total | Control | 20 | PBMCs | |
| Asymptomatic | 43 | PBMCs | ||
| ATLL | 12 | PBMCs | ||
| HAM/TSP | 20 | PBMCs |
Figure 1(A) Decision tree classification of three different symptomatologies using CD40LG to separate all ATLL samples from the others, and GBP2 to separate 84.2% of the HAM/TSP samples from controls. (B) Scatterplot of CD40LG (Y axis) and GBP2 (X axis) gene expression detailing the dispersion of the analyzed samples. Red lines represent the thresholds suggested by decision tree analysis. (C) ROC curve representing accuracy. An AUC of 0.9016 was found for the control group, 0.8898 for the HAM/TSP group, and 1.000 for ATLL. The red line represents the ATLL group, blue indicates HAM/TSP, and green is indicative of controls.
Figure 2(A) Pathways associated with genes found to be positively correlated with CD40LG and GBP2, grouped according to symptomatology. (B) Pathways associated with genes found to be negatively correlated with CD40LG and GBP2, grouped according to symptomatology. Analysis performed using the following parameters: p-value = 0.05, q-value = 0.2, minimum number of genes to cluster = 20, maximum number of genes to cluster = 500.
Performance of the two-gene signature classifying the samples from validation datasets.
| Accession number | Symptomatology | Tissue | Biomarkers | AUC |
|---|---|---|---|---|
| GSE17718 | Control | CD4+ Lymphocyte | CD40LG and GBP2 | 1.00 |
| ATLL | Cell lines StEd and MT-2 | CD40LG and GBP2 | 1.00 | |
| GSE6034 | Control | CD4+ Lymphocyte | CD40LG and GBP2 | 1.00 |
| ATLL | Cell lines TaY, MT-2 and MT-4 | CD40LG and GBP2 | 1.00 | |
| GSE38537 | Control | CD4+ Lymphocyte | CD40LG and GBP2 | 1.00 |
| HAM/TSP | CD4+ Lymphocyte | CD40LG and GBP2 | 1.00 | |
| GSE33615 | Control | CD4+ Lymphocyte | CD40LG and GBP2 | 1.00 |
| ATLL | PBMCs (Mostly CD4+ Lymphocytes) | CD40LG and GBP2 | 1.00 | |
| GSE19080 | Control | CD4+ Lymphocyte | GBP2 | 0.87 |
| ATLL | CD4+ Lymphocyte | GBP2 | 0.50 | |
| HAM/TSP | CD4+ Lymphocyte | GBP2 | 0.66 | |
| GSE57259 | Control | CD4+ CD25+ CCR4+ Lymphocytes | CD40LG and GBP2 | 1.00 |
| HAM/TSP | CD4+ CD25+ CCR4+ Lymphocytes | CD40LG and GBP2 | 1.00 | |
| ATLL | CD4+ CD25+ CCR4+ Lymphocytes | CD40LG and GBP2 | 1.00 |