| Literature DB >> 22984568 |
Tong Zhou1, Wei Zhang, Nadera J Sweiss, Edward S Chen, David R Moller, Kenneth S Knox, Shwu-Fan Ma, Michael S Wade, Imre Noth, Roberto F Machado, Joe G N Garcia.
Abstract
Sarcoidosis, a systemic granulomatous syndrome invariably affecting the lung, typically spontaneously remits but in ~20% of cases progresses with severe lung dysfunction or cardiac and neurologic involvement (complicated sarcoidosis). Unfortunately, current biomarkers fail to distinguish patients with remitting (uncomplicated) sarcoidosis from other fibrotic lung disorders, and fail to identify individuals at risk for complicated sarcoidosis. We utilized genome-wide peripheral blood gene expression analysis to identify a 20-gene sarcoidosis biomarker signature distinguishing sarcoidosis (n = 39) from healthy controls (n = 35, 86% classification accuracy) and which served as a molecular signature for complicated sarcoidosis (n = 17). As aberrancies in T cell receptor (TCR) signaling, JAK-STAT (JS) signaling, and cytokine-cytokine receptor (CCR) signaling are implicated in sarcoidosis pathogenesis, a 31-gene signature comprised of T cell signaling pathway genes associated with sarcoidosis (TCR/JS/CCR) was compared to the unbiased 20-gene biomarker signature but proved inferior in prediction accuracy in distinguishing complicated from uncomplicated sarcoidosis. Additional validation strategies included significant association of single nucleotide polymorphisms (SNPs) in signature genes with sarcoidosis susceptibility and severity (unbiased signature genes - CX3CR1, FKBP1A, NOG, RBM12B, SENS3, TSHZ2; T cell/JAK-STAT pathway genes such as AKT3, CBLB, DLG1, IFNG, IL2RA, IL7R, ITK, JUN, MALT1, NFATC2, PLCG1, SPRED1). In summary, this validated peripheral blood molecular gene signature appears to be a valuable biomarker in identifying cases with sarcoidoisis and predicting risk for complicated sarcoidosis.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22984568 PMCID: PMC3440319 DOI: 10.1371/journal.pone.0044818
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Study subjects with racial and complication status.
| Population | Healthy controls | Uncomplicated cases | Complicated cases | ||
| Cardiac | Neurologic | FVC<50% | |||
| AA | 8 | 11 | 5 | 5 | 10 |
| EA | 27 | 6 | 3 | 2 | 1 |
| Total | 35 | 17 | 8 | 7 | 11 |
Amongst the patients with cardiac sarcoidosis, three had severe pulmonary disease. In the patients with neurologic sarcoidosis, five had pulmonary disease. EA: European Americans (Caucasians); AA: African Americans.
Patient characteristics and concomitant medications.
| Characteristics | Uncomplicated sarcoidosis (n = 17) | Complicated sarcoidosis(n = 22) |
| Age | 49±10 | 47±9 |
| Gender (Male/Female) | 5/12 | 5/17 |
| FVC, L | 2.9±0.8 | 2.7±1.5 |
| FVC, percent of predicted | 74±17 | 65±31 |
| FEV1, L | 2.2±0.7 | 2.1±1 |
| FEV1, percent of predicted | 74±17 | 67±30 |
| DLCO, percent of predicted | 74±23 | 65±28 |
| Corticosteroids, n (dose, mg prednisone equivalent per day) | 7 (20±16) | 11 (13±11) |
| Methotrexate, n (dose, mg per week) | 3 (12.25±3.5) | 7 (11±4) |
| Mycophenolate, n (dose, mg per day) | 1 (250) | 3 (667±289) |
| Anti-TNF alpha therapy, n | 0 | 3 |
Figure 1Identifying gene signatures in sarcoidosis. Panel A. Enriched pathways among complicated sarcoidosis-associated genes.
The top ranking KEGG pathways are listed for each population. The red line indicates the cutoff of significance (adjusted p-value<0.05). The number of genes in each pathway is shown beside the pathway name. Panel B. Heatmap of patients with complicated sarcoidosis and healthy controls. Red represents increased gene expression; Blue represents down-regulation. “++”: patients with complicated sarcoidosis; “−”: healthy controls. Panel C. Principal component analysis on expression values of the 20-gene signature. X-axis: principal component 1 with eigenvalue; Y-axis: principal component 2 with eigenvalue. Left panel: patients with complicated sarcoidosis and healthy controls; middle panel: patients with complicated sarcoidosis, uncomplicated sarcoidosis and healthy controls; and right panel: patients with complicated sarcoidosis and uncomplicated sarcoidosis. HC: healthy controls; US: patients with uncomplicated sarcoidosis; and CS: patients with complicated sarcoidosis. Panel D. Comparison between the 20-gene signature and the TCR/JS/CCR signaling pathway gene signature. The distribution of prediction accuracy is based on 1,000 times of five-fold cross-validation. The dashed lines indicate the average classification accuracy for the 20-gene signature or the TCR/JS/CCR signaling pathway gene signature. Left panel: all sarcoidosis patients versus healthy controls; and right panel: patients with complicated sarcoidosis versus patients with uncomplicated sarcoidosis.
The unbiased 20-gene signature for complicated sarcoidosis.
| Gene symbol | Gene title | Weight |
|
| fat storage-inducing transmembrane protein 2 | 0.04872 |
|
| heparin-binding EGF-like growth factor | 0.04791 |
|
| teashirt zinc finger homeobox 2 | 0.04648 |
|
| meiosis inhibitor 1 | 0.04218 |
|
| cytokine receptor CRL2 | 0.03851 |
|
| zinc finger protein 540 | 0.03776 |
|
| Sin3A-associated protein, 30kDa | 0.02935 |
|
| zinc finger protein 614 | 0.02715 |
|
| KIAA1147 | 0.02585 |
|
| hypothetical protein LOC100132356 | 0.02561 |
|
| chemokine (C-X3-C motif) receptor 1 | 0.02547 |
|
| RNA binding motif protein 12B | 0.02286 |
|
| FK506 binding protein 1A, 12kDa | 0.02157 |
|
| SERTA domain containing 1 | 0.02119 |
|
| apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 3D | 0.02106 |
|
| killer cell lectin-like receptor subfamily B, member 1 | 0.01979 |
|
| cysteine-rich protein 1 (intestinal) | 0.01889 |
|
| noggin | 0.01724 |
|
| sestrin 3 | 0.01701 |
|
| zinc finger protein 671 | 0.01657 |
Here, the weight of each gene represents the frequency of the gene being selected during the last round of RFE procedure.
The 31 differentially-expressed TCR/JS/CRR signaling pathway genes in sarcoidosis.
| AA | EA | ||||
| Gene symbol | Gene title | Fold change | FDR (%) | Fold change | FDR (%) |
|
| CD247 molecule | 0.63 | 0.0 | 0.62 | 0.2 |
|
| CD28 molecule | 0.60 | 0.3 | 0.55 | 0.5 |
|
| CD3d molecule, delta (CD3-TCR complex) | 0.52 | 0.2 | 0.44 | 0.0 |
|
| CD3e molecule, epsilon (CD3-TCR complex) | 0.67 | 1.0 | 0.49 | 0.2 |
|
| CD3g molecule, gamma (CD3-TCR complex) | 0.52 | 0.2 | 0.45 | 0.0 |
|
| CD8a molecule | 0.84 | 7.4 | 0.63 | 0.8 |
|
| Cas-Br-M (murine) ecotropic retroviral transforming sequence b | 0.68 | 0.5 | 0.65 | 0.2 |
|
| GRB2-related adaptor protein 2 | 0.84 | 7.4 | 0.71 | 1.2 |
|
| IL2-inducible T-cell kinase | 0.52 | 0.2 | 0.42 | 0.0 |
|
| NCK adaptor protein 1 | 0.88 | 11.3 | 0.71 | 0.2 |
|
| RAS guanyl releasing protein 1 (calcium and DAG-regulated) | 0.51 | 0.0 | 0.48 | 0.0 |
|
| discs, large homolog 1 (Drosophila) | 0.73 | 1.5 | 0.69 | 0.8 |
|
| inducible T-cell co-stimulator | 0.59 | 0.2 | 0.61 | 1.7 |
|
| interferon, gamma | 0.56 | 0.0 | 0.74 | 4.7 |
|
| interleukin 7 receptor | 0.69 | 3.4 | 0.51 | 0.5 |
|
| jun oncogene | 0.67 | 4.3 | 2.17 | 23.7 |
|
| lymphocyte-specific protein tyrosine kinase | 0.70 | 0.5 | 0.61 | 0.2 |
|
| mitogen-activated protein kinase 9 | 0.78 | 3.4 | 0.71 | 0.5 |
|
| mucosa associated lymphoid tissue lymphoma translocation gene 1 | 0.66 | 0.3 | 0.69 | 0.5 |
|
| nuclear factor of activated T-cells, cytoplasmic, calcineurin-dependent 2 | 0.59 | 0.0 | 0.57 | 0.0 |
|
| nuclear factor of activated T-cells, cytoplasmic, calcineurin-dependent 3 | 0.66 | 0.0 | 0.71 | 0.2 |
|
| phosphoinositide-3-kinase, catalytic, alpha polypeptide | 0.76 | 2.3 | 0.69 | 0.5 |
|
| phospholipase C, gamma 1 | 0.60 | 0.0 | 0.62 | 0.2 |
|
| v-akt murine thymoma viral oncogene homolog 3 (protein kinase B, gamma) | 0.60 | 0.3 | 0.54 | 0.0 |
|
| zeta-chain (TCR) associated protein kinase 70kDa | 0.72 | 0.5 | 0.66 | 0.5 |
|
| cyclin D2 | 0.62 | 0.3 | 0.51 | 0.0 |
|
| interleukin 2 receptor, alpha | 0.61 | 0.3 | 0.78 | 17.2 |
|
| interleukin 2 receptor, beta | 0.66 | 0.3 | 0.64 | 1.7 |
|
| signal transducer and activator of transcription 4 | 0.61 | 0.3 | 0.52 | 0.0 |
|
| sprouty-related, EVH1 domain containing 1 | 0.60 | 0.0 | 0.77 | 4.7 |
|
| suppressor of cytokine signaling 4 | 0.71 | 0.3 | 0.77 | 0.8 |
EA: Caucasian Americans; AA: African Americans; FDR: false discovery rate.
Figure 2Validation in independent datasets.
The upper panels show the comparison between the 20-gene signature and the TCR/JS/CCR signaling pathway gene signature. The distribution of prediction accuracy is based on 1,000 times of five-fold cross-validation. The dashed lines indicate the average classification accuracy for the 20-gene signature or the TCR/JS/CCR signaling pathway gene signature. The lower panels show the results of principal component analysis on expression values of the 20-gene signature. X-axis: principal component 1 with eigenvalue; Y-axis: principal component 2 with eigenvalue.
SNPs significantly associated with sarcoidosis within the unbiased 20 signature genes (P<0.01).
| Population | SNP chromosome | dbSNP RS ID | Gene symbol | Gene relationship | Sarcoidosis vs healthy controls | Complicated sarcoidosis vs uncomplicated sarcoidosis | ||
| P | OR | P | OR | |||||
| African Americans | 11 | rs629508 | SESN3 | intron | 1.7E-03 | 1.645 | 5.4E-05 | 0.254 |
| 17 | rs7219027 | NOG | downstream | 4.3E-03 | 1.487 | |||
| 20 | rs1294689 | FKBP1A | intron | 4.8E-03 | 1.536 | 3.6E-05 | 2.710 | |
| 11 | rs12280779 | SESN3 | upstream | 5.5E-03 | 1.555 | |||
| 20 | rs201812 | TSHZ2 | intron | 7.5E-03 | 1.438 | |||
| 8 | rs16914980 | RBM12B | downstream | 7.8E-03 | 0.475 | |||
| 8 | rs491546 | RBM12B | downstream | 8.9E-03 | 0.529 | |||
| 8 | rs7821394 | RBM12B | downstream | 9.7E-03 | 0.728 | |||
| European Americans | 8 | rs2618134 | RBM12B | downstream | 4.7E-05 | 2.183 | ||
| 8 | rs6993453 | RBM12B | downstream | 3.1E-04 | 1.819 | |||
| 20 | rs1293381 | TSHZ2 | intron | 3.8E-04 | 0.614 | |||
| 8 | rs2595613 | RBM12B | downstream | 4.3E-04 | 2.357 | |||
| 8 | rs12544183 | RBM12B | downstream | 5.3E-04 | 1.916 | |||
| 17 | rs1914986 | NOG | downstream | 9.6E-04 | 1.946 | |||
| 8 | rs279959 | RBM12B | downstream | 1.2E-03 | 1.621 | |||
| 3 | rs4676483 | CX3CR1 | downstream | 1.7E-03 | 1.671 | |||
| 8 | rs10808648 | RBM12B | downstream | 2.0E-03 | 1.540 | |||
| 20 | rs1326861 | TSHZ2 | downstream | 2.1E-03 | 1.469 | |||
| 11 | rs11021203 | SESN3 | upstream | 4.0E-03 | 0.570 | |||
| 11 | rs16922328 | SESN3 | upstream | 5.5E-03 | 1.599 | |||
| 20 | rs6068555 | TSHZ2 | intron | 6.1E-03 | 0.710 | |||
| 8 | rs549043 | RBM12B | downstream | 6.2E-03 | 1.805 | |||
| 8 | rs566469 | RBM12B | downstream | 6.4E-03 | 1.718 | |||
| 20 | rs6097326 | TSHZ2 | intron | 6.4E-03 | 0.569 | |||
| 20 | rs6068566 | TSHZ2 | downstream | 6.6E-03 | 0.706 | |||
| 3 | rs6773586 | CX3CR1 | upstream | 8.3E-03 | 0.560 | |||
| 8 | rs278586 | RBM12B | downstream | 9.0E-03 | 1.821 | |||
| 8 | rs7829923 | RBM12B | downstream | 9.2E-03 | 1.434 | |||
| 17 | rs17820808 | NOG | downstream | 9.3E-03 | 0.471 | |||
| 20 | rs10485815 | TSHZ2 | intron | 9.8E-03 | 1.653 | 2.8E-05 | 3.535 | |
GWAS results between complicated and uncomplicated sarcoidosis were listed only for the SNPs with P<0.01. OR: odds ratio.
PubMatrix search results for the 20-gene signature against sarcoidosis-related search terms.
| Gene | Sarcoidosis | Tuberculosis | Granulomatous disease | Hypersensitivity pneumonitis | Pulmonary fibrosis |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 1 | 0 | 0 | 0 | 4 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 2 | 114 | 5 | 3 | 3 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
Each number in the table represents the count of literatures containing the corresponding gene name and search term.
Figure 3Boxplot of expression of the 20 signature genes.
The dark grey points and lines indicate the geometric mean of expression in each category. HC: healthy controls; US: patients with uncomplicated sarcoidosis; and CS: patients with complicated sarcoidosis. Y-axis: log2-transformed expression values.