| Literature DB >> 19347046 |
Kathy Boon1, Nathaniel W Bailey, Jun Yang, Mark P Steel, Steve Groshong, Dolly Kervitsky, Kevin K Brown, Marvin I Schwarz, David A Schwartz.
Abstract
BACKGROUND: Idiopathic pulmonary fibrosis (IPF) is a progressive, chronic interstitial lung disease that is unresponsive to current therapy and often leads to death. However, the rate of disease progression differs among patients. We hypothesized that comparing the gene expression profiles between patients with stable disease and those in which the disease progressed rapidly will lead to biomarker discovery and contribute to the understanding of disease pathogenesis. METHODOLOGY AND PRINCIPALEntities:
Mesh:
Substances:
Year: 2009 PMID: 19347046 PMCID: PMC2661376 DOI: 10.1371/journal.pone.0005134
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Clinical and demographic variables
| Variable | Progressive group (n = 6) | Relatively Stable group (n = 6) |
| Age | 64.8±6.7 | 66.7±5.5 |
| Sex male/female | 5/1 | 3/3 |
|
| ||
| never | 3 | 2 |
| ever | 3 | 4 |
| current | 1 | 1 |
| % predicted DLCO baseline | 58.7±11.0 | 56.0±10.9 |
| Actual change in DLCO | −6.4±1.5 | −2.50±3.34 |
| Change in % predicted DLCO | −20.8±4.8 | −2.33±21.77 |
| % predicted FVC baseline | 61.2±18.1 | 68.0±16.9 |
| Actual change in FVC | −0.73±0.4 | −0.24±0.5 |
| Change in % predicted FVC | −16.0±7.8 | −4.17±13.9 |
The percent predicted DLCO and FVC values at baseline, as well as the changes in actual and predicted (DLCO or FVC) values over a 12 months period are indicated as the mean with standard deviation for each group.
Figure 1Forced vital capacity (FVC) and carbon monoxide diffusing capacity (DLCO) values.
The percent predicted DLCO (A) and FVC (B) values are indicated at baseline and end point for the two IPF disease groups. The progressive group is represented in red and the relatively stable group in blue.
Figure 2Analysis of the IPF transcriptome.
(A) Hierarchical clustering based on the gene expression profiles of 12 IPF and 5 normal lung SAGE libraries described in Table S1. The branch length in the dendrogram represents the distance or relatedness between the samples; the shorter the branch the higher the similarity between samples. In yellow are indicated up regulated and in blue down regulated genes. (B) Tag to gene mapping classification of the 1,121 transcript tags significantly over expressed in IPF when compared to normal lung parenchyma. (C) Most significant canonical pathways associated with pulmonary fibrosis according to the IPA pathway analysis tool. The significance of the association between the dataset and the canonical pathway was measured as a ratio (number of genes from the dataset that map to the pathway divided by the total number of molecules that exist in the canonical pathway). A Fischer's exact test was used to calculate a P-value. (D) Hierarchical clustering based on the 293 transcriptional signature that distinguished IPF from normal lung parenchyma.
Figure 3Differentially expressed genes in the lung parenchyma from the relatively stable and progressive IPF.
(A) Selection criteria applied in order to find significantly differentially expressed genes. (B) Relative mRNA expression of selected genes. Real-time PCR reactions were performed in triplicate, and the threshold cycle numbers were averaged. Gene expression levels were normalized to GAPDH, and PGK1. The genes ADM (adrenomedullin), Plunc (palate, lung and nasal epithelium carcinoma associated), and SPP1 (osteopontin) were selected as up regulated; and RTKN2 (rhotekin 2) as down regulated in the progressive group. The values obtained for the relatively stable group was arbitrarily set to one to calculate a fold difference. The fold difference in the progressive group is indicated by solid bars and the levels in the relatively stable group are represented by the patterned bars. The differences were not significant as calculated by a Mann-Whitney test. (C) Paraffin-embedded tissue was stained with Plunc antibodies and counterstained with hematoxylin. A representative IPF sample shows strong staining of the secretory/goblet type of bronchial columnar cells (10X magnification). (D) Control bronchial normal lung tissue showed no staining (10X magnification).
Top 50 differentially expressed genes in progressive group
| ID | Tag Sequence | Log2 ratio | Symbol | Gene description |
| 1 |
| 3.5 | DNAJB1 | DnaJ (Hsp40) homolog, subfamily B, member 1 |
| 2 |
| 3.5 | MEST | Mesoderm specific transcript homolog (mouse) |
| 3 |
| 3.3 | ZFP36 | Zinc finger protein 36, C3H type, homolog (mouse) |
| 4 |
| 3.3 | CCL2 | Chemokine (C-C motif) ligand 2 |
| 5 |
| 3.3 | EGR1 | Early growth response 1 |
| 6 |
| 3.2 | SFTPA1 | Surfactant, pulmonary-associated protein A1 |
| 7 |
| 3.2 | PLUNC | Palate, lung and nasal epithelium carcinoma associated |
| 8 |
| 3.2 | HSPA1A | Heat shock 70kDa protein 1A |
| 9 |
| 3.2 | SPATS2 | Spermatogenesis associated, serine-rich 2 |
| 10 |
| 3.2 | SRI | Sorcin |
| 11 |
| 3.2 | IPO4 | Importin 4 |
| 12 |
| 3.2 | CMAS | Cytidine monophosphate N-acetylneuraminic acid synthetase |
| 13 |
| 3.2 | LAPTM4B | Lysosomal associated protein transmembrane 4 beta |
| 14 |
| 3.2 | KLF4 | Kruppel-like factor 4 (gut) |
| 15 |
| 3.2 | TBCA | Tubulin-specific chaperone a |
| 16 |
| 3.0 | GADD45B | Growth arrest and DNA-damage-inducible, beta |
| 17 |
| 3.0 | SFTPA2 | Surfactant, pulmonary-associated protein A2B, shorter alternative transcript |
| 18 |
| 3.0 | REEP5 | Receptor accessory protein 5 |
| 19 |
| 3.0 | TM4SF1 | Transmembrane 4 L six family member 1 |
| 20 |
| 3.0 | CTSB | Cathepsin B |
| 21 |
| 3.0 | METRNL | Meteorin, glial cell differentiation regulator-like |
| 22 |
| 3.0 | SNCG | Synuclein, gamma (breast cancer-specific protein 1) |
| 23 |
| 3.0 | ITGB1BP1 | Integrin beta 1 binding protein 1 |
| 24 |
| 3.0 | GNS | Glucosamine (N-acetyl)-6-sulfatase (Sanfilippo disease IIID) |
| 25 |
| 3.0 | TMEM30A | Transmembrane protein 30A |
| 26 |
| −2.6 | SIN3B | SIN3 homolog B, transcription regulator (yeast) |
| 27 |
| −2.6 | ARHGEF4 | Rho guanine nucleotide exchange factor (GEF) 4 |
| 28 |
| −2.6 | RIP | RPA interacting protein |
| 29 |
| −2.6 | YLPM1 | YLP motif containing 1 |
| 30 |
| −2.6 | TMEM125 | Transmembrane protein 125, internally primed |
| 31 |
| −2.6 | CHRDL1 | Chordin-like 1 |
| 32 |
| −2.6 | NEK4 | NIMA (never in mitosis gene a)-related kinase 4 |
| 33 |
| −2.6 | PITPNM2 | Phosphatidylinositol transfer protein, membrane-associated 2 |
| 34 |
| −2.6 | ARHGAP30 | Rho GTPase activating protein 30 |
| 35 |
| −2.6 | mitochondrial | |
| 36 |
| −2.6 | SMARCA4 | SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 4 |
| 37 |
| −2.6 | MYO18A | Myosin XVIIIA |
| 38 |
| −2.6 | SLC11A2 | Solute carrier family 11 (proton-coupled divalent metal ion transporters), 2 |
| 39 |
| −2.6 | NHLRC2 | NHL repeat containing 2 |
| 40 |
| −2.6 | GTF2I | General transcription factor II, i |
| 41 |
| −2.3 | CYP2B7P1 | Cytochrome P450, subfamily B, polypeptide 7 pseudogene 1 |
| 42 |
| −2.3 | C9orf61 | Chromosome 9 open reading frame 61 |
| 43 |
| −2.3 | FAM46A | Family with sequence similarity 46, member A |
| 44 |
| −2.3 | CD37 | CD37 antigen |
| 45 |
| −2.3 | ZCCHC4 | Zinc finger, CCHC domain containing 4 |
| 46 |
| −2.3 | ARRDC2 | Arrestin domain containing 2 |
| 47 |
| −2.3 | PAK2 | P21 (CDKN1A)-activated kinase 2 |
| 48 |
| −2.3 | P29 | CCNDBP1 interactor |
| 49 |
| −2.3 | DST | Dystonin |
| 50 |
| −2.3 | TAL1 | T-cell acute lymphocytic leukemia 1 |
The top 25 up and down regulated genes in the progressive disease group are indicated. A complete list of the differentially expressed 191 transcript tags and their corresponding matching genes group is presented in .
Candidate Biomarkers for disease progression based on IPA database survey
| Gene Symbol | Gene Description | Location | Fold change | B | BAL | P/S | Sp | L |
| HLA-DQA1 | major histocompatibility complex, class II, DQ A1 | Plasma Membrane | 14 | x | x | |||
| MEST | mesoderm specific transcript homolog (mouse) | Extracellular Space | 11 | x | x | |||
| CCL2 | chemokine (C-C motif) ligand 2 | Extracellular Space | 10 | x | x | x | x | |
| ZFP36 | zinc finger protein 36, C3H type, homolog | Nucleus | 10 | x | x | |||
| HSPA1A | heat shock 70kDa protein 1A | Cytoplasm | 9 | x | x | x | ||
| KLF4 | Kruppel-like factor 4 (gut) | Nucleus | 9 | x | x | |||
| PLUNC | palate, lung and nasal epithelium carcinoma ass. | Extracellular Space | 9 | x | x | x | ||
| SRI | Sorcin | Cytoplasm | 9 | x | x | |||
| ADM | adrenomedullin | Extracellular Space | 8 | x | x | x | ||
| AQP1 | aquaporin 1 (Colton blood group) | Plasma Membrane | 8 | x | x | |||
| CTSB | cathepsin B | Cytoplasm | 8 | x | x | x | x | |
| DYNLT1 | dynein, light chain, Tctex-type 1 | Cytoplasm | 8 | x | x | |||
| GADD45B | Growth arrest and DNA-damage-inducible, beta | Cytoplasm | 8 | x | x | |||
| TMBIM1 | transmembrane BAX inhibitor motif containing 1 | Unknown | 8 | x | x | |||
| PSMA7 | proteasome (macropain) subunit, alpha type, 7 | Cytoplasm | 7 | x | x | x | x | |
| UBB | ubiquitin B | Cytoplasm | 7 | x | x | |||
| CD14 | CD14 molecule | Plasma Membrane | 6 | x | x | x | x | |
| CDC42 | cell division cycle 42 | Cytoplasm | 6 | x | x | x | ||
| LGALS3 | lectin, galactoside-binding, soluble, 3 | Extracellular Space | 6 | x | x | x | x | |
| NAGLU | N-acetylglucosaminidase, a-Sanfilippo IIIB | Cytoplasm | 6 | x | x | x | x | |
| PTPRF | Protein tyrosine phosphatase, receptor type, F | Plasma Membrane | 6 | x | x | x | x | |
| SPP1 | secreted phosphoprotein 1 (osteopontin) | Extracellular Space | 6 | x | x | x | ||
| APOA1BP | apolipoprotein A-I binding protein | Extracellular Space | 5 | x | x | |||
| CD74 | CD74, major histocompatibility complex II | Plasma Membrane | 5 | x | x | |||
| CD276 | CD276 molecule | Plasma Membrane | 5 | x | x | |||
| EIF3A | eukaryotic translation initiation factor 3A | Cytoplasm | 5 | x | x | |||
| FKBP2 | FK506 binding protein 2, 13kDa | Cytoplasm | 5 | x | x | |||
| FOXA1 | forkhead box A1 | Nucleus | 5 | x | x | x | ||
| PDIA4 | Protein disulfide isomerase family A, member 4 | Cytoplasm | 5 | x | x | x | x | |
| RHOB | ras homolog gene family, member B | Cytoplasm | 5 | x | x | |||
| TBCA | Tubulin folding cofactor A | Cytoplasm | 5 | x | x | |||
| TES | testis derived transcript (3 LIM domains) | Plasma Membrane | 5 | x | x | |||
| VDAC3 | Voltage-dependent anion channel 3 | Cytoplasm | 5 | x | x | x | ||
| ACTR3 | ARP3 actin-related protein 3 homolog (yeast) | Plasma Membrane | 4 | x | x | x | ||
| DST | dystonin | Plasma Membrane | −5 | x | x | x | x | |
| TAL1 | T-cell acute lymphocytic leukemia 1 | Nucleus | −5 | x | x | |||
| AHNAK | AHNAK nucleoprotein | Nucleus | −6 | x | x | x | ||
| BIRC6 | baculoviral IAP repeat-containing 6 (apollon) | Cytoplasm | −6 | x | x | x | ||
| EMR4 | Egf-like, mucin-like, hormone receptor 4 | Plasma Membrane | −6 | x | x | |||
| ITPKB | inositol 1,4,5-trisphosphate 3-kinase B | Cytoplasm | −6 | x | x | |||
| MGAT4A | mannosyl-glycoprotein, transferase 4A | Unknown | −6 | x | x | x | ||
| SIN3B | SIN3 homolog B, transcription regulator (yeast) | Nucleus | −6 | x | x | x | ||
| SMARCA4 | SWI/SNF related, matrix associated, actin dependent regulator of chromatin A4 | Nucleus | −6 | x | x | |||
| YLPM1 | YLP motif containing 1 | Nucleus | −6 | x | x | x | ||
| ARHGEF10 | Rho guanine nucleotide exchange factor 10 | Cytoplasm | −7 | x | x | |||
| SAFB2 | scaffold attachment factor B2 | Unknown | −7 | x | x | x | ||
| KIAA1217 | KIAA1217 | Cytoplasm | −9 | x | x | x |
Based on the literature available in the IPA database the cellular localization and detection in bodily fluids/tissue is indicated. Fold change is represented as the difference in expression level between the progressive and relatively stable group. B = blood; BAL = Bronchoalveolar Lavage Fluid; P/S = Plasma/Serum; SP = Sputum; L = Lung.
Figure 4Heat map SAGE molecular signature.
(A) Unsupervised clustering of gene expression patterns of IPF lung SAGE libraries described in Table S1, based on the expression signature of 134 transcripts showing a clear distinction between relatively stable (slow) and progressive (rapid) IPF. (B–C) Hierarchical clustering of 8 IPF samples previously identified as a slow and accelerated variants [13] based on 90 (B) or 58 (C) genes in common with the SAGE 134 molecular signature.
Figure 5Biological differences between progressive and relatively stable disease groups in IPF.
(A) Ingenuity Canonical Pathway analysis showing the most significant pathways associated with the datasets of up and down regulated genes in the progressive group. The significance of the association between the dataset and the canonical pathway was measured as a ratio (number of genes from the dataset that map to the pathway divided by the total number of molecules that exist in the canonical pathway). A Fischer's exact test was used to calculate a P-value. (B) Main molecular and cellular functions significantly associated with the datasets of up and down regulated genes in the progressive group according to the IPA functional analysis tool.
Functional network analysis of differentially expressed genes in the progressive group
| ID | Molecules in Network | Score (1) | Number Focus Genes | Top Functions | Number associated genes (2) | P-Values (3) |
| 1 |
| 43 | 21 | Cell Signaling Cell Death Cancer | 10 14 14 | 1.45-E05 – 1.2E-02 3.67E-05 – 1.59E-01 1.04E-04 – 2.02E-02 |
| 2 |
| 33 | 17 | Hepatic System Disease Gene ExpressionCardiovascular System Development and Function | 3 7 9 | 1.18E-05 – 8.49E-02 8.2E-05 – 3.59E-02 1.48E-04 – 7.0E-02 |
| 3 |
| 30 | 16 | Gene ExpressionCancerTumor Morphology | 11166 | 6.38E-09 – 1.33E-032.68E-08 – 1.41E-032.68E-08 – 6.24E-04 |
| 4 | ABCB1B, | 25 | 14 | CancerCellular MovementCell-to-Cell Signaling and Interaction | 211411 | 1.28E-07 – 1.64E-011.28E-07 – 1.25E-031.45E-07 – 1.35E-03 |
| 5 | amino acids, AREG, BCL2L1, | 14 | 9 | Amino Acid MetabolismPost-Translational ModificationSmall Molecule Biochemistry | 121415 | 3.27E-10 – 1.54E-033.27E-10 – 2.01E-033.27E-10 – 4.02E-03 |
Up regulated genes are indicated in bold. Down regulated genes are underlined. Genes not altered in our signature are indicated in plain text. (1) Negative logarithm of the P-value; indicating the likelihood that the focus genes within a network are grouped as a result of random chance; using a confidence level of 99% IPA regards a score ≥2 as significant. (2) Number of significantly associated genes with the corresponding molecular function. (3) Range of significances of the associated genes for the corresponding molecular function.
Figure 6Network analysis.
The network map represents the interaction between members of two networks highlighting the crosstalk between the multiple differentially expressed genes in the progressive group. Nodes represent genes, and theirs shapes represent the functional classes of the gene products. Solid lines indicate a direct interaction and dashed lines indicate an indirect interaction.