Literature DB >> 30720061

Bioinformatic analysis of next‑generation sequencing data to identify dysregulated genes in fibroblasts of idiopathic pulmonary fibrosis.

Chau-Chyun Sheu1, Wei-An Chang1, Ming-Ju Tsai1, Ssu-Hui Liao1, Inn-Wen Chong2, Po-Lin Kuo1.   

Abstract

Idiopathic pulmonary fibrosis (IPF) is a lethal fibrotic lung disease with an increasing global burden. It is hypothesized that fibroblasts have a number of functions that may affect the development and progression of IPF. However, the present understanding of cellular and molecular mechanisms associated with fibroblasts in IPF remains limited. The present study aimed to identify the dysregulated genes in IPF fibroblasts, elucidate their functions and explore potential microRNA (miRNA)‑mRNA interactions. mRNA and miRNA expression profiles were obtained from IPF fibroblasts and normal lung fibroblasts using a next‑generation sequencing platform, and bioinformatic analyses were performed in a step‑wise manner. A total of 42 dysregulated genes (>2 fold‑change of expression) were identified, of which 5 were verified in the Gene Expression Omnibus (GEO) database analysis, including the upregulation of neurotrimin (NTM), paired box 8 (PAX8) and mesoderm development LRP chaperone, and the downregulation of ITPR interacting domain containing 2 and Inka box actin regulator 2 (INKA2). Previous data indicated that PAX8 and INKA2 serve roles in cell growth, proliferation and survival. Gene Ontology analysis indicated that the most significant function of these 42 dysregulated genes was associated with the composition and function of the extracellular matrix (ECM). A total of 60 dysregulated miRNAs were also identified, and 1,908 targets were predicted by the miRmap database. The integrated analysis of mRNA and miRNA expression data, combined with GEO verification, finally identified Homo sapiens (hsa)‑miR‑1254‑INKA2 and hsamiR‑766‑3p‑INKA2 as the potential miRNA‑mRNA interactions in IPF fibroblasts. In summary, the results of the present study suggest that dysregulation of PAX8, hsamiR‑1254‑INKA2 and hsamiR‑766‑3p‑INKA2 may promote the proliferation and survival of IPF fibroblasts. In the functional analysis of the dysregulated genes, a marked association between fibroblasts and the ECM was identified. These data improve the current understanding of fibroblasts as key cells in the pathogenesis of IPF. As a screening study using bioinformatics approaches, the results of the present study require additional validation.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30720061      PMCID: PMC6414167          DOI: 10.3892/ijmm.2019.4086

Source DB:  PubMed          Journal:  Int J Mol Med        ISSN: 1107-3756            Impact factor:   4.101


Introduction

Idiopathic pulmonary fibrosis (IPF) is a chronic, progressive, fibrotic lung disease. IPF predominantly affects elderly men. Patients with IPF usually present with exertional dyspnea, dry cough and inspiratory bibasilar Velcro-like crackles on lung auscultation. At present, high-resolution computed tomography is the most important diagnostic tool for IPF, which typically presents with usual interstitial pneumonia pattern (1,2). IPF is a lethal lung disease. The prognosis of IPF is poorer compared with a number of types of cancer (3), with an estimated median survival of 3-5 years if untreated (2). The incidence of IPF, hospitalization rates and mortality due to IPF have increased during previous years, suggesting an increasing global burden of this disease (4-6). The pathogenesis of IPF remains largely unknown. Data from observational studies suggest that cigarette smoking, air pollution (7), environmental exposure (8), chronic viral infection (9) and esophageal reflux (10) predispose individuals to IPF (4). There is also increasing evidence for genetic predisposition to IPF: Recent genome-wide association studies identified that several genetic variants, primarily involved in epithelial cell-cell adhesion and integrity, the innate immune response, host defense and DNA repair, are associated with increased risk of IPF (11-13). These together indicate that gene-environmental interactions serve important roles in the pathogenesis of IPF. In genetically predisposed individuals, repetitive environmental exposures lead to an aberrant injury-remodeling process (14). Activated lung epithelial cells produce pro-fibrotic growth factors, chemokines and other mediators, resulting in abnormal wound healing characterized by mesenchymal transition of epithelial cells, activation and differentiation of fibroblasts and myofibroblasts. These fibroblasts and myofibroblasts secrete excessive amounts of extracellular matrix proteins including fibrillary collagens, vimentin and fibronectin (15), contributing to the destruction of lung architecture by progressive scarring. It is hypothesized that fibroblasts have a number of functions that may affect the development and progression of IPF (16). Although previous studies have described the differences between IPF and normal lung fibroblasts (17-21), the knowledge concerning the cellular and molecular mechanisms associated with fibroblasts in IPF remains limited, and their specific roles requires additional investigation (16,22). For an improved understanding of the roles fibroblasts serve in the pathogenesis of IPF, the present study used a next generation sequencing (NGS) platform and a step-wise bioinformatic approach to analyze the expression levels of mRNAs and microRNAs (miRNAs) and their interactions in IPF fibroblasts compared with normal lung fibroblasts. The aims of the present study were to identify the top differentially expressed genes and subsequently elucidate the function of these dysregulated genes, and to explore potential miRNA-mRNA interactions in IPF fibroblasts.

Materials and methods

Study design

The study design is summarized in Fig. 1. Firstly, cell cultures of the IPF fibroblasts and healthy lung fibroblasts were generated. Then, total RNA of the fibroblasts were extracted and sent for RNA and small RNA sequencing using the NGS platform. The differentially expressed genes (>2 fold-change) were analyzed with Ingenuity® Pathway Analysis (IPA) and the Database for Annotation, Visualization and Integrated Discovery (DAVID) for pathway analysis and functional interpretation. These dysregulated genes were additionally verified in representative Gene Expression Omnibus (GEO) datasets. The differentially expressed miRNAs (>2 fold-change) were analyzed with miRmap (mirmap.ezlab. org) (23) for target prediction. Then, Venn diagrams were used to determine genes with potential miRNA-mRNA interactions. These potential miRNA-mRNA interactions were confirmed using a second miRNA prediction database, TargetScan (www.targetscan.org) (24). Finally, a literature search for the functions of these dysregulated genes was performed, and a hypothesis was generated.
Figure 1

Schematic illustration of the study design. Following culture of the IPF fibroblasts and healthy lung fibroblasts, the total RNA were extracted and deep sequenced using a next-generation sequencing platform. The differentially expressed genes (>2 fold-change) were identified, and analyzed with IPA and DAVID for pathway analysis and functional interpretation. The GEO IPF databases were analyzed to confirm dysregulated genes identified in the present study. Conversely, the differentially expressed miRNAs (>2 fold-change) were analyzed with miRmap for target prediction. Then, genes with potential miRNA-mRNA interactions were determined by Venn diagram analysis. These miRNA-mRNA interactions were verified by a second miRNA prediction database, TargetScan. Finally, a literature search was performed and a hypothesis was generated. IPF, idiopathic pulmonary fibrosis; IPA, Ingenuity® Pathway Analysis; DAVID, Database for Annotation, Visualization, and Integrated Discovery; GEO, Gene Expression Omnibus.

Culture of primary cells

Diseased human lung fibroblasts (DHLF)-IPF (cat. no. CC-7231) and normal human lung fibroblasts (NHLF; cat. no. CC-2512) were purchased from Lonza Walkersville Inc. (Walkersville, MD, USA). The cells were grown in FGW™-2 Fibroblast Growth Medium-2 Bulletkit™ (Lonza Walkersville Inc.; cat. no. CC-3132), containing 0.5 ml human fibroblast growth factor-basic, 0.5 ml insulin, 10 ml fetal bovine serum and 0.5 ml GA-1000. To avoid losing any original characteristics over multiple generations, the DHLF-IPF and NHLF cells used for the NGS analysis were harvested from the 1st generation of cells following cultivation from the primary cells.

NGS

The expression profiles of miRNAs and mRNAs were examined using NGS. Protocol was described in our previous studies (25-27). In brief, total RNA from the DHLF-IPF and NHLF cells were extracted with TRIzol® reagent (Thermo Fisher Scientific, Inc., Waltham, MA, USA) at Welgene Biotech Co., Ltd. (Taipei, Taiwan). Purified RNA was quantified at an optical density of 260 nm using an ND-1000 spectrophotometer (NanoDrop Technologies; Thermo Fisher Scientific, Inc.) and the RNA quality was examined using a Bioanalyzer 2100 (Agilent Technologies, Inc., Santa Clara, CA, USA) with RNA 6000 LabChip® kit (Agilent Technologies, Inc.). Next, the samples were prepared for library construction and sequencing using an Illumina preparation kit (Illumina, Inc., San Diego, CA, USA). For small RNA sequencing, the harvested cDNA constructs containing 18-40-nucleotide RNA fragments (140-155 nucleotides in length with all adapters) were selected. The libraries were then sequenced on an Illumina instrument (75 single-end cycles). Following trimming and removal of reads with low quality scores using Trimmomatics (28), the qualified reads were analyzed with miRDeep2 (29) and the UCSC Genome Browser (genome.ucsc.edu/) (30). The miRNAs with low levels (<1 normalized read per million) in either DHLF-IPF or NHLF cells were excluded. For transcriptome sequencing, the library was constructed with a SureSelect Strand Specific RNA Library Preparation kit (Agilent Technologies, Inc.), and sequenced on a Solexa platform (150 paired-end cycles), using the TruSeq SBS kit (Illumina, Inc.). Similar with small RNA sequencing data, Trimmomatics (version 0.36) was implemented to trim or remove the reads with low quality score. Qualified reads were analyzed using HISAT2 (version 2.1.0) (31). The genes with low expression levels [<0.3 fragment per kilobase of transcript per million mapped reads (FPKM)] in either DHLF-IPF or NHLF cells were excluded. The 2-tailed P-values were calculated [2(Cufflinks version 2.2.1)] with non-grouped samples used ‘blind mode’, in which all samples were treated as replicates of a single global ‘condition’ and used to build one model for statistical analysis (32,33). Genes with P<0.05 [(-log10) P-value >1.3] and >2 fold-changes were considered to be significantly dysregulated genes.

miRmap and TargetScan database analysis

miRmap is an open-source software library, which provides comprehensive miRNA-target prediction (23). By calculating the complementary ability of miRNA-mRNA interactions, the putative target genes of each miRNA may be identified. By combining thermodynamic, evolutionary, probabilistic and sequence-based features, the predictor also estimates mRNA-repression strength for ranking potential candidate targets. The prediction produces a list of putative target genes with an miRmap score, a predictive reference value. In the present study, an miRmap score ≥99 was used as the criterion for selecting putative miRNA targets. TargetScan is an online database for predicting biological targets of miRNAs (24). It searches for the presence of conserved 8, 7 and 6 mer sites, which match the seed region of each miRNA. Predictions are ranked based on the predicted targeting efficacy or the probability of conserved targeting.

IPA

IPA (Qiagen Inc., Valencia, CA, USA) (34) is a web-based software application for the analysis, integration and interpretation of data derived from ‘omics experiments, including RNA sequencing, small RNA sequencing, microarrays, metabolomics and proteomics. IPA enables the rapid understanding and visualization of data, and provides a range of data in addition to pathway analysis, including the identification of key regulators and activity to explain expression patterns, prediction of downstream effects on biological and disease processes, and provision of targeted data on genes, proteins, chemicals and drugs. The dysregulated genes from the IPF fibroblasts were uploaded to IPA (version 2.3) to identify the top canonical pathways, upstream regulators, and molecular and cellular functions.

DAVID database analysis

DAVID is a powerful gene functional classification tool that integrates multiple functional annotation databases, including Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes databases (https://david.ncifcrf.gov/) (35,36). By calculating the similarity of global annotation profiles with an agglomeration algorithm method, a list of notable genes may be classified into clusters of associated ‘Biological Process, ‘Cellular Components’, and ‘Molecular Functions’. It also provides an Expression Analysis Systematic Explorer (EASE) score, a modified one-tailed Fisher’s Exact P-value, for analysis. The reference score represents how specifically the user genes are involved in the category. An EASE score=0.1 was selected as the default, and =1 to extend clustering range in the analysis in the present study.

GEO database analysis

The GEO is a web database that collects submitted high-throughput gene-expression data of microarrays, chips or NGS (www.ncbi.nlm.nih.gov/geo) (37). Microarrays of accession numbers GSE24206, which included whole lung tissue samples from patients with early and advanced IPF (38) and GSE44723, which included samples of cultured lung fibroblasts from patients with IPF (39), were used in the present study. GSE44723 assessed gene expression in fibroblasts obtained from 4 normal controls, 4 patients with rapidly progressing IPF and 6 patients with slowly progressing IPF. GSE24206 assessed gene expression in whole lung biopsy or explant samples from 6 healthy controls, 8 patients with early IPF and 9 patients with advanced IPF. The databases used the platform of Affymetrix Human Genome U133 Plus 2.0 Array. Raw data extracted from the GEO were replotted using GraphPad Prism 7 software (GraphPad Software, Inc., La Jolla, CA, USA).

Statistical analysis

Statistical analyses were performed using SAS 9.4 software (SAS Institute Inc., Cary, NC, USA). To identify the differentially expressed genes, the Wilcoxon rank-sum test was used to compare gene expressions between two groups (patients with IPF vs. controls), and the Kruskal-Wallis test followed by Benjamini-Hochberg multiple-testing corrections was used to compare gene expressions across three groups. P<0.05 was considered to indicate a statistically significant difference. Dysregulated genes identified from the IPF and normal fibroblasts NGS analysis were considered verified if they met the following criteria: Exhibiting concordant up-/downregulation in the GSE24206 and GSE44723 datasets, and demonstrating significantly different expression between patients with IPF and controls or across three groups in at least one dataset.

Results

Gene expression profiles and miRNA changes in IPF fibroblasts

Samples from DHLF-IPF and NHLF cells were sequenced for mRNA and small RNA using an NGS platform. The volcano plot of dysregulated genes in fibroblasts of patients with IPF (DHLF-IPF cells) compared with normal controls (NHLF cells) is presented in Fig. 2A. The red nodes represent the significantly upregulated genes, and the green nodes represent significantly downregulated genes. The density plot of the deep sequencing results between the DHLF-IPF and NHLF cells is presented in Fig. 2B. The gene expression of DHLF-IPF cells demonstrated increased FPKM values and increased density compared with NHLF cells. Gene expression analysis revealed 42 differentially expressed genes with fold-change >2, including 23 upregulated and 19 down-regulated genes (Table I). miRNA expression analysis revealed 60 miRNAs with fold-change >2, including 43 upregulated and 17 downregulated miRNAs (Table II). According to miRmap analysis, 1,908 targets were predicted from these dysregulated miRNAs.
Figure 2

Differential gene expression patterns between IPF and normal fibroblasts. (A) The volcano plot of -log10 (P-value) vs. log2 (fold-change) demonstrated differentially expressed genes in DHLF-IPF vs. NHLF. Genes with -log10 (P-value) >1.3 and >2 fold-changes are plotted in red (upregulated genes) or green (downregulated genes). (B) The frequency distribution of FPKM between DHLF-IPF cells and NHLF cells was compared and presented in the density plot. IPF, idiopathic pulmonary fibrosis; FPKM, fragments per kilobase of transcript per million mapped reads; DHLF, diseased human lung fibroblasts; NHLF, normal human lung fibroblasts.

Table I

Dysregulated gene in IPF fibroblasts compared with normal lung fibroblasts.

A, Upregulated
GeneFull gene nameDHLF-IPF FPKMNHLF FPKMFold-changeP-value
HSPA12BHeat shock protein family A (Hsp70) member 12B59.623.2818.180.005
NTMNeurotrimin117.088.4013.940.032
PAX8Paired box 886.9013.696.350.000
MKI67Marker of proliferation Ki-6716.753.255.150.045
POSTNPeriostin42.918.355.140.024
MYO1CMyosin IC21.874.345.040.020
LINC01529Long intergenic non-protein coding RNA 152941.5412.833.240.007
COL1A2Collagen type I alpha 2 chain69.4723.422.970.015
CCNL1Cyclin L138.6113.412.880.031
ATP2B4ATPase plasma membrane Ca2+ transporting 429.6110.642.780.017
ANP32BAcidic nuclear phosphoprotein 32 family member B52.7519.242.740.037
PKMYT1Protein kinase, membrane associated tyrosine/threonine 1571.51212.092.690.032
GAB2GRB2-associated binding protein 223.348.872.630.005
GPM6BGlycoprotein M6B39.5715.382.570.029
TDGThymine DNA glycosylase32.7913.382.450.048
HSPA8Heat shock protein family A (Hsp70) member 894.9339.542.400.041
CD248CD248 molecule25.0110.792.320.036
LINC00847Long intergenic non-protein coding RNA 84791.6441.902.190.026
ANKRD18EPAnkyrin repeat domain 18E, pseudogene98.9945.652.170.022
CNN3Calponin 370.1133.292.110.047
MESDMesoderm development LRP chaperone93.6644.852.090.028
IQGAP1IQ motif containing GTPase activating protein 145.4022.512.020.023
LINC01291Long intergenic non-protein coding RNA 1291161.0280.152.010.010

B, Downregulated

TYW1tRNA-YW synthesizing protein 1 homolog24.22146.89-6.060.044
COL4A1Collagen type IV alpha 1 chain4.1123.87-5.810.006
AKAP12A-kinase anchoring protein 124.2423.09-5.450.017
ITPRID2ITPR interacting domain containing 24.9623.91-4.820.042
PRKAR1BProtein kinase cAMP-dependent type I regulatory subunit beta30.41131.62-4.330.036
INKA2Inka box actin regulator 211.3847.14-4.140.006
AKAP2A-kinase anchoring protein 212.3047.75-3.880.002
GPR17G protein-coupled receptor 1714.8852.29-3.510.008
FAM151AFamily with sequence similarity 151 member A11.4736.15-3.150.018
UXT-AS1UXT antisense RNA 148.81151.15-3.100.028
IL17RCInterleukin 17 receptor C8.1823.55-2.880.033
SPDYE6Speedy/RINGO cell cycle regulator family member E610.4530.07-2.880.037
MED31Mediator complex subunit 3125.5569.73-2.730.041
FEM1BFem-1 homolog B14.7735.13-2.380.010
NCLNucleolin19.5145.91-2.350.030
PTX3Pentraxin 320.3544.55-2.190.042
OR7E12POlfactory receptor family 7 subfamily E member 12 pseudogene126.50268.42-2.120.013
SPON1Spondin 116.9334.91-2.060.030
COL4A2Collagen type IV alpha 2 chain12.6525.46-2.010.040

IPF, idiopathic pulmonary fibrosis; FPKM, fragments per kilobase of transcript per million mapped reads; DHLF, diseased human lung fibroblast; NHLF, normal human lung fibroblast.

Table II

Dysregulated miRNAs in IPF fibroblasts vs. normal lung fibroblasts.

miRNADHLF-IPF Seq (norm)NHLF Seq (norm)Fold-changeUp/Down
hsa-miR-10b-5p3,357.09317.710.57Up
hsa-miR-412-5p50.65.868.63Up
hsa-miR-329-3p6.771.694.01Up
hsa-miR-4661-5p4.511.173.85Up
hsa-miR-452117.74.823.67Up
hsa-miR-3130-3p3.81.173.25Up
hsa-miR-369-3p26.738.343.21Up
hsa-miR-615-3p11.173.523.17Up
hsa-miR-766-3p5.581.823.07Up
hsa-miR-548am-5p5.111.693.02Up
hsa-miR-5000-3p5.461.823.00Up
hsa-miR-619-5p3.441.172.94Up
hsa-miR-486-5p424.28145.32.92Up
hsa-miR-493-5p369.05132.012.80Up
hsa-miR-3613-5p10.453.782.76Up
hsa-miR-625-3p5.72.082.74Up
hsa-miR-33b-5p4.751.822.61Up
hsa-miR-382-3p14.495.62.59Up
hsa-miR-379-3p33.3813.552.46Up
hsa-miR-199b-5p471.8192.862.45Up
hsa-miR-127-3p292.67120.672.43Up
hsa-miR-12542.791.172.38Up
hsa-miR-54331.9513.422.38Up
hsa-miR-549a15.926.912.30Up
hsa-miR-193a-3p5.72.482.30Up
hsa-miR-66524.1110.562.28Up
hsa-miR-376c-3p20.559.122.25Up
hsa-miR-487a-3p32.914.992.19Up
hsa-miR-582-3p4.872.222.19Up
hsa-miR-134-5p174.7380.142.18Up
hsa-miR-1185-5p3.681.692.18Up
hsa-miR-493-3p159.0573.892.15Up
hsa-miR-136-3p424.16198.722.13Up
hsa-miR-136-5p12.355.862.11Up
hsa-miR-1185-1-3p49.0623.332.10Up
hsa-miR-654-3p1,401.49669.932.09Up
hsa-miR-185-3p6.1832.06Up
hsa-miR-548ao-3p2.141.042.06Up
hsa-miR-668-3p3.211.562.06Up
hsa-miR-410-3p865.2424.422.04Up
hsa-miR-409-3p3,074.391,512.262.03Up
hsa-miR-494-3p6.33.132.01Up
hsa-miR-11977.843.912.01Up
hsa-miR-182-5p47.6395.52−2.01Down
hsa-miR-422a1.663.39−2.04Down
hsa-miR-3200-3p3.87.95−2.09Down
hsa-miR-335-5p49.53103.99−2.10Down
hsa-miR-138-1-3p87.18189.86−2.18Down
hsa-miR-365a-5p1.433.13−2.19Down
hsa-miR-5699-5p1.072.35−2.20Down
hsa-miR-664a-3p2.385.86−2.46Down
hsa-miR-15382.265.6−2.48Down
hsa-miR-4662a-5p2.977.56−2.55Down
hsa-miR-335-3p341.14920.39−2.70Down
hsa-miR-29c-5p1.193.39−2.85Down
hsa-miR-9-5p1.544.69−3.05Down
hsa-miR-44612.617.95−3.05Down
hsa-miR-36621.434.82−3.37Down
hsa-miR-47671.314.82−3.68Down
hsa-miR-204-5p45.37170.97−3.77Down

IPF, idiopathic pulmonary fibrosis; DHLF, diseased human lung fibroblast; NHLF, normal human lung fibroblast; Up, upregulation; Down, downregulation; hsa, Homo sapiens; miR, microRNA.

IPA and DAVID analysis of dysregulated genes in IPF fibroblasts

The pathways and functional annotation of the 42 dysregulated genes in IPF fibroblasts were first analyzed using IPA. The results are summarized in Table III. The top canonical pathways were cAMP-mediated signaling, GP6 signaling pathway, cardiac-adrenergic signaling, hepatic fibrosis/hepatic stellate cell activation and calcium transport I. The top upstream regulators were fibrillin-1, bromodo-main-containing protein 4, transcription factor GATA-6, TSIX and DNA-binding protein RFX5. Fig. 4 demonstrates the network of these top 5 upstream regulators and dysregulated genes. The top molecular and cellular functions included cell cycle, cell morphology, cellular development, lipid metabolism and molecular transport.
Table III

Summary of Ingenuity® Pathway Analysis of 42 dysregulated genes in idiopathic pulmonary fibrosis fibroblasts.

CategoriesP-valueOverlap (%)No. of molecules
Top canonical pathways
 cAMP-mediated signaling0.00074/225 (1.8)-
 GP6 signaling pathway0.00183/131 (2.3)-
 Cardiac-adrenergic signaling0.00213/140 (2.1)-
 Hepatic fibrosis/hepatic stellate cell activation0.00453/183 (1.6)-
 Calcium transport I0.01811/10 (10.0)-
Top upstream regulators
 FBN10.0002--
 BRD40.0021--
 GATA60.0030--
 TSIX0.0037--
 RFX50.0037--
Top molecular and cellular function
 Cell cycle0.0013-0.0181-6
 Cell morphology0.0018-0.0395-4
 Cellular development0.0018-0.0106-4
 Lipid metabolism0.0018-0.0199-2
 Molecular transport0.0018-0.0055-2

IPA, Ingenuity® Pathway Analysis; IPF, idiopathic pulmonary fibrosis; FBN1, fibrillin-1; BRD4, bromodomain-containing protein 4; GATA6, transcription factor GATA-6; RFX5, DNA-binding protein RFX5.

Figure 4

Network of top 5 upstream regulators and dysregulated genes in idiopathic pulmonary fibrosis fibroblasts. The top 5 upstream regulators, including FBN1, BRD4, GATA6, TSIX and RFX5, were generated using Ingenuity® Pathway Analysis based on the significance of overlap between the dysregulated genes and the genes that were regulated by the upstream regulators. The dysregulated genes targeted by these top upstream regulators were COL1A2, COL4A1 and COL4A2, all belonging to the fibrillar collagen family. Numbers in parentheses are the number of reports supporting interactions. E, expression; FC, expression fold-change; PD, protein-DNA binding; CLO1A2, collagen alpha-2(I) chain; COL4A1, collagen alpha-1(IV) chain; COL4A2, collagen alpha-2(IV) chain; FBN1, fibrillin-1; BRD4, bromodomain-containing protein 4; GATA6, transcription factor GATA-6; RFX5, DNA-binding protein RFX5.

The functional annotation of the 42 dysregulated genes was subsequently analyzed using DAVID. Notably, results within the ‘Biological Process’, ‘Cellular Component’ and ‘Molecular Function’ categories all indicated that the most significant function of these dysregulated genes was associated with the composition and function of the extracellular matrix (ECM) (Fig. 3).
Figure 3

Gene Ontology analysis of dysregulated genes identified in idiopathic pulmonary fibrosis fibroblasts. Functional annotation of the 42 dysregulated genes was determined using Database for Annotation, Visualization, and Integrated Discovery. Results within the (A) Biological Process, (B) Cellular Component and (C) Molecular Function categories all indicated that the most significant function of these genes was associated with the composition and function of extracellular matrix.

GEO database analysis of dysregulated genes in IPF fibro- blasts

To additionally verify the 42 dysregulated genes in clinical samples from patients with IPF, the GEO database was searched and two representative microarray datasets were selected: GSE44723 (IPF cultured lung fibroblasts) and GSE24206 (early and advanced IPF whole lung tissues). Dysregulated genes were considered verified if they exhibited concordant up-/downregulation in the two GSE44723 and GSE24206 datasets, with significantly different expression between patients with IPF and controls in at least one dataset. The results are summarized in Table IV. Based on the GEO analysis, neurotrimin (NTM), paired box 8 (PAX8) and mesoderm development LRP chaperone (MESD) were verified as the upregulated genes, and ITPR interacting domain containing 2 (ITPRID2) and Inka box actin regulator 2 (INKA2) as the downregulated genes in IPF. The detailed expression levels of these 5 genes in each group were plotted and compared in Fig. 5.
Table IV

Gene Expression Omnibus verification of 42 dysregulated genes in IPF fibroblasts.

A, Upregulated genes
GenesFold-changeGSE44723
GSE24206
Up-/DownregulationP-valueUp-/DownregulationP-value
HSPA12B18.15N/A-Up0.835
NTM13.94Up0.257Up0.010
PAX86.35Up0.341Up0.032
MKI675.15Down0.582Up0.017
POSTN5.14Down0.688Up0.008
MYO1C5.04Down0.810Up0.259
LINC015293.24N/A-N/A-
COL1A22.97Down0.397Up<0.001
CCNL12.88Up0.737Down<0.001
ATP2B42.78Up0.147Up0.159
ANP32B2.74Up0.443Down0.159
PKMYT12.69Down0.929Up0.832
GAB22.63Down0.873Down<0.001
GPM6B2.57Up0.890Down0.013
TDG2.45N/A-Down0.113
HSPA82.40N/A-Up0.081
CD2482.32Down0.255Up0.115
LINC008472.19Down0.053Up0.091
ANKRD18EP2.17N/A-N/A-
CNN32.11N/A-Down0.095
MESD2.09Up0.958Up0.002
IQGAP12.02Up0.437Down0.717
LINC012912.01N/A-N/A-

B, Downregulated genes

TYW1-6.06Up0.201Down0.716
COL4A1-5.81Down0.224Up0.354
AKAP12-5.45Up0.410Down0.004
ITPRID2-4.82Down0.019Down0.031
PRKAR1B-4.33Down0.194Up0.767
INKA2-4.14Down0.213Down0.001
AKAP2-3.88N/A-Down0.007
GPR17-3.51N/A-Down0.460
FAM151A-3.15N/A-Down0.081
UXT-AS1-3.10N/A-N/A-
IL17RC-2.88Down0.841Down0.630
SPDYE6-2.88N/A-Down0.220
MED31-2.73Up0.478Up0.831
FEM1B-2.38Up0.873Down0.024
NCL-2.35N/A-Down0.166
PTX3-2.19Up0.918Down<.001
OR7E12P-2.12N/A-Down0.937
SPON1-2.06Up0.986Down0.036
COL4A2-2.01Down0.214Up0.584

Fold-change indicates the results of RNA-seq analysis from the next-generation sequencing data. GEO, Gene Expression Omnibus; IPF, idiopathic pulmonary fibrosis; N/A, not available.

Figure 5

Gene Expression Omnibus analysis of 5 dysregulated genes in IPF fibroblasts. Representative microarray datasets GSE44723 (slowly and rapidly progressing IPF: cultured lung fibroblasts) and GSE24206 (early and advanced IPF: whole lung tissue) were used for verification of differential gene expressions identified from Next Generation Sequencing analysis. The results indicated that the expression of INKA2 and ITRPID2 were downregulated and PAX8, MESD and NTM were upregulated in IPF. Cultured lung fibroblasts and whole lung from healthy subjects were used as the normal controls. P-values were calculated using the Wilcoxon rank-sum test for comparisons of two groups, and the Kruskal-Wallis test for comparisons of three groups. Adjusted P-values were calculated using the Kruskal-Wallis test followed by Benjamini-Hochberg multiple-testing corrections. *Adjusted P<0.05, **adjusted P<0.01 and ***adjusted P<0.001. IPF, idiopathic pulmonary fibrosis; INKA2, Inka box actin regulator 2; ITPRID2, ITPR interacting domain containing 2; PAX8, paired box 8; MESD, mesoderm development LRP chaperone; NTM, neurotrimin.

Identification of dysregulated genes with potential miRNA-mRNA interactions in IPF fibroblasts

Venn diagram analysis of the 1,908 target genes of 60 dysregulated miRNAs, predicted from miRmap, and the 42 dysregulated genes revealed 5 dysregulated genes with potential miRNA-mRNA interactions in IPF fibroblasts (Fig. 6). A total of 2 miRNA-mRNA interactions, Homo sapiens (hsa)-miR-185-3p-heat shock protein family A member 12B (HSPA12B) and hsa-miR185-3p-GRB associated binding protein 2 (GAB2), were excluded due to a lack of biological plausibility (the miRNA and mRNA were upregulated). The remaining 3 genes with miRNA-mRNA interactions were subsequently verified in a second miRNA predicting database, TargetScan. The results identified hsa-miR-185-3p-TRNA-YW synthesizing protein 1 homolog (TYW1), hsa-miR-3662- glycoprotein M6B (GPM6B), hsa-miR-1254-INKA2 and hsa-miR-766-3p-INKA2 as the potential miRNA-mRNA interactions in IPF fibroblasts (Table V).
Figure 6

Venn diagram analysis of miRNA-mRNA interactions in idiopathic pulmonary fibrosis fibroblasts. RNA sequencing revealed 42 dysregulated genes (left). Small RNA sequencing revealed 60 dysregulated miRNAs, which predicted 1,908 target genes (right) based on miRmap database. Venn diagram analysis identified 5 dysregulated genes with potential miRNA-mRNA interaction. miRNA, microRNA.

Table V

Dysregulated genes with potential miRNA-mRNA interaction in IPF fibroblasts.

GeneGene full namemRNA fold-changemiRNAmiRNA fold-changemiRmap scoreTargetScan Total context++ score
TYW1tRNA-YW synthesizing protein 1 homolog-6.06hsa-miR-185-3p2.0699.12-0.37
GPM6Bglycoprotein M6B2.44hsa-miR-3662-3.3799.81-0.09
INKA2Inka box actin regulator 2-4.14hsa-miR-1254hsa-miR-766-3p2.383.0799.2899.09-0.32-0.67

IPF, idiopathic pulmonary fibrosis; miRNA, microRNA; hsa, Homo sapiens; TYW1, TRNA-YW synthesizing protein 1 homolog; GPM6B, glycoprotein M6B; INKA2, Inka box actin regulator 2.

Discussion

The pathogenic mechanisms of IPF have not yet been fully elucidated. The hallmarks of IPF are aberrant activation of lung epithelial cells, accumulation of fibroblasts and myofibroblasts, and excessive production of ECM (4,14-16). Previous data indicate that fibroblasts, as one of the key cells, serve a crucial role in the development and progression of IPF (16). An improved understanding of the gene regulation in IPF fibroblasts may assist in the development of novel therapeutics targeting this key cell. In the present study, the whole mRNA and miRNA profiles of IPF fibroblasts were obtained using an NGS platform, and bioinformatics analyses were performed in a step-wise manner. A total of 42 dysregulated genes were identified, and then bioinformatics tools IPA and DAVID were used for pathway and functional analyses. A total of 5 of the differentially expressed genes were verified in the representative GEO datasets, including NTM, PAX8 and MESD (upregulated), and ITPRID2 and INKA2 (downregulated). Integrated analysis of mRNA and miRNA expression data was also performed, and hsa-miR-185-3p-TY W1, hsa-miR-3662-GPM6B, hsa-miR-1254-INKA2 and hsa-miR-766-3p-INKA2 were identified as the potential miRNA-mRNA interactions in IPF fibroblasts. According to the GEO verification, hsa-miR-1254-INKA2 and hsa-miR-766-3p-INKA2 were considered as the most likely dysregulated miRNA-mRNA interactions in IPF fibroblasts. However, these interactions were identified based on bioinformatic analysis. Therefore, they require additional experiments to confirm the results. Hsa-miR185-3p-HSPA12B and hsa-miR185-3p-GAB2 were excluded from subsequent analysis, as the miRNAs and mRNAs were dysregulated in the same manner. There is a possibility of indirect modulation, in that the upregulated hsa-miR185-3p may control one or more other targets. Which may in turn upregulate the expression levels of HSPA12B or GAB2. Nevertheless, the upregulated levels of HSPA12B or GAB2 were not validated in the GEO database analysis. Whether these 2 miRNA-mRNA interactions were excluded or not did not affect the final results. In the GO analysis, it was identified that the most important function of the identified dysregulated genes was associated with the composition and function of the ECM. Replacement of the normal lung structure with an excessive deposition of disorganized collagen and ECM is the hallmark of IPF (40). Although previous evidence suggests that fibroblasts and myofibroblasts in the fibrotic foci are the key cells leading to excessive ECM production (41), the crosstalk between epithelial cells, fibroblasts, myofibroblasts and ECM remains largely uncharacterized. The results from the present study improve the understanding of the fibroblast-ECM interaction in the pathogenesis of IPF. The development of novel therapeutic strategies targeting the fibrotic ECM may provide an opportunity to halt fibrosis and restore organ function (42). A recent study confirmed the importance of the ECM in IPF pathogenesis and treatment: Kwapiszewska et al (43) compared transcriptomic profiles in lung homogenates and fibroblasts obtained from patients with IPF treated with or without pirfenidone. They identified that cell migration-inducing and hyaluronan-binding protein (CEMIP) was markedly downregulated by pirfenidone treatment. They also identified that circulating CEMIP levels were significantly increased in patients with IPF compared with the healthy controls, and that pirfenidone treatment was associated with a significant decrease in CEMIP levels. CEMIP has been previously associated with ECM production, inflammation and cell proliferation (44,45). They concluded that pirfenidone exhibited effects on multiple pathways in fibroblasts and other pulmonary cells, through the regulation of the ECM structure and inflammatory reactions. The INKA2 gene, also known as family with sequence similarity 212 member B or chromosome 1 open reading frame 183, encodes the protein serine/threonine-protein kinase PAK4 (PAK4)-inhibitor INKA2. The PAK4-inhibitor INKA proteins, with 2 isoforms in humans, are endogenous inhibitors of PAK4 (46). The PAK proteins are a family of serine/threonine cyclin-dependent kinase inhibitor 1-activating protein kinases and have been implicated in a range of biological activities (46). PAK4 is an effector molecule for the Rho GTPase cell division control protein 42 homolog. Previous experiments have indicated that active PAK4 protects cells from apoptosis (47,48). PAK4 is associated with tumorigenesis and progression in different types of cancer through promoting cell growth, proliferation, migration and metastasis (49-55). The downregulation of INKA2 identified in the present study may potentially upregulate the expression of PAK4, which in turn promotes cell proliferation and inhibits cell apoptosis in IPF fibroblasts. The NGS data suggested that the FPKM of PAK4 was 0.2922 in IPF fibroblasts, but undetectable in normal lung fibroblasts. Based on the quality control criterion to exclude genes with FPKM <0.3 in either IPF or normal fibroblasts, PAK4 was not able to be analyzed in the present study. The upregulation of PAK4 in IPF fibroblasts required additional confirmation. It was also identified that the miRNAs hsa-miR-1254 and hsa-miR-766-3p, 2 putative regulators of INKA2, were upregulated in IPF fibroblasts. Therefore, the results suggested that the miRNA-mRNA interaction in the INKA2 gene may serve a role in cell growth, proliferation or apoptosis inhibition in IPF fibroblasts. The PAX8 gene encodes protein PAX8, which is a member of the paired box family of transcription factors. PAX8 is involved in the development and differentiation of thyroid follicular cells (56,57), and is associated with the growth of cancer cells (58,59). Overexpression of PAX8 has been observed in a number of types of cancer. It also has a critical role in the survival and proliferation of epithelial cells (60). In the present study, upregulation of PAX8 was identified in IPF fibroblasts. These data suggested that PAX8 may serve a role in promoting the growth, survival or proliferation of fibroblasts in IPF. The upregulation of NTM and MESD (also known as mesoderm development candidate 2), and downregulation of ITPRID2 (also known as Ki-Ras-induced actin-interacting protein or sperm specific antigen 2) were also identified in IPF fibroblasts. Neurotrimin, a neural cell adhesion molecule, may promote neurite outgrowth and adhesion via a homophilic mechanism (61,62). In addition to brain tissue, human tissue-specific expression analysis indicated a high expression of NTM in human lung tissue (63). MESD, a specialized chaperone for low-density lipoprotein receptor-related protein (LRP)5 and LRP6, is a universal inhibitor of Wnt/LRP signaling on the cell surface (64,65). ITPRID2 encodes a protein that is involved in the regulation of filamentous actin and extracellular signals (66), and may be associated with structural integrity and/or signal transductions in human cancer (67). The functions of NTM, MESD and ITPRID2 remain largely unknown. Their associations with human lung diseases, particularly IPF, require additional investigation. The identified gene dysregulation in fibroblasts and their proposed mechanisms in IPF are summarized in Fig. 7. In summary, the present study demonstrated that NTM, PAX8 and MESD were upregulated and ITPRID2 and INKA2 were downregulated in IPF fibroblasts. Dysregulation of PAX8 and INKA2 may participate in the regulation of proliferation and survival in IPF fibroblasts. Functional analysis of the dysregu-lated genes suggested a marked association between fibroblasts and ECM. The integrated analysis of miRNA and mRNA expression profiles suggested that the upregulated miRNAs hsa-miR-1254 and hsa-miR-766-3p may downregulate INKA2 and serve important roles in IPF. These data improve the current understanding of fibroblasts as key cells in the pathogenesis of IPF. As a screening study using bioinformatics approaches, without biological or cross-platform replicates, the results of the present study require additional validation.
Figure 7

Proposed mechanisms demonstrating gene regulations of fibroblasts and their potential roles in IPF. Upregulation of hsa-miR-1254 and hsa-miR-766-3p may inhibit INKA2 expression, leading to overexpression of PAK4, which may promote fibroblast proliferation and migration, and protect fibroblasts from apoptosis. Overexpression of PAX8 may also have a role in fibroblast proliferation. The functions of NTM, MESD and ITPRID2 remain unknown. Whether they contribute to the pro-proliferative and pro-fibrotic microenvironment requires additional study. Based on the results of the Gene Ontology analysis, the gene dysregulations in IPF fibroblasts may also have a role in the excessive production and deposition of disorganized ECM. The hsa-miR-1254-INKA2 and hsa-miR-766-3p-INKA2 interactions were identified based on bioinformatic analysis, and therefore requires additional experiments to confirm. IPF, idiopathic pulmonary fibrosis; hsa, Homo sapiens; INKA2, Inka box actin regulator 2; PAK4, serine/threonine-protein kinase PAK4; PAX8, paired box 8; MESD, mesoderm development LRP chaperone; NTM, neurotrimin; ITPRID2, ITPR interacting domain containing 2; ECM, extracellular matrix.

  67 in total

1.  Occupational and environmental risk factors for idiopathic pulmonary fibrosis: a multicenter case-control study. Collaborating Centers.

Authors:  K B Baumgartner; J M Samet; D B Coultas; C A Stidley; W C Hunt; T V Colby; J A Waldron
Journal:  Am J Epidemiol       Date:  2000-08-15       Impact factor: 4.897

2.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

Authors:  Da Wei Huang; Brad T Sherman; Richard A Lempicki
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

3.  PI3K p110γ overexpression in idiopathic pulmonary fibrosis lung tissue and fibroblast cells: in vitro effects of its inhibition.

Authors:  Enrico Conte; Elisa Gili; Mary Fruciano; Martina Korfei; Evelina Fagone; Maria Iemmolo; Debora Lo Furno; Rosario Giuffrida; Nunzio Crimi; Andreas Guenther; Carlo Vancheri
Journal:  Lab Invest       Date:  2013-02-25       Impact factor: 5.662

4.  Increased prevalence of gastroesophageal reflux in patients with idiopathic pulmonary fibrosis.

Authors:  R W Tobin; C E Pope; C A Pellegrini; M J Emond; J Sillery; G Raghu
Journal:  Am J Respir Crit Care Med       Date:  1998-12       Impact factor: 21.405

5.  Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics.

Authors:  Linn Fagerberg; Björn M Hallström; Per Oksvold; Caroline Kampf; Dijana Djureinovic; Jacob Odeberg; Masato Habuka; Simin Tahmasebpoor; Angelika Danielsson; Karolina Edlund; Anna Asplund; Evelina Sjöstedt; Emma Lundberg; Cristina Al-Khalili Szigyarto; Marie Skogs; Jenny Ottosson Takanen; Holger Berling; Hanna Tegel; Jan Mulder; Peter Nilsson; Jochen M Schwenk; Cecilia Lindskog; Frida Danielsson; Adil Mardinoglu; Asa Sivertsson; Kalle von Feilitzen; Mattias Forsberg; Martin Zwahlen; IngMarie Olsson; Sanjay Navani; Mikael Huss; Jens Nielsen; Fredrik Ponten; Mathias Uhlén
Journal:  Mol Cell Proteomics       Date:  2013-12-05       Impact factor: 5.911

6.  Differential Binding of Three Major Human ADAR Isoforms to Coding and Long Non-Coding Transcripts.

Authors:  Josephine Galipon; Rintaro Ishii; Yutaka Suzuki; Masaru Tomita; Kumiko Ui-Tei
Journal:  Genes (Basel)       Date:  2017-02-11       Impact factor: 4.096

7.  Identification of novel genetic regulations associated with airway epithelial homeostasis using next-generation sequencing data and bioinformatics approaches.

Authors:  Chau-Chyun Sheu; Ming-Ju Tsai; Feng-Wei Chen; Kuo-Feng Chang; Wei-An Chang; Inn-Wen Chong; Po-Lin Kuo; Ya-Ling Hsu
Journal:  Oncotarget       Date:  2017-07-31

8.  Bleomycin induces molecular changes directly relevant to idiopathic pulmonary fibrosis: a model for "active" disease.

Authors:  Ruoqi Peng; Sriram Sridhar; Gaurav Tyagi; Jonathan E Phillips; Rosario Garrido; Paul Harris; Lisa Burns; Lorena Renteria; John Woods; Leena Chen; John Allard; Palanikumar Ravindran; Hans Bitter; Zhenmin Liang; Cory M Hogaboam; Chris Kitson; David C Budd; Jay S Fine; Carla M T Bauer; Christopher S Stevenson
Journal:  PLoS One       Date:  2013-04-02       Impact factor: 3.240

9.  Pax8 has a critical role in epithelial cell survival and proliferation.

Authors:  T Di Palma; M G Filippone; G M Pierantoni; A Fusco; S Soddu; M Zannini
Journal:  Cell Death Dis       Date:  2013-07-18       Impact factor: 8.469

10.  An in cellulo-derived structure of PAK4 in complex with its inhibitor Inka1.

Authors:  Yohendran Baskaran; Khay C Ang; Praju V Anekal; Wee L Chan; Jonathan M Grimes; Ed Manser; Robert C Robinson
Journal:  Nat Commun       Date:  2015-11-26       Impact factor: 14.919

View more
  7 in total

1.  Integrative omics analysis identifies biomarkers of idiopathic pulmonary fibrosis.

Authors:  Peiyan Zheng; Shixue Sun; Jingxian Wang; Xiaohua Douglas Zhang; Baoqing Sun; Zhangkai Jason Cheng; Kuan Cheok Lei; Mingshan Xue; Teng Zhang; Huimin Huang
Journal:  Cell Mol Life Sci       Date:  2022-01-11       Impact factor: 9.261

2.  Investigating Novel Genes Potentially Involved in Endometrial Adenocarcinoma using Next-Generation Sequencing and Bioinformatic Approaches.

Authors:  Feng-Hsiang Tang; Wei-An Chang; Eing-Mei Tsai; Ming-Ju Tsai; Po-Lin Kuo
Journal:  Int J Med Sci       Date:  2019-09-07       Impact factor: 3.738

3.  The Potential Effects of Curcumin on Pulmonary Fibroblasts of Idiopathic Pulmonary Fibrosis (IPF)-Approaching with Next-Generation Sequencing and Bioinformatics.

Authors:  Wei-An Chang; Chia-Min Chen; Chau-Chyun Sheu; Ssu-Hui Liao; Ya-Ling Hsu; Ming-Ju Tsai; Po-Lin Kuo
Journal:  Molecules       Date:  2020-11-21       Impact factor: 4.411

4.  Changing Expression Profiles of Messenger RNA, MicroRNA, Long Non-coding RNA, and Circular RNA Reveal the Key Regulators and Interaction Networks of Competing Endogenous RNA in Pulmonary Fibrosis.

Authors:  Xue Liu; Huaman Liu; Xinhua Jia; Rong He; Xinyue Zhang; Wei Zhang
Journal:  Front Genet       Date:  2020-09-24       Impact factor: 4.599

5.  IPF-Fibroblast Erk1/2 Activity Is Independent from microRNA Cluster 17-92 but Can Be Inhibited by Treprostinil through DUSP1.

Authors:  Sabrina Blumer; Lei Fang; Wei-Chih Chen; Petra Khan; Katrin Hostettler; Michael Tamm; Michael Roth; Christopher Lambers
Journal:  Cells       Date:  2021-10-21       Impact factor: 6.600

6.  Nintedanib induces gene expression changes in the lung of induced-rheumatoid arthritis-associated interstitial lung disease mice.

Authors:  Shintaro Mikami; Yoko Miura; Shinji Kondo; Kosuke Sakai; Hiroaki Nishimura; Hiroyuki Kyoyama; Gaku Moriyama; Nobuyuki Koyama; Hideki Noguchi; Hirotsugu Ohkubo; Satoshi Kanazawa; Kazutsugu Uematsu
Journal:  PLoS One       Date:  2022-06-17       Impact factor: 3.752

7.  Identification and Validation of Potential Biomarkers and Pathways for Idiopathic Pulmonary Fibrosis by Comprehensive Bioinformatics Analysis.

Authors:  Weibin Qian; Xinrui Cai; Qiuhai Qian; Xinying Zhang
Journal:  Biomed Res Int       Date:  2021-07-04       Impact factor: 3.411

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.