Literature DB >> 28436302

Gene set enrichment analysis: A genome-wide expression profile-based strategy for discovering functional microRNA-disease relationships.

Yin Ni1, Caiyun Song2, Shuqing Jin2, Zhoufeng Chen2, Ming Ni3, Lu Han3, Jiansheng Wu2, Yin Jin2.   

Abstract

Objective To explore stable and functional microRNA (miRNA)-disease relationships using a genome-wide expression profile pattern matching strategy. Methods We applied the ranked microarray pattern matching strategy Gene Set Enrichment Analysis to identify miRNA permutations with similar expression patterns to diseases. We also used quantitative reverse transcription PCR to validate the predicted expression levels of miRNAs in three diseases: inflammatory bowel disease (IBD), oesophageal cancer, and colorectal cancer. Results We found that hsa-miR-200 c was upregulated more than 40-fold in oesophageal cancer. The expression of miR-16 and miR-124 was not consistently upregulated in IBD or colorectal cancer. Conclusions Our results suggest that this expression profile matching strategy can be used to identify functional miRNA-disease relationships.

Entities:  

Keywords:  Gene Set Enrichment Analysis; cancer; genome-wide expression pattern; inflammatory bowel disease; microRNA

Mesh:

Substances:

Year:  2017        PMID: 28436302      PMCID: PMC5971487          DOI: 10.1177/0300060517693719

Source DB:  PubMed          Journal:  J Int Med Res        ISSN: 0300-0605            Impact factor:   1.671


Introduction

MicroRNAs (miRNAs) are a class of single-stranded small noncoding RNAs (∼22 nucleotides long) that negatively regulate messenger RNA (mRNA) expression at the post-transcriptional level.[1,2] miRNAs bind to the 3′-untranslated region of target genes through base-pairing, resulting in mRNA degradation and/or translational inhibition.[3] Accumulating evidence shows that miRNAs are involved in multiple biological processes and cellular signalling pathways,[4,5] and that mutations or the dysregulation of miRNAs can cause various diseases. Recently, several studies have identified close relationships between miRNAs and disease,[4,6-9] leading to the construction of dozens of miRNA-related databases. For example, miR2Disease, a resource of miRNAs that are dysregulated in human diseases, currently includes 3273 miRNA–disease relationship entries.[10] A common strategy to explore potential disease-related miRNAs is to identify miRNAs that are differentially expressed in a disease state using technologies such as quantitative reverse transcription (qRT)-PCR, miRNA microarray analysis, or small RNA deep sequencing. Alternatively, cell transcriptional responses to various treatments or perturbations can be compared using algorithms such as Gene Set Enrichment Analysis (GSEA)[11] and large gene expression profiling datasets to quantitatively calculate the relevance of different perturbations. The first large-scale effort to apply this principle was the Connectivity Map project, which aimed to find potential connections among molecule treatments, disease states, and other bioprocesses by querying large-scale expression profiling data and validated gene sets.[12] Since then, several studies have also demonstrated the feasibility of this approach in drug reposition studies.[13-15] Hypothetically, because miRNAs are integrated into regulatory networks that influence target and other downstream genes, cell transcriptional responses against miRNA perturbations (either overexpression or knockdown) may reflect the transcriptional response of the related disease state to some extent. Thus, miRNA–disease relationships can be identified using this transcriptional response comparison strategy. Previously, Jiang et al. suggested that it might be possible to determine miRNA–drug links by integrating miRNA targets with the expression profiles of cancers and cell responses to small molecules.[16] However, to the best of our knowledge, no investigation has directly compared the transcriptional responses induced by both miRNA genotype variation and disease. Recently, our group developed the ExpTreeDB database, which allows users to mine relationships among manually curated perturbations such as agent treatment, genotype variation, disease state, stress, and infection.[17] In this study, we explored miRNA–disease relationships using this methodology and datasets from the Gene Expression Omnibus (GEO). We collected global transcriptional response datasets representing 40 human diseases and 30 miRNA variation treatments. Pair-wise similarities were calculated to identify putative miRNA–disease links for a literature investigation and experimental validation in three diseases: inflammatory bowel disease (IBD), oesophageal cancer, and colorectal cancer. We found that miR-200 c was significantly overexpressed in oesophageal cancer. However, we did not observe the consistently upregulated expression of miR-16 or miR-124 in IBD or colorectal cancer.

Methods

Specimens

Subjects in this study were recruited by Zhejiang Provincial People’s Hospital, and included five oesophageal cancer patients (four men and one woman; average age, 67 years), five colorectal cancer patients (three men and two women; average age, 62 years), and five IBD patients (two Crohn’s disease cases and three ulcerative colitis cases; four women and one man; average age, 38 years). The diagnosis of all patients had been confirmed by pathology. Patients with cancer were free of other malignant neoplasms, and had not undergone radiotherapy or chemotherapy. This work was approved by the Ethics Committee of Zhejiang Provincial People’s Hospital (approval number: 2014KY059). Informed consent was obtained from all subjects prior to their participation. Two 1×1 cm tissue samples and the 5 cm margin of each carcinoma sample were obtained during surgical resection. All samples were washed with saline solution then immediately placed into 2 ml RNAlater solution and stored at 4℃ overnight (>16h). The tissues were then stored at −80℃ until required for analysis. Tissue samples from IBD and corresponding normal tissues were collected during pretreatment endoscopic biopsies and prepared as described above.

RNA isolation from clinical tissues

Total RNA was isolated using TRIzol reagent (Invitrogen, Waltham, MA, USA). Tissues were centrifuged at 2000 rpm, speed 500 xg for 5 min, then cell pellets were homogenized in 1.0 ml TRIzol reagent and incubated at room temperature for 5 min. Each sample was treated with 200 µl of chloroform, repeatedly inverted for 15 s, then incubated at room temperature for 10 min. Samples were then centrifuged at 12,800 rpm for 10 min at 4℃, and the colourless supernatant was transferred to a fresh tube and treated with the same volume of isopropyl alcohol. After incubation at 4℃ for 10 min, samples were again centrifuged at 12,800 rpm for 10 min at 4℃, the supernatant was removed, and the pellet was washed with 75% ethanol. Samples were centrifuged twice at 11,800 rpm for 5 min at 4℃, with removal of the supernatant after the first centrifugation, then the pellet was dried at room temperature and dissolved in RNase-free water. The RNA concentration was quantified by a Nanodrop 2000 Spectrophotometer (Thermo Scientific, Waltham, MA, USA).

qRT-PCR

cDNA was synthesized according to the manufacturer’s protocol (Promega). Briefly, 10 -µl reverse transcription reactions contained 2 nM miRNA RT primers, 500 µM dNTPs, 0.2 µl M-MLV reverse transcriptase, and 1 µg total RNA. Conditions were as follows: 16℃ for 30 min followed by 42℃ for 1 h and 75℃ for 10 min. The real-time PCR system contained 10 µl SYBR Premix Ex Taq, 0.5 µl upstream primer, 0.5 µl downstream primer (2.5 µM), 1 µl cDNA, and 8 µl RNase-free H2O. The reaction was incubated at 95℃ for 30 s, followed by 45 cycles of 95℃ for 5 s, then 60℃ for 30 s. miRNA expression was analysed using the 2 −ΔΔCt method, and U6 was selected as the control gene. Primer sequences are shown in Table S1. SPSS 17.0 software was used to perform statistical analysis. Data were shown as the mean ± SD. The t-test was used for analysis between cancer tissue and adjacent normal tissue or between disease and control group. Significance was defined as P value < 0.05.

Gene expression profiling data collection

We downloaded human global gene expression profiles representing transcriptional responses to miRNA perturbations and disease states from GEO dataset (GDS) records produced by two platforms, the Affymetrix human genome U133 plus 2.0 array (GPL570) and the Affymetrix human genome U133A array (GPL96). These two human microarray platforms are the most frequently used in GEO and include over 20,000 genes, which is favourable for use with the GSEA algorithm.[12,13,18,19] Human GDS records with a “disease state” subtype description were downloaded as disease state datasets and were manually examined in ExpTreeDB.[17] Datasets related to miRNA perturbations were downloaded from GEO series (GSE) resources, and detailed treatments as well as miRNA entries were manually extracted. A similar approach was used to obtain small RNA silencing datasets. We required that one gene expression profile related to a given perturbation must include a blank control group so that transcriptional responses to perturbations could be clearly defined.

Generating a ranked gene list

Within each GEO dataset, we manually selected the experimental group of a particular perturbation. The experimental groups were then manually matched to normal control groups. Specific genotype variations and disease states were defined as “perturbations” to cells that induce transcriptional responses. Following the approach used in ExpTreeDB,[17] we generated a phenotype ranked list (PRL) to denote the transcriptional response to a perturbation such as miRNA overexpression, gene silencing, or disease state. In brief, pair-wise global gene expression fold-changes were calculated, and all fold-change lists under a certain perturbation were merged into a PRL using a hierarchical majority voting scheme to prevent poor representation of heterogeneous experimental settings.[20] Thus, the genes showing the highest level of up/down-regulation (presented as microarray probe names) under each permutation were placed at the top/bottom of the corresponding PRL. For the disease state, two perturbations were defined to a different extent: a description of the disease, such as IBD, and the provision of more detailed information about stage or subtype, such as ulcerative colitis (UC) or Crohn’s disease (CD). We generated PRLs for both definitions, and correlation calculations showed that the two PRLs had high correlation coefficients (Enrichment Score = 0.662, Figure 1). Therefore, in this study we employed a general description of disease states and perturbation definitions (Table 1 and Figure 1).
Figure 1.

A heat map of classified disease correlation scores. Each permutation is represented by a phenotype ranked list where the genes showing the most up-regulation are placed at the top while the most down-regulated genes are at the bottom. The correlation scores were computed by measuring gene regulation in correspondence with the GSEA method to obtain a score from −1 to +1. The color scale shows positive correlations in red and negative correlations in blue.

Table 1.

Data sets used in this study.

TypeTermAccession no.Cell type or disease state pair
Disease stateAdenocarcinoma (oesophagus)GDS1321Barrett’s oesophagus/normal
adenocarcinoma/normal
Alzheimer’s diseaseGDS810moderate AD/control
severe AD/control
incipient AD/control
Bipolar disorderGDS2190bipolar disorder/control
GDS2191bipolar disorder/control
Breast cancerGDS2250basal-like cancer/normal
GDS2617non-basal-like cancer/normal
BRCA1-associated cancer/normal
tumorigenic cancer cell/normal
GDS2635invasive lobular carcinoma/lobular control
invasive ductal carcinoma/lobular control
Carious pulpalGDS1850carious/healthy
Chronic lymphocytic leukaemiaGDS2643Waldenstrom’s macroglobulinaemia/normal
chronic lymphocytic leukaemia/normal
multiple myeloma/normal
Chronic obstructive pulmonary diseaseGDS289chronic obstructive pulmonary disease/control
Colorectal cancerGDS2609early onset colorectal cancer/healthy control
Cushing’s syndromeGDS2374ACTH-dependent Cushing’s syndrome/control
GIP-dependent Cushing’s syndrome/control
GIP-dependent nodule/control
GIP-dependent adenoma/control
DermatomyositisGDS2153dermatomyositis/normal
EmphysemaGDS737severe emphysema/no or mild emphysema
EndometriosisGDS2737endometriosis/normal
Essential thrombocythaemiaGDS1376thrombocythaemia/normal
GDS552ET/normal
GDS761malignant/normal
Familial combined hyperlipidaemiaGDS946familial combined hyperlipidaemia/normal
Heart failureGDS1362ischaemic cardiomyopathy/non-failing heart
non-ischaemic cardiomyopathy/non- failing heart
GDS2154inflammatory dilated cardiomyopathy/healthy
GDS2205dilated cardiomyopathy/non-failing
GDS651idiopathic dilated/normal
ischaemic/normal
Hereditary gingival fibromatosisGDS1685hereditary gingival fibromatosis/normal
HIV infectionGDS2649non-progressive HIV infection/uninfected
early HIV infection/uninfected
chronic HIV infection/uninfected
Huntington’s diseaseGDS1331presymptomatic/normal
symptomatic/normal
Idiopathic myelofibrosisGDS2397idiopathic myelofibrosis/normal
Inflammatory bowel diseaseGDS1615ulcerative colitis/normal
Crohn's disease/normal
GDS559Crohn’s disease/control
ulcerative colitis/control
Kidney sarcomaGDS1282clear cell sarcoma of the kidney/control
Wilms’ tumour/control
GDS505RCC/normal
LeiomyomaGDS484leiomyoma/normal
Lethal congenital contracture syndromeGDS1295LCCS/control
Lung diseaseGDS2142severe cystic fibrosis/normal
mild cystic fibrosis/normal
Malignant pleural mesotheliomaGDS1220malignant pleural mesothelioma/normal
MelanomaGDS1375malignant melanoma/benign nevi
GDS1989lymph node metastasis/normal
vertical growth phase melanoma/ normal
melanoma in situ/normal
atypical nevus/normal
metastatic growth phase melanoma/normal
Muscle diseasesGDS1956juvenile dermatomyositis/normal
Emery–Dreifuss muscular dystrophy/normal
calpain 3 mutation/normal
Duchenne muscular dystrophy/normal
FKRP mutation/normal
amyotophic lateral sclerosis/normal
Becker muscular dystrophy/normal
acute quadriplegic myopathy/normal
fascioscapulohumeral muscular dystrophy/normal
dysferlin mutation/normal
spastic paraplegia/normal
Myelodysplastic syndromeGDS1392myelodysplastic syndrome/normal
rheumatoid arthritis, off treatment/normal
folate deficiency/normal
vitamin B12 deficiency/normal
rheumatoid arthritis, on methotrexate/normal
GDS2118refractory anaemia with excess blasts/normal
refractory anaemia/normal
refractory anaemia with ringed sideroblasts/normal
Non-melanoma skin cancerGDS2200squamous cell carcinoma/normal
actinic keratosis/normal
ObesityGDS268morbidly obese/non-obese
Papillary thyroid carcinomaGDS1665papillary thyroid carcinoma/normal
GDS1732papillary thyroid cancer/normal
Pituitary adenomaGDS1253GH-secreting adenoma/normal
PRL-secreting adenoma/normal
non-functioning adenoma/normal
ACTH-secreting adenoma/normal
GDS2432pituitary adenoma predisposition/control
Polycystic ovary syndromeGDS1050polycystic ovary syndrome/normal
GDS2084polycystic ovary syndrome/control
Prostate cancerGDS1423cancer/normal
GDS1439metastatic/benign
primary/benign
GDS1746metastatic/benign hyperplasia
basaloid/benign hyperplasia
SchizophreniaGDS1917schizophrenia/control
Severe combined immunodeficiencyGDS420SCID/control
Sickle cell plasma effect on pulmonary artery endothelial cellsGDS1257SCD with acute chest syndrome/normal
SCD steady state/normal
Squamous lung cancerGDS1312cancer/normal
TeratozoospermiaGDS2697teratozoospermia/normal
Vulvar intraepithelial neoplasiaGDS2418vulvar intraepithelial neoplasia/control
MicroRNA permutation or related gene silencingDGCR8GSE13639HeLa cells
DROSHAGSE13639HeLa cells
GSE6767HeLa cells
IPO8GSE14054HeLa cells
KSHV microRNA (OE)GSE16355LEC cells
miR-1 (OE)GSE22002HeLa cells
miR-124 (OE)GSE6207HepG2 cell line
miR-125b (KD)GSE19680M07 cells
miR-130b (OE)GSE17386PLC8024 CD133- HCC cells
miR-145 (OE)GSE18625DLD-1 cells
GSE19737MDA-MB-231 cells
miR-146a (KD)GSE21132Jurkat T cells
miR-146a (OE)GSE21132Jurkat T cells
miR-155 (KD)GSE13296dendritic cells
miR-155 (OE)GSE22002HeLa cells
GSE9264HEK293 cells
miR-15a/16-1 (OE)GSE18866CLL-I83E95 cells
miR-16 (KD)GSE24522MMS1 cell line
miR-16 (OE)GSE24522MMS1 cell line
miR-182 (KD)GSE24824human melanoma metastasis
miR-200c (OE)GSE25332Type 2 endometrial cancer cell line, Hec50
miR-210 (KD)GSE16962HUVEC
miR-210 (OE)GSE16962HUVEC
miR-210/338-3p/ 33a/451 (OE)GSE15385T84 cells
miR-221 (KD)GSE19777MCF7-FR
miR-222 (KD)GSE19777MCF7-FR
miR-26a (OE)GSE12278human BL cell lines:NAM (Ramos)
miR-30d (OE)GSE277185B1 melanoma cell line
miR-335 (OE)GSE9586LM2 cell line
miR-338-3p/451 (OE)GSE15385T84 cells
miR-34a (OE)GSE16674K562 cells
GSE7754HCT116 cells
miR-616 (OE)GSE20543LNCaP prostate cells
miR-7 (OE)GSE14507A549 cells
miR-99a (OE)GSE26332C4-2 cells
miR-K12-11 (OE)GSE9264HEK293 cells
top 25 miRNAs (KD)GSE21577HEK 293 cells

AD, Alzheimer’s disease; ACTH, adrenocorticotropic hormone; GIP, gastric inhibitory polypeptide; ET, essential thrombocythaemia; RCC, renal cell carcinoma; LCCS, lethal congenital contracture syndrome; FKRP, fukutin-related protein; GH, growth hormone; PRL, prolactin; SCID, severe combined immunodeficiency; HCC, hepatocellular carcinoma; OE, overexpression; KD, knockdown

A heat map of classified disease correlation scores. Each permutation is represented by a phenotype ranked list where the genes showing the most up-regulation are placed at the top while the most down-regulated genes are at the bottom. The correlation scores were computed by measuring gene regulation in correspondence with the GSEA method to obtain a score from −1 to +1. The color scale shows positive correlations in red and negative correlations in blue. Data sets used in this study. AD, Alzheimer’s disease; ACTH, adrenocorticotropic hormone; GIP, gastric inhibitory polypeptide; ET, essential thrombocythaemia; RCC, renal cell carcinoma; LCCS, lethal congenital contracture syndrome; FKRP, fukutin-related protein; GH, growth hormone; PRL, prolactin; SCID, severe combined immunodeficiency; HCC, hepatocellular carcinoma; OE, overexpression; KD, knockdown

Similarity determinations based on the GSEA algorithm

We quantified pair-wise similarities among PRLs as correlation scores based on GSEA. Following the strategy used by Iorio et al,[13,20] we extracted the top 250 and the bottom 250 probes as gene signatures. The R package Gene Expression Signature with integrated GSEA was used for the similarity calculations.[21] In brief, the enrichment of a signature in a PRL is estimated by the Kolmogorov–Smirnov test for uniform distributions. The signature of permutation A is represented as (up), with up representing the top 250 probe sets and down representing the bottom 250 ones. The final similarity score of PRLs between perturbation A and B (S) is defined as the average enrichment score: is the enrichment score of signature s (up- and down-regulated parts separated) with respect to the PRL p. Thus, positive similarity scores represent similar regulation tendencies, while negative correlation scores represent opposite regulation tendencies.

Results

Overview of miRNA–disease similarities

From GSE records, we obtained transcriptional responses including genotype variation of 30 miRNAs for nine knock-down and 21 overexpression treatments. Variations in Drosha, importin 8 (IPO8), and DGCR8 were also included for their functions in miRNA biogenesis. Transcriptional responses representing 42 distinct disease states were collected from GDS records. Based on a GSEA-centred pipeline, we calculated the pair-wise similarities between the transcriptional responses of miRNA variations and disease states (Figure 2(a)). We found that the overall miRNA–disease similarity values followed a normal distribution (mean = −0.002, variance = 0.074, Figure 2(b)). However, we also observed some outliers with extreme similarity scores.
Figure 2.

Pair-wise similarity scores between the transcriptional responses of miRNA variation and disease states. (a) A heat map of similarity scores. miRNA variations and diseases were sorted by their correlations with each other in the top 5%. (b) A distribution and normal probability plot of similarity scores.

Pair-wise similarity scores between the transcriptional responses of miRNA variation and disease states. (a) A heat map of similarity scores. miRNA variations and diseases were sorted by their correlations with each other in the top 5%. (b) A distribution and normal probability plot of similarity scores.

Correlated miRNA–disease links

We next focused on highly correlated transcriptional responses to miRNA variation and disease states. The top 5% of miRNA–disease links (n = 69) involving 23 miRNAs and 28 diseases, which are referred to as correlated, are shown in Table 2. The correlations were also illustrated by network to exhibit hubs of miRNA variations and diseases. Among these correlations, we observed both “hub” miRNAs and diseases. Four diseases were linked to four miRNA variations, including Cushing’s syndrome, IBD, multiple myeloma, and carious pulp. The overexpression of miR-210 was correlated with eight disease states, with endometriosis found at the 0.43% percentile. The co-overexpression of miR-338-3 p and miR-451 was also correlated with seven diseases. Interestingly, we found that silencing of IPO8 was correlated with five diseases, and that Drosha or DGCR8, also functional during miRNA biogenesis, were within the top 5% of associations.
Table 2.

miRNA–disease relationships within the top five percentile.

miRNA perturbationDiseaseCSPR
miR-182 (KD)Carious pulpal0.30780.07%
IPO8Multiple myeloma0.23760.14%
miR-26a (OE)Idiopathic myelofibrosis0.22470.22%
KSHV microRNA (OE)Inflammatory bowel disease0.22460.29%
miR-182 (KD)Lethal congenital contracture syndrome0.22260.36%
miR-210 (OE)Endometriosis0.20260.43%
miR-16 (OE)Sickle cell plasma effect0.20130.51%
KSHV microRNA (OE)Sickle cell plasma effect0.19720.58%
miR-338-3p/451 (OE)Multiple myeloma0.19530.65%
IPO8Dermatomyositis0.19180.72%
miR-210/338-3p/33a/451 (OE)Multiple myeloma0.19090.79%
miR-16 (OE)Inflammatory bowel disease0.18170.87%
miR-26a (OE)Sickle cell plasma effect0.17940.94%
miR-1 (OE)Carious pulpal0.17841.01%
miR-335 (OE)Polycystic ovary syndrome0.17511.08%
miR-155 (OE)Carious pulpal0.17471.15%
miR-210 (KD)Malignant pleural mesothelioma0.16621.23%
IPO8Inflammatory bowel disease0.16551.30%
miR-210/338-3p/33a/451 (OE)Huntington’s disease0.16211.37%
miR-338-3p/451 (OE)Huntington’s disease0.16161.44%
miR-210 (OE)Kidney sarcoma0.15791.52%
miR-210 (OE)Squamous lung cancer0.15481.59%
miR-335 (OE)Multiple myeloma0.15291.66%
miR-338-3p/451 (OE)Cushing’s syndrome0.15031.73%
miR-7 (OE)Cushing’s syndrome0.14871.80%
miR-335 (OE)Squamous lung cancer0.14671.88%
miR-146a (KD)Cushing’s syndrome0.14651.95%
miR-124 (OE)Sickle cell plasma effect0.14652.02%
miR-200c (OE)Adenocarcinoma (oesophagus)0.14632.09%
miR-1 (OE)Heart failure0.14512.16%
miR-182 (KD)Endometriosis0.14482.24%
IPO8Cushing’s syndrome0.14362.31%
miR-210/338-3p/33a/451 (OE)Cushing’s syndrome0.14232.38%
miR-210 (KD)Squamous lung cancer0.14182.45%
miR-338-3p/451 (OE)Vulvar intraepithelial neoplasia0.14162.53%
miR-182 (KD)Heart failure0.14122.60%
miR-155 (OE)Bipolar disorder0.14052.67%
miR-K12-11 (OE)Carious pulpal0.13982.74%
miR-16 (OE)Bipolar disorder0.13852.81%
miR-338-3p/451 (OE)Idiopathic myelofibrosis0.13812.89%
miR-210 (OE)Carious pulpal0.13412.96%
miR-210/338-3p/33a/451 (OE)Idiopathic myelofibrosis0.13343.03%
miR-124 (OE)Colorectal cancer0.13283.10%
miR-155 (OE)Non-melanoma skin cancer0.13233.17%
KSHV microRNA (OE)Severe combined immunodeficiency disease0.13233.25%
miR-124 (OE)Polycystic ovary syndrome0.13223.32%
miR-30d (OE)Lethal congenital contracture syndrome0.13193.39%
miR-210 (OE)Chronic lymphocytic leukaemia0.13193.46%
miR-338-3p/451 (OE)Inflammatory bowel disease0.13043.54%
miR-335 (OE)Leiomyoma0.12973.61%
miR-155 (OE)Heart failure0.12943.68%
miR-146a (OE)Severe combined immunodeficiency disease0.12833.75%
miR-335 (OE)Waldenstrom’s macroglobulinaemia0.12823.82%
miR-210 (OE)Bipolar disorder0.12793.90%
miR-210 (OE)Leiomyoma0.12663.97%
miR-616 (OE)Huntington’s disease0.12594.04%
miR-210/338-3p/33a/451 (OE)Vulvar intraepithelial neoplasia0.12594.11%
miR-16(OE)Colorectal cancer0.12364.18%
miR-124 (OE)Inflammatory bowel disease0.12304.26%
miR-222 (KD)Cushing’s syndrome0.12284.33%
miR-338-3p/451 (OE)Prostate cancer0.12244.40%
miR-182 (KD)Papillary thyroid carcinoma0.12244.47%
IPO8Severe combined immunodeficiency disease0.12214.55%
top 25 miRNAs (KD)Multiple myeloma0.12094.62%
miR-210 (OE)Papillary thyroid carcinoma0.12064.69%
miR-335 (OE)Melanoma0.12024.76%
miR-146a (KD)Teratozoospermia0.12014.83%
miR-15a/16-1 (OE)Idiopathic myelofibrosis0.11964.91%
miR-16 (OE)Chronic lymphocytic leukaemia0.11834.98%

KD, knockdown; OE, overexpressionl; CS, Correlation Score; PR, Percentile Rank

miRNA–disease relationships within the top five percentile. KD, knockdown; OE, overexpressionl; CS, Correlation Score; PR, Percentile Rank We found that the overexpression of miR-210 was most strongly correlated with disease state, showing a wide range of relationships to different diseases. Of the eight diseases linked to miR-210, four were solid cancers, including lung cancer and kidney sarcoma. IBD was identified as a disease connected with the most miRNA genotype variations, and was predicted to share transcriptional phenotype relationships with IPO8, KSHV miRNA, miR-124, miR-16, and miR-338-3 p/451 dysregulation. We also identified multiple myeloma as a disease with five relationships, but connecting with only one miRNA genotype variation containing a single miRNA type: miR-335.

Experimental validation of miRNA differential expression

We selected correlated miRNA links in both disease state samples and controls. Based on bio-sample availability, we selected oesophageal cancer samples to analyse miR-200 c expression (OE), and colorectal cancer samples to assess miR-16 (OE) and miR-124 expression (OE). We also included IBD samples for miR-124 expression as a comparison with previous studies. We observed consistently high up-regulation of miR-200 c in the oesophageal cancer samples compared with adjacent control samples (Figure 3). Of four sample pairs, miR-200 c was expressed >47-fold in adenocarcinoma, and in another pair it was expressed 261-fold compared with controls. This result strongly agreed with our prediction that microRNA-200c highly correlated with the development of oesophageal cancer. We also identified six genes that were significantly (P < 0.01) downregulated in association with miR-200 c overexpression and in oesophageal cancer samples using GEO2R of the GEO database. We queried these six genes in TarBase,[15] and found that one, ELMO2, has been experimentally validated as a target of miR-200 c.22
Figure 3.

Quantitative RT-PCR assay for miR-200 c expression in paired normal and tumour tissues from five oesophageal adenocarcinoma patients. The expression levels of normal tissues were standardized to 1, and the fold-changes are plotted as means ± SD of three replicates.

Quantitative RT-PCR assay for miR-200 c expression in paired normal and tumour tissues from five oesophageal adenocarcinoma patients. The expression levels of normal tissues were standardized to 1, and the fold-changes are plotted as means ± SD of three replicates. We did not observe consistent upregulated expression of miR-16 or miR-124 in either IBD or colorectal cancer samples compared with controls (Figure 4). In colorectal tissue samples from two CD and three UC patients, miR-124 was shown to be upregulated in two (one CD and one UC). The expression profiles were similar to those of miR-16/24 in colorectal cancer samples.
Figure 4.

Quantitative RT-PCR assays for miR-16 and miR-124 expression in colorectal cancer patients and miR-124 expression in IBD patients. The expression levels of normal tissues are standardized to 1, and the fold-changes are plotted as means ± SD of three replicates.

Quantitative RT-PCR assays for miR-16 and miR-124 expression in colorectal cancer patients and miR-124 expression in IBD patients. The expression levels of normal tissues are standardized to 1, and the fold-changes are plotted as means ± SD of three replicates.

Discussion

Screening results based on microarray analysis are usually unstable and less functionally related than those derived from other assays. In this study, we connected miRNAs with diseases to better determine their functional relationships. We found that hsa-miR-200 c was upregulated >47-fold in five oesophageal cancer samples compared with normal samples. We also identified the experimentally validated miRNA target ELMO2 as a potential functional link connecting hsa-miR-200 c and oesophageal cancer through a simple overlap analysis of significantly down-regulated genes in hsa-miR-200 c interference and oesophageal cancer samples. We observed only a slight increase or no change in miR-124 expression in IBD patient samples compared with controls. This is inconsistent with the findings of Koukos et al., who reported miR-124 downregulation in UC samples. This difference could be explained by individual variation in miRNA levels even among patients with the same disease, reflecting differences in development stage or genetic background. Alternatively, it could be caused by the small sample size in our study. Relationships between IBD, miR-16, and miR-124 have previously been reported,23,24 and the dysregulation of miR-16 has been observed in both CD and UC.[23,25] The decreased expression of miR-124 has also been suggested to play a role in promoting inflammation and the pathogenesis of UC through up-regulating signal transducer and activator of transcription 3.[23] Moreover, miR-335 has previously been shown to play a role in multiple myeloma,[26] while miR-210 was reported to be induced by hypoxia in breast cancer.[27] This agrees with our observation that half of the diseases associated with miR-210 in the present study were solid tumours, which are known to often be hypoxic.28,29 A large systematic collection of gene expression changes resulting from cellular exposure to perturbations such as small molecules and genetic variations would help the understanding of cellular pathways and the development of therapies and biomarkers. The Library of Integrated Network-Based Cellular Signatures project founded by the National Institutes of Health has therefore been established. This work should increase the power of simulated screening following the accumulation of expression change information under various perturbations. The main limitation of our study is that we only investigated a small number of miRNAs in association with disease. In the future, we plan to include more diseases and miRNAs and to increase the number of patient samples. We also aim to further explore the relationships between miRNA and diseases using in vivo studies.
  29 in total

Review 1.  The functions of animal microRNAs.

Authors:  Victor Ambros
Journal:  Nature       Date:  2004-09-16       Impact factor: 49.962

Review 2.  The diverse functions of microRNAs in animal development and disease.

Authors:  Wigard P Kloosterman; Ronald H A Plasterk
Journal:  Dev Cell       Date:  2006-10       Impact factor: 12.270

Review 3.  MicroRNAs in cancer: biomarkers, functions and therapy.

Authors:  Josie Hayes; Pier Paolo Peruzzi; Sean Lawler
Journal:  Trends Mol Med       Date:  2014-07-12       Impact factor: 11.951

4.  GeneExpressionSignature: an R package for discovering functional connections using gene expression signatures.

Authors:  Fei Li; Yang Cao; Lu Han; Xiuliang Cui; Dafei Xie; Shengqi Wang; Xiaochen Bo
Journal:  OMICS       Date:  2013-02

5.  ExpTreeDB: web-based query and visualization of manually annotated gene expression profiling experiments of human and mouse from GEO.

Authors:  Ming Ni; Fuqiang Ye; Juanjuan Zhu; Zongwei Li; Shuai Yang; Bite Yang; Lu Han; Yongge Wu; Ying Chen; Fei Li; Shengqi Wang; Xiaochen Bo
Journal:  Bioinformatics       Date:  2014-08-24       Impact factor: 6.937

6.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

Review 7.  MicroRNAs (miRNAs) in neurodegenerative diseases.

Authors:  Peter T Nelson; Wang-Xia Wang; Bernard W Rajeev
Journal:  Brain Pathol       Date:  2008-01       Impact factor: 6.508

8.  Identification of links between small molecules and miRNAs in human cancers based on transcriptional responses.

Authors:  Wei Jiang; Xiaowen Chen; Mingzhi Liao; Wei Li; Baofeng Lian; Lihong Wang; Fanlin Meng; Xinyi Liu; Xiujie Chen; Yan Jin; Xia Li
Journal:  Sci Rep       Date:  2012-02-21       Impact factor: 4.379

9.  MicroRNAs: new players in inflammatory bowel disease.

Authors:  R Kalla; N T Ventham; N A Kennedy
Journal:  Gut       Date:  2015-06       Impact factor: 23.059

10.  An integrative genomic approach reveals coordinated expression of intronic miR-335, miR-342, and miR-561 with deregulated host genes in multiple myeloma.

Authors:  Domenica Ronchetti; Marta Lionetti; Laura Mosca; Luca Agnelli; Adrian Andronache; Sonia Fabris; Giorgio Lambertenghi Deliliers; Antonino Neri
Journal:  BMC Med Genomics       Date:  2008-08-13       Impact factor: 3.063

View more
  1 in total

1.  A predicted risk score based on the expression of 16 autophagy-related genes for multiple myeloma survival.

Authors:  Fang-Xiao Zhu; Xiao-Tao Wang; Hui-Qiong Zeng; Zhi-Hua Yin; Zhi-Zhong Ye
Journal:  Oncol Lett       Date:  2019-09-19       Impact factor: 2.967

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.