Literature DB >> 26646696

Signature miRNAs in colorectal cancers were revealed using a bias reduction small RNA deep sequencing protocol.

Guihua Sun1, Ya-Wen Cheng2, Lily Lai3, Tsui-Chin Huang2, Jinhui Wang4, Xiwei Wu4, Yafan Wang1, Yasheng Huang1, Jinghan Wang1, Keqiang Zhang1, Shuya Hu1, Ji-Rui Yang2, Yun Yen1,2.   

Abstract

To explore the role of miRNAs in colorectal cancers (CRC), we have deep sequenced 48 pairs of frozen CRC samples, of which 44 pairs produced high quality sequencing data. By using a combined approach of our bias reduction small RNA (smRNA) deep sequencing protocol and Illumina small RNA TruSeq method for sample bar coding, we have obtained data from samples of relatively large size with bias reduced digital profile results. This novel approach allowed us to validate many previously published results using various techniques to profile miRNAs in CRC tissues or cell lines and to characterize 'true' miRNA signatures highly expressed in colon/rectum (CR) or CRC tissues. According to our results, miR-21, a miRNA that is up-regulated in CRC, and miR-143, a miRNA that is down-regulated in CRC, are the two miRNAs that dominated the miRNA population in CR tissues, and probably are also the most important miRNAs in CRCs. These two miRNAs, together with the other eight miRNAs, miR-148a, -194, -192, 200b, -200c, -10b, -26a, and -145, with descending expressing levels, constituted the top 10 highly expressed miRNAs in CR/CRC. Using TaqMan miRNA qPCR, we detected the relative expression of some of the CRC miRNAs in 10 CRC cell lines, validated their dysregulation under cancer condition, and provided possible explanation for their dysregulation, which could be caused by APC, KRAS, or TP53 mutations. We believe these results will provide a new direction in future miRNA-related CRC development studies, and application of miRNAs in CRC diagnosis/prognosis, and therapy.

Entities:  

Keywords:  colorectal cancer; miR-143; miR-21; microRNA; small RNA deep sequencing

Mesh:

Substances:

Year:  2016        PMID: 26646696      PMCID: PMC4826175          DOI: 10.18632/oncotarget.6460

Source DB:  PubMed          Journal:  Oncotarget        ISSN: 1949-2553


INTRODUCTION

CRC includes two major entities: colon cancer (CC) and rectal cancer (RC). It ranks the 3rd in all cancer cases worldwide and represents one of the cancers with high mortality [1]. CRC rates are much higher in populations consuming low fiber, high fat, and high protein diet, compared with populations consuming high fiber, low fat, and low protein diet [2]. It is conceivable that as more populations in the world are moving towards high standard of living, which allows consuming varieties of foods, more CRC cases will be inevitably increasing worldwide. One supporting evidence for the above conclusion is that CRC already becomes the number one cancer cases in some rich areas that allows consuming large amount of high protein, high fat foods. CRC can be cured if it is diagnosed early. But because of its location, it is not easily spotted early and currently is usually diagnosed through unpleasant colonoscopy and biopsy (most in advanced stages already). It is conceivable researches that address novel mechanisms in CRC development may be able to provide noninvasive, cost-effective approaches to diagnose CRC in early stages and provide new avenues for treatment. MicroRNA (miRNA) is a family of conserved small RNAs (smRNAs) that can regulate target gene expression through mediating mRNA degradation or translational repression. It was proposed that miRNAs will become new frontier players in cancer biology [3]. Reduced accumulation of specific miRNAs in colorectal neoplasia was reported early on [4]. The milestone miRNA-cancer paper that reported the profile of miRNAs in cancers to be correlated with the origin, progression, and metastasis of many cancers have attracted the attention of more cancer biologists in this new field. Since then, more and more miRNA-cancer studies have advanced basic cancer research, the development of biomarkers for diagnosis/prognosis, and the identification of miRNAs as targets for cancer therapy [5]. There are thousands of human miRNAs and their abundance is varied among different tissues [6, 7]. It is more practical to use a panel of miRNAs that are specifically expressed in certain cancer to study their role in carcinogenesis and to use them as biomarkers in cancer clinical practice. Therefore, it is critical to profile miRNAs that are highly or specifically expressed in certain cancer. While both microarray and deep sequencing are excellent platforms for global miRNA profiling, they are labor intensive, costly, have relatively low detecting range when compared to RT-qPCR, and the turnaround time is in days. In clinical setting, a small number of highly expressed miRNAs in the panel that can be detected by qRT-PCR within a few hours will be ideal. So the ideal approach is to screen a panel of miRNA using global miRNA profiling approaches, such as deep sequencing and microarray, then detect this panel of miRNAs by qRT-PCR that has high specificity, sensitivity, and fast turnaround time. To explore the role of miRNAs in CRCs, Northern blotting, miRNA microarray, miRNA qRT-PCR, and high-throughput small RNA sequencing methods have been applied in miRNA-CRC studies using CRC cell lines or patient samples in the past several years (See latest review [8]). Although results from these published studies are encouraging, they are limited by the techniques available, sample size, and the difficulty in sampling CRCs at different stages [8-10]. We are still in the early stage of collecting CRC-miRNA data and much more of this kind of data are necessary to advance this field and achieve our goal of using miRNAs in CRC diagnosis/prognosis, and therapy. To achieve this goal, CR/CRC miRNAs profiling will be the first step. Among all the miRNA profiling platforms, deep sequencing is the best technique to identify signature/marker miRNAs because they can detect thousands of annotated and novel miRNAs, as well as other small RNAs simultaneously. Here we report our finding of a pilot study on miRNA deep sequencing using 48 paired frozen biopsy CRC samples, of which 44 paired samples produced high sequencing quality data and were used for further analysis. This study allowed us to validate many previously published results. Moreover, by using a bias reduction protocol for smRNA deep sequencing, we were able to provide novel digital miRNA profile data of CRC samples. The bias reduced digital profile results allowed us to characterize a ‘true’ miRNA signature that is highly expressed in CR or CRC tissues. Using TaqMan miRNA qPCR, we detected the relative expression of some of the CRC miRNAs obtained from deep sequencing in 10 CRC cell lines, which validated their dysregulation under cancer condition. We also provide possible explanation for their dysregulation, which could be caused by APC, KRAS, or TP53 mutations. We believe these results will provide a new direction in future miRNA-CRC development studies, and the application of miRNAs in CRC diagnosis/prognosis, and therapy.

RESULTS

Deep sequencing data summary and signature miRNAs in CR or CRC

Today, the bias results in ligation-based deep sequencing gene profiling remains as one major technical problem for its application to quantity gene expression after its discovery and publication of our bias reduction workaround solution [11-13]. In the current study, CRC samples were sequenced using a combined approach of our bias reduction smRNA deep sequencing method with the protocol provided in the Illumina TRUE smRNA sequencing kit for both bias reduction and sample bar coding [11]. Total 48 paired samples were used for this pilot experiment (one and half runs, each run using twelve lanes on an Illumina HighSeq2000 machine with eight bar coded samples per lane). After data validation, we were able to get 46 paired samples (46 samples set) that produced high quality sequencing data. After filtering out low quality reads, we obtained about 7 to 10 M smRNA reads per sample for most of the samples (Figures 1a, S1, Table S1). Analysis of reads composition of smRNA fragments revealed that majority of them are mature miRNAs, ranging from 60 to 80 % for most of the samples (Figures 1b, S2). Total reads agreed well with mature miRNA reads in each sample (Figure 1c). Summarized composition of smRNAs in reads from all sequenced samples showed that miRNAs represent the largest group, comprising of 72% of the total population. The second largest population is tRNA-derived smRNAs, which consist of 16% of the total population (Figure 1d). This data showed that we have high quality RNA samples and small RNA libraries, and our smRNA deep sequencing produced highly reliable data. Next, we performed miRNA qRT-PCR for five miRNAs to validate our smRNA deep sequencing results. The qRT-PCR data agrees well with the deep sequencing reads (Figure 1e, 1f). Two pairs of samples were later reclassified as none cancer samples by pathologist and were removed from further analysis (44 samples set in Table 1; most data analysis refers to this set hereafter unless specified).
Figure 1

Summary of smRNA deep sequencing reads

a. Histogram of smRNA deep sequencing reads for all samples. Majority of samples have smRNA reads between 7 to 10 millions. b. Histogram of miRNA percentage for all samples. Most of the samples have miRNA percentage ranging from 60% to 80%. c. Paired x-y plot of total smRNA reads per samples versus % of miRNA population per sample. d. Summary of smRNA composition for all samples. e. Box plot for log transformed T to N ratio of deep sequencing reads of miR-21, -31, -135b, -143, and -484 in the 44 pairs sample set. f. Box plot for ΔΔCt value of miRNA qRT-PCR detected miR-21, -31, -135b, -143, and -484 in the 44 pairs sample set.

Table 1

Summary information of CRC samples used in this study

SamplesAllColonRectal
SizeNormal442915
Tumor442915
Stage0110
1752
2752
3332211
4110
SexMale1376
Female31238
RaceCaucasian321913
African431
Asian761
Unknown110
Age30-39211
40-49211
50-5916106
60-69972
70-791064
80-89541

Summary of smRNA deep sequencing reads

a. Histogram of smRNA deep sequencing reads for all samples. Majority of samples have smRNA reads between 7 to 10 millions. b. Histogram of miRNA percentage for all samples. Most of the samples have miRNA percentage ranging from 60% to 80%. c. Paired x-y plot of total smRNA reads per samples versus % of miRNA population per sample. d. Summary of smRNA composition for all samples. e. Box plot for log transformed T to N ratio of deep sequencing reads of miR-21, -31, -135b, -143, and -484 in the 44 pairs sample set. f. Box plot for ΔΔCt value of miRNA qRT-PCR detected miR-21, -31, -135b, -143, and -484 in the 44 pairs sample set. Our bias reduction approach showed miR-21 and miR-143 together represent over 60% of all miRNAs, and the rest of the eight miRNAs from the top 10 highly expressed miRNAs covers 17% of total miRNAs (Figure 2a). These data implied CR tissues are mainly controlled by miR-21 and miR-143. While miR-21 is up-regulated, miR-143 is down-regulated in majority of the tumor samples, which results as the rank of their abundance was switched in tumor samples versus normal samples (Figure 2b). Only the rank of miR-148a remains unchanged (by %, Tumor versus Normal) in the top 10 miRNAs that are highly expressed in CR/CRC (Tables 2, S2).
Figure 2

Highly expressed miRNAs in CR/CRC

a. Box plot of top 25 miRNAs by total reads in 44 pairs sample set. b. Circular plot of top 10 miRNA in normal tissue versus tumor tissue (44 pairs sample set).

Table 2

Top 25 miRNAs in samples by reads or percentage

By Reads ( in 1000, Sorted by All)By % in all reads (Sorted by All)
RankmiRNANormalTumorAllp-valuemiRNANormalTumorAll
11431831.291058.941445.110.002814338.1321.7229.86
221817.001993.571405.280.00002117.0140.8929.04
3148a232.12233.52232.820.8591148a4.834.794.81
4194165.30103.07134.180.00481943.442.112.77
5192146.8696.18121.520.00751923.061.972.51
6200b110.35109.81110.080.6898200b2.302.252.27
7200c80.2872.6976.490.465810b1.931.001.46
810b92.6448.6270.630.000026a1.881.041.46
926a90.0850.8570.460.00001451.700.671.18
1014581.4632.8257.140.0495200c1.671.491.58
1110a55.0155.6855.340.886010a1.151.141.14
1210153.5741.3347.450.00381011.120.850.98
13let-7f38.8635.0836.970.0774let-7f0.810.720.76
1420a23.4546.1834.820.003210.760.320.54
1527b34.5026.9530.730.000227b0.720.550.64
16200a30.0029.1829.590.6801let-7g0.670.490.58
1723a27.6828.9528.310.64552150.660.200.43
18let-7g32.1923.9328.060.00013780.640.250.44
19199a-3p24.3229.8527.090.1416200a0.620.600.61
20199b-3p24.3229.8527.090.141630d0.590.410.50
21136.3315.6826.000.000123a0.580.590.59
2230d28.3620.1224.240.0023let-7a0.540.460.50
23let-7a25.7122.3724.040.0439199a-3p0.510.610.56
24146b-5p20.2827.0023.640.1105199b-3p0.510.610.56
2537830.5412.2521.400.000020a0.490.950.72

Highly expressed miRNAs in CR/CRC

a. Box plot of top 25 miRNAs by total reads in 44 pairs sample set. b. Circular plot of top 10 miRNA in normal tissue versus tumor tissue (44 pairs sample set).

Compare published profile data with results in the current study

The advantage for miRNA deep sequencing is that the status (up/down in expression level) of miRNA dysregulation can be directly linked with the expression level of miRNAs in all CR/CRC miRNAs and the result can be digitalized. We compared our profile data with published CRC-miRNA results (Data was summarized in reference [8-10]). These top 10 miRNAs (by %) we identified in this study are also among the highly expressed miRNAs in CR/CRC reported before albeit in a different rank [14-16]. Overall, the status of miRNA dysregulation in published data agreed well with the results in our study (Tables 3, S3). However, several of dysregulated miRNAs that were reported previously are expressed at very low level according to our data, with only few reads per sample on average. This comparison showed the limitation of previous profiling methods. Therefore, some of the previously reported miRNAs may not play a critical physiological role in CR/CRC and will be difficult for detection and clinical applications due to their low abundance (Table 3). Furthermore, many of the reported dysregulated miRNAs are up/down less than 50%, implying that they will not be good candidates to be used as biomarkers (Table S3).
Table 3

Dysregulated CRC miRNAs: published list versus this study (changes by > 50%)

RankmiRNANormalTumorAll (T+N)RefThisT/N ratio
161135b94733414UpUp7.80
596663143DownUp4.00
149224239815527UpUp3.41
10118366721741421UpUp3.26
79182164151973419UpUp3.17
22181700119935671405284UpUp2.44
2289676168122UpUp2.21
3417108822355317218UpUp2.16
39780421718512614UpUp2.14
1420a234524618334818UpUp1.97
532036086101858136UpUp1.67
93106a151323221918UpUp1.53
519214686396181121522DownDown0.65
7399b548635494518DownDown0.65
37125a-5p170521079013921DownDown0.63
4194165296103069134183DownDown0.62
5230c1111766258871DownDown0.60
84195365421232889DownDown0.58
1143183128610589431445115DownDown0.58
80133a421823403279DownDown0.55
4245115459804811754DownDown0.52
11529c14497541102DownDown0.52
4330a15825757511700DownDown0.48
63375785633995628DownDown0.43
211363291568026005DownDown0.43
423129-3p17712DownDown0.41
10145814573281657137DownDown0.40
25378305421225221397DownDown0.40
411139-3p20814DownDown0.40
1479765294530DownDown0.38
318135a602141UpDown0.35
2621531876954220709DownDown0.30
41012423514DownDown0.22
37813731619DownDown0.19

The power of paired samples and miRNAs pool to distinguish tumor tissues from normal tissues

There are hundreds of miRNAs detected in our samples and the variation of reads is large, it is unlikely all of them can be used to classify tumor versus normal tissues. Unsupervised hierarchical cluster analysis using reads count of all miRNAs to distinguish tumor versus normal tissues failed to give a clear results. Instead, normal tissues were mainly clustered into two groups (Figure S3). To reduce the variation among samples, we applied the power of paired sample by using tumor to normal reads count ratio (TN ratio) in each paired sample and selected a list of miRNAs using the criteria of p-value < 0.05. Using this approach, we were able to differentiate most tumor samples from normal samples and these differentially expressed miRNAs are classified into two groups that are up-regulated or down-regulated in tumor samples, compared with normal samples (Figure 3).
Figure 3

Hierarchical cluster analysis classifies tumor versus normal tissues

Dendrogram of miRNAs with p<0.05 cluster normal tissue versus tumor tissue (44 pairs sample set). miRNAs are clustered into two groups that are either correlated with normal tissue or tumor tissue. There are several cases of tumor tissues are classified into normal tissues.

Hierarchical cluster analysis classifies tumor versus normal tissues

Dendrogram of miRNAs with p<0.05 cluster normal tissue versus tumor tissue (44 pairs sample set). miRNAs are clustered into two groups that are either correlated with normal tissue or tumor tissue. There are several cases of tumor tissues are classified into normal tissues.

Detect the CRC miRNAs in CRC cell lines

Cancer cell lines have advantages over tissues for many cancer-related studies and research, such as drug testing, to study cancer causes by gene mutations, and using tumor mouse model for cancer studies. Therefore, we detected some CRC miRNAs from our deep sequencing results in 10 CRC cell lines using miRNA Taqman qPCR and correlated their expression status with APC, KRAS and TP53 mutations. Expression levels were measured by relative –delta Ct value (miRNA to U6sn). The expression level of miR-21, miR-200 family, miR-194, miR -192, miR-148a, and miR-26a, was relatively high, consistent with their high expression in CRC tissues; the expression level of miR-143-3p, miR-10b, and miR-145 was relative low, agreeing with their down-regulation in CRC tissues. Heatmap analysis revealed that mutation in either TP53 (HCT116 and LoVo are wild type) or APC (HCT116 is wild type) may be responsible for the down-regulated miRNAs (Figures 4, S4, S5). These miRNAs may also be used to separate k-ras mutation from wt (HT-29 and WiDr are wild type) (Figures 4, S6). These data provide the opportunity to manipulate these miRNAs in CRC cell lines for loss or gain of miRNA function studies using cell lines and tumor mouse model.
Figure 4

Relative expression level of miRNAs in CRC cell lines

Total of 18 miRNAs expression were measured by Taqman miRNA qPCR. ΔCt value of each miRNA was calculated as average value of ΔCtmiRNA = Ctcell line-miRNA − Ctcell line-U6sn from three independent reactions with duplicates in each reaction, and −ΔCtmiRNA values were used for hierarchical clustering.

Relative expression level of miRNAs in CRC cell lines

Total of 18 miRNAs expression were measured by Taqman miRNA qPCR. ΔCt value of each miRNA was calculated as average value of ΔCtmiRNA = Ctcell line-miRNA − Ctcell line-U6sn from three independent reactions with duplicates in each reaction, and −ΔCtmiRNA values were used for hierarchical clustering.

Functional enrichment analysis of miR-21, miR-200c, and miR-26a target genes and pathways

To explore the possible role of oncogenic miRNAs in CRC, we performed enrichment analysis of experimentally validated target genes of miRNAs in oncogenic signatures and biological pathways listed in MSigDB. We listed the top 5 enriched terms in each databases in Table 4. The results demonstrated that miR-21-regulated genes are involved in RAS/RAF activation, including mitogen-activated signaling and TBK1-regulated pathways, miR-200 family-targeted genes are suppressed by VEGF signaling, and miR-26a-modulated genes are correlated with cell cycle progression.
Table 4

Top 5 gene sets in oncogenic signature, Reactome, and Biocarta overlapped with experimental validated miR target genes

miRGene Set Name# Genes in Gene Set (K)# Genes in Overlap (k)k/Kp-valueaFDR q-valueb
miR-21TBK1.DF_DN287320.11152.41E-234.56E-21
P53_DN.V1_UP194180.09282.51E-121.58E-10
RAF_UP.V1_DN194180.09282.51E-121.58E-10
TBK1.DF_UP290190.06552.66E-101.26E-08
PIGF_UP.V1_UP191150.07851.74E-096.23E-08
REACTOME_METABOLISM_OF_LIPIDS_AND_LIPOPROTEINS478220.0467.82E-092.72E-06
REACTOME_DEVELOPMENTAL_BIOLOGY396200.05058.06E-092.72E-06
REACTOME_FATTY_ACID_TRIACYLGLYCEROL_AND_KETONE_BODY_METABOLISM168120.07142.09E-074.69E-05
REACTOME_SIGNALING_BY_TGF_BETA_RECEPTOR_COMPLEX6380.1272.85E-074.80E-05
REACTOME_HEMOSTASIS466190.04085.23E-075.92E-05
BIOCARTA_MAPK_PATHWAY87130.14946.50E-121.41E-09
BIOCARTA_HIVNEF_PATHWAY58100.17244.16E-104.51E-08
BIOCARTA_KERATINOCYTE_PATHWAY4690.19579.62E-106.96E-08
BIOCARTA_CTCF_PATHWAY2370.30432.53E-091.37E-07
BIOCARTA_ALK_PATHWAY3780.21623.59E-091.56E-07
miR-200cTBK1.DF_DN28740.01392.53E-042.71E-02
BMI1_DN_MEL18_DN.V1_UP14530.02075.10E-042.71E-02
ESC_V6.5_UP_LATE.V1_DN18630.01611.05E-032.71E-02
P53_DN.V1_DN19230.01561.15E-032.71E-02
VEGF_A_UP.V1_DN19330.01551.17E-032.71E-02
REACTOME_INTRINSIC_PATHWAY_FOR_APOPTOSIS3030.14.53E-063.05E-03
REACTOME_HEMOSTASIS46660.01291.02E-053.43E-03
REACTOME_VEGF_LIGAND_RECEPTOR_INTERACTIONS1020.24.98E-059.68E-03
REACTOME_RIG_I_MDA5_MEDIATED_INDUCTION_OF_IFN_ALPHA_BETA_PATHWAYS7330.04116.72E-059.68E-03
REACTOME_PLATELET_ACTIVATION_SIGNALING_AND_AGGREGATION20840.01927.36E-059.68E-03
BIOCARTA_VEGF_PATHWAY2930.10344.08E-068.86E-04
BIOCARTA_AKAP13_PATHWAY1220.16677.30E-055.97E-03
BIOCARTA_HIF_PATHWAY1520.13331.16E-045.97E-03
BIOCARTA_RELA_PATHWAY1620.1251.32E-045.97E-03
BIOCARTA_IL7_PATHWAY1720.11761.50E-045.97E-03
miR-26aRB_DN.V1_UP13780.05846.20E-081.17E-05
E2F1_UP.V1_UP18970.0378.79E-065.73E-04
CYCLIN_D1_KE_.V1_UP19070.03689.09E-065.73E-04
CAMP_UP.V1_UP20070.0351.27E-055.99E-04
MTOR_UP.N4.V1_UP19660.03061.12E-044.24E-03
REACTOME_CELL_CYCLE421180.04285.79E-143.90E-11
REACTOME_CELL_CYCLE_MITOTIC325140.04313.62E-111.22E-08
REACTOME_MITOTIC_G1_G1_S_PHASES137100.0731.50E-103.36E-08
REACTOME_G1_S_TRANSITION11280.07141.28E-082.15E-06
REACTOME_CYCLIN_E_ASSOCIATED_EVENTS_DURING_G1_S_TRANSITION6560.09231.92E-072.48E-05
BIOCARTA_G1_PATHWAY2850.17866.92E-081.50E-05
BIOCARTA_CELLCYCLE_PATHWAY2340.17391.70E-061.23E-04
BIOCARTA_CTCF_PATHWAY2340.17391.70E-061.23E-04
BIOCARTA_WNT_PATHWAY2640.15382.84E-061.54E-04
BIOCARTA_ALK_PATHWAY3740.10811.22E-055.28E-04

P-value from the hypergeometric distribution for (k-1, K, N - K, n) where k is the number of genes in the intersection of the query set with a set from MSigDB, K is the number of genes in the set from MSigDB, N is the total number of all known human gene symbols, and n is the number of genes in the query set.

False discovery rate analog of hypergeometric p-value after correction for multiple hypothesis testing according to Benjamini and Hochberg.

P-value from the hypergeometric distribution for (k-1, K, N - K, n) where k is the number of genes in the intersection of the query set with a set from MSigDB, K is the number of genes in the set from MSigDB, N is the total number of all known human gene symbols, and n is the number of genes in the query set. False discovery rate analog of hypergeometric p-value after correction for multiple hypothesis testing according to Benjamini and Hochberg. Furthermore, we analyzed the co-expressed genes and miRNAs in CRC patients from an independent dataset in terms of oncogenic signature and signaling transduction pathway using gene set enrichment analysis (GSEA, Table 5). Of interest, our results revealed that genes involving EGFR, TBK1, KRAS signaling, as well as cell surface interactions at the vascular wall were positively correlated with miR-21 expression (Figure 5), supporting its oncogenic roles with integrated oncogenic gene expression in CRC progression.
Table 5

Top 3 enriched terms of miR correlated gene expression in oncogenic signatures and signaling pathway database Reactome

miRNameSizeESNESp-valFDR q-val
miR-21Oncogenic signature
EGFR_UP.V1_UP*1770.5781.9350.0020.061
TBK1.DN.48HRS_DN*490.5191.6660.0240.210
KRAS.DF.V1_UP*1740.4441.6630.0290.180
YAP1_DN40−0.480−1.7410.0060.448
KRAS.600_UP.V1_DN266−0.379−1.4690.0481.000
KRAS.AMP.LUNG_UP.V1_UP128−0.418−1.4620.0661.000
Pathway
REACTOME_CELL_SURFACE_INTERACTIONS_AT_THE_VASCULAR_WALL*840.5341.9210.0020.247
REACTOME_IL1_SIGNALING320.6281.8640.0020.275
REACTOME_SIGNALING_BY_ILS980.5051.7480.0150.785
REACTOME_DEFENSINS*26−0.636−1.9130.0040.242
REACTOME_ACYL_CHAIN_REMODELLING_OF_PC16−0.604−1.7690.0110.690
REACTOME_ACYL_CHAIN_REMODELLING_OF_PE17−0.594−1.7680.0100.467
miR-200cOncogenic signature
JAK2_DN.V1_DN1320.4101.5180.0791.000
SNF5_DN.V1_DN1460.3321.4080.0571.000
CAHOY_OLIGODENDROCUTIC890.2911.2160.1761.000
RPS14_DN.V1_UP179−0.528−1.6190.0671.000
SNF5_DN.V1_UP163−0.440−1.6120.0481.000
KRAS.LUNG.BREAST_UP.V1_UP134−0.389−1.4200.0921.000
Pathway
REACTOME_CHOLESTEROL_BIOSYNTHESIS210.6881.5450.0761.000
REACTOME_TRANSPORT_OF_VITAMINS_NUCLEOSIDES_AND_RELATED_MOLECULES310.4101.4540.0301.000
REACTOME_BIOLOGICAL_OXIDATIONS1200.3231.3140.1191.000
REACTOME_GPVI_MEDIATED_ACTIVATION_CASCADE30−0.593−1.8070.0041.000
REACTOME_GENERATION_OF_SECOND_MESSENGER_MOLECULES24−0.754−1.8010.0140.780
REACTOME_COSTIMULATION_BY_THE_CD28_FAMILY57−0.552−1.7910.0120.576
miR-26aOncogenic signature
TBK1.DN.48HRS_DN*490.6021.9200.0000.056
BCAT_BILD_ET_AL_UP440.4651.5200.0461.000
BCAT_GDS748_DN400.4111.5110.0201.000
KRAS.AMP.LUNG_UP.V1_UP128−0.434−1.5360.0351.000
KRAS.LUNG.BREAST_UP.V1_UP134−0.394−1.5260.0551.000
KRAS.600.LUNG.BREAST_UP.V1_UP260−0.376−1.5100.0480.809
Pathway
REACTOME_TRANSCRIPTIONAL_REGULATION_OF_WHITE_ADIPOCYTE_DIFFERENTIATION530.4781.7500.0061.000
REACTOME_ARMS_MEDIATED_ACTIVATION150.6501.6880.0061.000
REACTOME_PROLONGED_ERK_ACTIVATION_EVENTS170.6101.6600.0131.000
REACTOME_OLFACTORY_SIGNALING_PATHWAY78−0.636−1.8160.0120.507
REACTOME_LIGAND_GATED_ION_CHANNEL_TRANSPORT21−0.661−1.6910.0220.979
REACTOME_DEFENSINS26−0.575−1.6470.0180.946

Bold indicates significantly enriched terms with FDR less than 0.25.

Figure 5

miR-21 expression is positive correlated with cancer-associated pathways

Top ranked enriched terms of gene set enrichment analysis (GSEA) on miR-21 correlated gene expression. a. Oncogenic signature (NES = 1.935, FDR = 0.061). b. The expression levels of leading edge genes in oncogenic signature and miR-21 with descending levels from red to blue and orange to green, respectively. c. Reactome pathway database (NES = 1.921, FDR = 0.247). d. The expression levels of leading edge genes in reactome pathway database and miR-21 with descending levels from red to blue and orange to green, respectively.

Bold indicates significantly enriched terms with FDR less than 0.25.

miR-21 expression is positive correlated with cancer-associated pathways

Top ranked enriched terms of gene set enrichment analysis (GSEA) on miR-21 correlated gene expression. a. Oncogenic signature (NES = 1.935, FDR = 0.061). b. The expression levels of leading edge genes in oncogenic signature and miR-21 with descending levels from red to blue and orange to green, respectively. c. Reactome pathway database (NES = 1.921, FDR = 0.247). d. The expression levels of leading edge genes in reactome pathway database and miR-21 with descending levels from red to blue and orange to green, respectively.

DISCUSSION

The role of miRNAs in CRC has attracted attention from researchers as well from physicians. MiRNAs in CRC tissues may correlate with it disease progression and disease stages and serve as biomarkers for diagnosis, prognosis, or suitable targets for treatments. Because miRNAs can also be transported through exosome and circulate exosomally, miRNAs have become very attractive noninvasive diagnosis biomarkers for cancers [17, 18]. In the past 10 years, many data have been generated using a variety of techniques and CRC samples [8-10]. Among all the techniques, deep sequencing is the most advanced and cost effective method to deal thousands of miRNAs at the same time, the ability to detect samples in a high dynamic range, and the potential to provide digital signature of miRNAs in profiled samples. It can also simultaneously detect novel miRNAs, others smRNAs, such as tRNA-derived smRNA fragments. But, due to the technical and bioinformatics challenges, and the high cost per run, only a few publications have employed deep sequencing technology in CRC-miRNA studies [14, 16]. In the current report, we performed a pilot study using deep sequencing technology to profile smRNAs from paired frozen CRC tissue samples. We also applied sample bar-coding to reduce cost and increase efficiency. Compared with published results, we get more reads (about 5 fold) and may have produced a list of ‘true’ signature miRNAs in CR/CRC. These signature miRNAs are critical candidate miRNAs that can be used in future miRNAs-CRC studies, CRC diagnosis/prognosis, and therapy. Their high expression level supports the notion that they could be biologically significant targets. The high level expression of these miRNAs may also reduce the technique challenge for detecting them to facilitate their application in clinical setting. According to our results, miR-21, a miRNA that is up-regulated in CRC, and miR-143, a miRNA that is down-regulated in CRC, are the two miRNAs that dominated the miRNA population in CR tissue, and probably are also the most important miRNAs in CRC. The next eight miRNAs, from high to low in relative expression level, are miR-148a, -194, -192, 200b, -200c, -10b, -26a, and -145, which constituted the top 10 highly expressed miRNAs in CR/CRC. According to the changes in expression level, it seems miR-21 could act as an oncogene and miR-143 could serve a tumor suppressor role, but their exactly roles need to be defined by identifying their bona fide targets that can play direct physiology roles in CRC, and the cause of their dysregulation. The function of miR-21 has been well established because it is up-regulated in nearly all types of cancers and target tumor suppressor genes PTEN and PDCD4 [19-21]. The role of miR-143 in CRC is complicated despite many earlier reports indicating that miR-143 is a critical CRC miRNA [22-28], a recent careful study put this into question [29]. This previous study demonstrated that miR-143 and miR-145 are highly expressed in mesenchymal cells and are undetectable in colonic epithelial cells by various techniques, including the same deep sequencing platform we have used. Therefore, the role of miR-143 in CRC may be questionable. This result reminds us of a limitation of our study: the heterogeneity of tissue samples used in our study, especially the adjacent normal tissues that were resected during surgery. This may also have reflected in our clustering data (Figure S3). The current study can be further refined by detecting the list of identified candidate miRNAs by miRNA qRT-PCR using RNA samples from FFPE samples. It also needs to be noted that some miRNAs with relatively low expression level, such as miR-31, miR-9, miR-135b, have been shown to play critical roles in CRC by several previous reports [22, 27, 30-35]. It is possible that these miRNAs may not need high expression level to be physiologically relevant. Alternatively, it could be technical difficulties that have prevented the detection of these miRNAs by ligation-based deep sequencing technology. One example is miR-31, which was detected as a low abundant miRNA using deep sequencing but can be detected relatively easy by qRT-PCR in this study and another independent study [16]. CRCs origins in the lining of the bowel and can invade the muscle layers underneath, and then grow through the bowel wall, and some will eventually develop into cancer. Our results showed that some muscle-specific-miRNAs (miR-1, miR-133a, ranked #21 and 80 respectively, Table S2) were detected to be highly expressed in CRC and could be involved in the progress of CRCs. Another limitation of this study is that most of our samples are at stages 3, therefore it is impossible to draw a conclusion on how their expressions are related to CRC progression. However, by comparing the normal tissues to CRC tissues, our data support the conclusion that dysregulation of miRNA expression could contribute to diseases development. More stage 1, 2, and 4 samples will help to correlate miRNA expression to CRC development process and it can provide physicians treatment options in addition to tradition method. Our Taqman miRNA qPCR expression data in CRC cell lines validated the some CRC miRNAs identified by deep sequencing. It will provide a basis for functional studies of miRNAs in CRC in the future.

MATERIALS AND METHODS

Patient cohort and samples collection

Paired samples (CRC and adjunct normal tissues) were collected during surgery at City of Hope using IRB protocol #COH05130. The biopsies were immediate frozen in liquid nitrogen after surgery and stored at −80°C until RNA isolation. Total 48 pairs of samples (CRC and adjunct region normal tissues) were used (Table 1). CRCs were staged according to American Joint Committee on Cancer (AJCC) staging criteria. Samples are most at stage 3, with age from 50 to 80, and ratio of colon to rectal about 2:1, and female to male ratio about 3:1.

RNA isolation

Total RNA was isolated from frozen tissues or cell lines using Trizol (Life Technologies, Carlsbad, CA). RNA quality was checked and was quantified using a Nanodrop and an Agilent Bioanalyzer.

Small RNA deep sequencing

One μg of total RNA was used to construct small RNA deep sequencing libraries as described in our previous publication [11] with the following modifications. Briefly, we mixed 64 equal molar oligos (adding three nt to the 3′ end of Illumina default 5′ adaptor) to produce a mixture of small RNA library 5′ ligation adaptor. We chose eight 3′ adaptors from a panel of 3′ adaptor in smRNA TruSeq kit (Illumina, San Diego, CA) as 3′ ligation adaptor and also bar-coding samples. Each lane was loaded with eight bar-coded samples on a HiSeq2000 (Illumina, San Diego, CA) machine using all eight lanes for the first run (64 samples) and four lanes for the second run (32 samples).

Deep sequencing data analysis

Deep sequencing data analysis was performed as previously reported [11] with the following modification: 1) Reads were aligned to human genome hg19; 2) The mapping table was created using the human miRNA mature sequences from miRBase release16 and aligned back to human hg19 genome afterward [11]; 3) MiRNAs with less than 10 reads per sample in both normal and tumor samples are removed for further data analyzed. Deep sequencing data were analyzed, summarized, and plotted using R or Excel. Partek genomic suite (Partek Incorporated, St. Louis, Missouri) was used for hierarchical cluster analysis.

SmRNA qRT-PCR for tissue samples

We followed the S-Poly(T) smRNA qRT-PCR detection protocol as previously reported [36]. Briefly, 100 ng total RNA was poly-A tailed using the poly-A tailing kit from Epicentre (Madison, WI). U47 snoRNA was chosen as RNA sample control. ΔΔCt value of each paired samples was calculated as (CtTumor-miRNA-CtTumor-U47) – (CtNormal-miRNA-CtNormal-U47).

Taqman miRNA qPCR for cell lines

Taqman miRNA assay kits were purchased from Life Technologies (Grand Island, New York, USA). We followed the protocol from the manufacture. U6 snRNA was used as RNA sample control. ΔCtmiRNA values were used as their relative expression to U6 snRNA for data analysis.

Functional gene set enrichment analysis

For miRNA-targeted genes, we obtained experimentally validated miRNA-target gene pairs (474, 49, and 175 experimentally validated target genes of miR-21-5p, miR-200c-3p, and miR-26a-5p, respectively) from miRTarBase [37] and assigned them as the query set for oncogenic signatures (C6), Reactome (CP:REACTOME of C2), and Biocarta (CP:BIOCARTA of C2) in MSigDB [38]. There are 189 and 674 gene sets collection in the oncogenic, and the Reactome category, respectively. For overlaps analysis, we used hypergeometric test to estimate the probability that the number of miR targeted genes overlapped with the genes from a given collection from the number of input genes randomly selected genes in 45956 human genes. For instance, the top term of miR-21 in oncogenic signature results in a probability of 4.56E-21 to draw 32 TBK1.DF_DN-associated genes or more from 474 randomly selected genes in the list. Significance is indicated by p-value from the hypergeometric test and adjusted by false discovery rate (FDR) after correction for multiple hypothesis testing according to Benjamini and Hochberg. For gene set enrichment analysis (GSEA), gene expression dataset was downloaded from GEO (GSE29623 [39]) with corresponding miRNA expression profiles. We permutated phenotype labels 1000 times and performed GSEA on the permutated data to obtain a random ES distribution. For the GSEA test, a p-value is calculated on the original data, and the resulting enrichment score is compared to the distribution of the values obtained from the permuted data. Distinct GSEA on miR-21, miR-26a, and miR-200c expression levels were performed on gene set in oncogenic signatures and Reactome using Pearson matrix. FDR < 0.25 was used to define significant enriched gene sets as suggested by GSEA documentation [40].
  40 in total

Review 1.  Exosomes: a role for naturally occurring nanovesicles in cancer growth, diagnosis and treatment.

Authors:  Akhil Srivastava; Justyna Filant; Katherine M Moxley; Anil Sood; Scott McMeekin; Rajagopal Ramesh
Journal:  Curr Gene Ther       Date:  2015       Impact factor: 4.391

2.  The expression and clinical significance of microRNAs in colorectal cancer detecting.

Authors:  Xiaoyu Yang; Jiateng Zhong; Yinghua Ji; Jinsong Li; Yu Jian; Jinghang Zhang; Wancai Yang
Journal:  Tumour Biol       Date:  2014-12-06

Review 3.  MicroRNAs in colorectal cancer: small molecules with big functions.

Authors:  Yu Xuan; Huiliang Yang; Linjie Zhao; Wayne Bond Lau; Bonnie Lau; Ning Ren; Yuehong Hu; Tao Yi; Xia Zhao; Shengtao Zhou; Yuquan Wei
Journal:  Cancer Lett       Date:  2014-12-15       Impact factor: 8.679

4.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

5.  Hsa-miR-31-3p expression is linked to progression-free survival in patients with KRAS wild-type metastatic colorectal cancer treated with anti-EGFR therapy.

Authors:  Gilles Manceau; Sandrine Imbeaud; Raphaële Thiébaut; François Liébaert; Karine Fontaine; Francis Rousseau; Bérengère Génin; Delphine Le Corre; Audrey Didelot; Marc Vincent; Jean-Baptiste Bachet; Benoist Chibaudel; Olivier Bouché; Bruno Landi; Frédéric Bibeau; Karen Leroy; Frédérique Penault-Llorca; Jean-Luc Van Laethem; Pieter Demetter; Sabine Tejpar; Simona Rossi; Neda Mosakhani; Pia Osterlund; Raija Ristamäki; Virinder Sarhadi; Sakari Knuutila; Valérie Boige; Thierry André; Pierre Laurent-Puig
Journal:  Clin Cancer Res       Date:  2014-04-25       Impact factor: 12.531

6.  Role of anti-oncomirs miR-143 and -145 in human colorectal tumors.

Authors:  Y Akao; Y Nakagawa; I Hirata; A Iio; T Itoh; K Kojima; R Nakashima; Y Kitade; T Naoe
Journal:  Cancer Gene Ther       Date:  2010-01-22       Impact factor: 5.987

7.  Reduced accumulation of specific microRNAs in colorectal neoplasia.

Authors:  Michael Z Michael; Susan M O' Connor; Nicholas G van Holst Pellekaan; Graeme P Young; Robert J James
Journal:  Mol Cancer Res       Date:  2003-10       Impact factor: 5.852

8.  Altered expression of miR-21, miR-31, miR-143 and miR-145 is related to clinicopathologic features of colorectal cancer.

Authors:  O Slaby; M Svoboda; P Fabian; T Smerdova; D Knoflickova; M Bednarikova; R Nenutil; R Vyzula
Journal:  Oncology       Date:  2008-01-15       Impact factor: 2.935

9.  Identification of microRNA-135b in stool as a potential noninvasive biomarker for colorectal cancer and adenoma.

Authors:  Chung Wah Wu; Siew Chien Ng; Yujuan Dong; Linwei Tian; Simon Siu Man Ng; Wing Wa Leung; Wai Tak Law; Tung On Yau; Francis Ka Leung Chan; Joseph Jao Yiu Sung; Jun Yu
Journal:  Clin Cancer Res       Date:  2014-04-01       Impact factor: 12.531

10.  High-throughput sequencing to identify miRNA biomarkers in colorectal cancer patients.

Authors:  Yan Sun; Lin Wang; Sheng-Chao Guo; Xiao-Bing Wu; Xue-Hu Xu
Journal:  Oncol Lett       Date:  2014-06-03       Impact factor: 2.967

View more
  11 in total

1.  Digital analysis and epigenetic regulation of the signature of rejection in colorectal cancer.

Authors:  Viktor H Koelzer; Lena Sokol; Stefan Zahnd; Lucine Christe; Heather Dawson; Martin D Berger; Daniel Inderbitzin; Inti Zlobec; Alessandro Lugli
Journal:  Oncoimmunology       Date:  2017-02-06       Impact factor: 8.110

Review 2.  Exosomal microRNA Biomarkers: Emerging Frontiers in Colorectal and Other Human Cancers.

Authors:  Oscar A Tovar-Camargo; Shusuke Toden; Ajay Goel
Journal:  Expert Rev Mol Diagn       Date:  2016-03-16       Impact factor: 5.225

3.  Plasma and saliva miR-21 expression in colorectal cancer patients.

Authors:  A A Sazanov; E V Kiselyova; A A Zakharenko; M N Romanov; M I Zaraysky
Journal:  J Appl Genet       Date:  2016-12-02       Impact factor: 3.240

4.  A comprehensive framework for analysis of microRNA sequencing data in metastatic colorectal cancer.

Authors:  Eirik Høye; Bastian Fromm; Paul H M Böttger; Diana Domanska; Annette Torgunrud; Christin Lund-Andersen; Torveig Weum Abrahamsen; Åsmund Avdem Fretland; Vegar J Dagenborg; Susanne Lorenz; Bjørn Edwin; Eivind Hovig; Kjersti Flatmark
Journal:  NAR Cancer       Date:  2022-01-14

Review 5.  High throughput RNA sequencing utility for diagnosis and prognosis in colon diseases.

Authors:  Mamie Gao; Allen Zhong; Neil Patel; Chiraag Alur; Dinesh Vyas
Journal:  World J Gastroenterol       Date:  2017-04-28       Impact factor: 5.742

6.  SmithRNAs: Could Mitochondria "Bend" Nuclear Regulation?

Authors:  Andrea Pozzi; Federico Plazzi; Liliana Milani; Fabrizio Ghiselli; Marco Passamonti
Journal:  Mol Biol Evol       Date:  2017-08-01       Impact factor: 16.240

Review 7.  Epigenetics of colorectal cancer: biomarker and therapeutic potential.

Authors:  Gerhard Jung; Eva Hernández-Illán; Leticia Moreira; Francesc Balaguer; Ajay Goel
Journal:  Nat Rev Gastroenterol Hepatol       Date:  2020-01-03       Impact factor: 46.802

8.  Differences in silencing of mismatched targets by sliced versus diced siRNAs.

Authors:  Guihua Sun; Jinghan Wang; Yasheng Huang; Christine Wan-Yin Yuan; Keqiang Zhang; Shuya Hu; Linling Chen; Ren-Jang Lin; Yun Yen; Arthur D Riggs
Journal:  Nucleic Acids Res       Date:  2018-07-27       Impact factor: 16.971

9.  Extensive screening of microRNA populations identifies hsa-miR-375 and hsa-miR-133a-3p as selective markers for human rectal and colon cancer.

Authors:  David Weber; Laurence Amar; Daniel Gödde; Christian Prinz
Journal:  Oncotarget       Date:  2018-06-05

10.  In Vitro and In Silico Mechanistic Insights into miR-21-5p-Mediated Topoisomerase Drug Resistance in Human Colorectal Cancer Cells.

Authors:  Jung-Chien Chen; Yao-Yu Hsieh; Hsiang-Ling Lo; Albert Li; Chia-Jung Chou; Pei-Ming Yang
Journal:  Biomolecules       Date:  2019-09-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.