Literature DB >> 28690395

An Assessment of Database-Validated microRNA Target Genes in Normal Colonic Mucosa: Implications for Pathway Analysis.

Martha L Slattery1, Jennifer S Herrick1, John R Stevens2, Roger K Wolff1, Lila E Mullany1.   

Abstract

BACKGROUND: Determination of functional pathways regulated by microRNAs (miRNAs), while an essential step in developing therapeutics, is challenging. Some miRNAs have been studied extensively; others have limited information. In this study, we focus on 254 miRNAs previously identified as being associated with colorectal cancer and their database-identified validated target genes.
METHODS: We use RNA-Seq data to evaluate messenger RNA (mRNA) expression for 157 subjects who also had miRNA expression data. In the replication phase of the study, we replicated associations between 254 miRNAs associated with colorectal cancer and mRNA expression of database-identified target genes in normal colonic mucosa. In the discovery phase of the study, we evaluated expression of 18 miR-NAs (those with 20 or fewer database-identified target genes along with miR-21-5p, miR-215-5p, and miR-124-3p which have more than 500 database-identified target genes) with expression of 17 434 mRNAs to identify new targets in colon tissue. Seed region matches between miRNA and newly identified targeted mRNA were used to help determine direct miRNA-mRNA associations.
RESULTS: From the replication of the 121 miRNAs that had at least 1 database-identified target gene using mRNA expression methods, 97.9% were expressed in normal colonic mucosa. Of the 8622 target miRNA-mRNA associations identified in the database, 2658 (30.2%) were associated with gene expression in normal colonic mucosa after adjusting for multiple comparisons. Of the 133 miRNAs with database-identified target genes by non-mRNA expression methods, 97.2% were expressed in normal colonic mucosa. After adjustment for multiple comparisons, 2416 miRNA-mRNA associations remained significant (19.8%). Results from the discovery phase based on detailed examination of 18 miRNAs identified more than 80 000 miRNA-mRNA associations that had not previously linked to the miRNA. Of these miRNA-mRNA associations, 15.6% and 14.8% had seed matches for CRCh38 and CRCh37, respectively.
CONCLUSIONS: Our data suggest that miRNA target gene databases are incomplete; pathways derived from these databases have similar deficiencies. Although we know a lot about several miRNAs, little is known about other miRNAs in terms of their targeted genes. We encourage others to use their data to continue to further identify and validate miRNA-targeted genes.

Entities:  

Keywords:  Colon cancer; RNA-Seq; bias; functionality; miRNA

Year:  2017        PMID: 28690395      PMCID: PMC5484592          DOI: 10.1177/1176935117716405

Source DB:  PubMed          Journal:  Cancer Inform        ISSN: 1176-9351


Background

MicroRNAs (miRNAs) regulate gene expression by either repressing translation of messenger RNAs (mRNAs) or by inducing mRNA degradation by complementarity binding of target sequences.1 Given their role in gene expression, it is not surprising that miRNAs have been shown to be associated with many physiological processes and have been linked to numerous diseases.2–10 As such, miRNAs have the potential of being valuable biomarkers for both early disease detection and prognosis as well as serving as targets for therapeutic purposes. An understanding of the functionality of miRNAs and how they relate to various biological pathways is essential to move the field of miRNA research from associations with diseases and conditions to therapeutic and interventional tools. Determination of functional pathways regulated by miRNAs, while an essential step in developing therapeutics centering on miRNA expression, is challenging. Functional pathway analysis is largely dependent on databases such as miRTarBase which have identified miRNA target genes that have been validated by a variety of methods.11 MicroRNA target interactions (MTIs) incorporated into the database have varying degrees of supporting evidence. The minority of MTIs incorporated into the database have strong evidence and include reporter assays and Western blot methods. Most of the MTIs in the database come from what is considered less stringent methods of target gene identification such as microarray and next-generation sequencing methods, one of which is cross-linking and immunoprecipitation sequencing or cross-linking immunoprecipitation (CLIP)-Seq. Although quantitative polymerase chain reaction (qPCR), microarray, RNA-Seq, and Northern blot methods detect MTI associations through gene expression, other experiments such as reporter assay and Western blot studies measure expression levels of proteins. Comparisons of miRNA expression and mRNA expression patterns have been used as methods to identify target genes.11–13 Although the study of miRNAs as they relate to disease is a rapidly expanding field of research, databases that contain information on miRNA-validated targeted genes are dependent on what is known in the literature. It is not surprising that some miRNAs have been studied extensively, whereas others have limited information. The extent to which information bias regarding miRNA-targeted genes influences our ability to accurately infer functional pathways associated with miRNAs is unexplored. In this study, we examine 254 miRNAs that we have previously identified as being associated with colorectal cancer, colorectal cancer survival, or the microsatellite instability (MSI) tumor phenotype2,3,14,15 with mRNA expression. We compare expression levels between mRNA for previously identified target genes and the miRNA expression level. We analyze separately associations for those target genes previously identified by mRNA expression methods versus those identified by other methods. For a smaller subset of miRNAs with 20 or fewer reported target genes and for 3 miRNAs with hundreds of database-identified target genes, we compare miRNA expression level with all mRNA expression of protein-coding genes in colon tissue. We further compare seed region for those newly discovered mRNAs associated with miRNAs to help identify target genes that may be directly influenced by miRNAs. Figure 1 depicts the study flow.
Figure 1

Study flow.

Methods

Study participants were part of a colon cancer case-control study that included incident first primary adenocarcinoma of the colon who were diagnosed between 30 and 79 years of age and resided in Utah or were members of the Kaiser Permanente Medical Care Program (KPMCP) in Northern California. Participants were non-Hispanic white, Hispanic, or African American.16 Local Surveillance, Epidemiology, and End Results tumor registries verified all cases that were diagnosed between October 1991 and September 1994. Detailed study methods have been described.3 Participants signed informed consent prior to release of confidential data. The Institutional Review Boards of the University of Utah and the KPMCP approved the study.

RNA Processing

Formalin-fixed paraffin-embedded tissue was used to extract RNA. Normal mucosae adjacent to the carcinoma tissue and matched carcinoma tissue were used to make RNA. Total RNA was extracted, isolated, and purified using the RecoverAll Total Nucleic Acid Isolation Kit (Ambion, Austin, Texas); RNA yields were determined using a NanoDrop spectrophotometer.

MicroRNA

The Agilent Human miRNA Microarray V19.0 was used. The microarray contains probes for 2006 unique human miRNAs as described previously. Data were required to pass stringent quality control (QC) parameters established by Agilent that included tests for excessive background fluorescence, excessive variation among probe sequence replicates on the array, and measures of the total gene signal on the array to assess low signal. If samples failed to meet quality standards for any of these parameters, the sample was relabeled, hybridized to arrays, and rescanned. If a sample failed QC assessment a second time, the sample was deemed to be of poor quality and was excluded from analysis. Our previous analysis has shown that the repeatability associated with this microarray was extremely high (r = 0.98),3 and that comparison of miRNA expression levels obtained from the Agilent microarray with those obtained from qPCR had an agreement of 100% in terms of directionality of findings and that the fold change calculated for the miRNA expression difference between carcinoma and normal colonic mucosa was almost identical.2 Of the 2006 unique human miRNAs assessed on the Agilent microarray, 1226 were expressed in colon carcinoma tissue and 1179 in normal colon mucosa. To normalize differences in miRNA expression that could be attributed to the array, amount of RNA, location on array, or factors that could erroneously influence miRNA expression levels, total gene signal was normalized by multiplying each sample by a scaling factor,17 which was the median of the 75th percentiles of all the samples divided by the individual 75th percentile of each sample.

RNA-Seq Sequencing Library Preparation and Data Processing

Total RNA was run on 197 carcinoma and normal mucosa pairs; 157 of these passed QC. These samples were taken from the study subjects used for miRNA analysis and were extracted, isolated, and purified in the same manner as previously described.18 RNA library construction was done with the Illumina TruSeq Stranded Total RNA Sample Preparation Kit with Ribo-Zero. The samples were then fragmented and primed for complementary DNA (cDNA) synthesis, adapters were then ligated onto the cDNA, and the resulting samples were then amplified using polymerase chain reaction (PCR); the amplified library was then purified using Agencourt AMPure XP beads. A more detailed description of the methods can be found in our previous work.19 Illumina TruSeq v3 single read flow cell and a 50-cycle single-read sequence run were performed on an Illumina HiSeq instrument. Reads were aligned to a sequence database containing the human genome (build GRCh37/hg19, February 2009 from genome.ucsc.edu). Python and a pysam module were used to calculate counts for each exon and untranslated region (UTR) of the genes using a list of gene coordinates obtained from http://genome.ucsc.edu. Total gene counts were determined. We dropped features that were not expressed in our data or for which the expression was missing for most of the samples.19

Bioinformatics Analysis

Our assessment of miRNAs focused on 254 miRNAs that we have previously reported as being associated with either differences in tumor and normal mucosa expression with a fold change of at least 1.5, survival, or with MSI tumor phenotype.2,3,14 We identified experimentally validated target gene these miRNAs using miRNA-mRNA pairs from miRTarBase v6.0.11 As our miRNA names are those from Agilent v19 (corresponding to miRBase v19) and miRTar-Base v6.0 includes newer associations, we determined the new nomenclature for any miRNAs whose names did not match using archived miRBase (http://www.mirbase.org)20 “miRNA.diff.zip” files. To determine associations between miRNA-mRNA pairs in colon tissue and to help estimate the extent of completeness of existing databases, we conducted both a replication and discovery analysis. For the replication component of the study, we determined how many and which target genes were identified using gene expression methods, namely, “microarray,” RNA-Seq, “qRT-PCR,” and “Northern blot” experiments (we refer to gene expression methods as “mRNA-methods”) in miRTarBase and whether the miRNA-mRNA pairs could be replicated in normal colon tissue. Target genes for miRNAs identified by methods other than gene expression (which we refer to as “non-mRNA methods”) were analyzed separately to determine how many of these could be identified by RNA-Seq, an mRNA expression method. Our hypothesis is that those target genes identified by mRNA methods should correlate more positively to our comparison of miRNA with target genes using RNA-Seq data. For the discovery component of the study, we focused on miRNAs that had 20 or fewer validated targets in miRTarBase as well as 3 commonly validated miRNAs (hsa-miR-21-5p, hsa-miR-124-3p, and hsa-miR-215). We analyzed these miR-NAs with mRNA expression generated by RNA-Seq in our data set excluding those validated target genes that had previously been identified in miRTarBase (see section “Statistical Methods”). We further analyzed miRNAs and targeted mRNAs for seed region matches, and we analyzed the mRNA 3′ UTR FASTA as well as the seed region sequence of the associated miRNA to determine seed region pairings between miRNA and mRNA. MicroRNA seed regions were calculated as described in our previous work,21 and we calculated and included seeds of 6, 7, and 8 nucleotides in length. Our hypothesis is that a seed match would increase the likelihood that identified genes associated with a specific miRNA were more likely to have a direct association given a higher propensity for binding. As miRTarBase uses findings from many different investigations spanning across years and alignments, we used FASTA sequences generated from both GRCh37 and GRCh38 Homo sapiens alignments, using UCSC Table Browser (https://genome.ucsc.edu/cgi-bin/hgTables).22 We downloaded FASTA sequences that matched our Ensembl IDs and had a consensus coding sequences available. Analysis was done using scripts in R 3.2.3 and in perl 5.018002.

Statistical Methods

Our final analysis consisted of 157 subjects with high-quality mRNA expression data and high-quality miRNA data. After excluding from the analysis any non–protein-coding mRNAs and those with fewer than 0.5 reads on average across all samples, we were able to examine 17 434 mRNAs that had unique Ensembl IDs. We used the log base 2–transformed RPKM (reads per kilobase per million) normal colonic mucosa mRNA expression data. We examined the association between the mRNAs and the candidate miRNAs by fitting a linear model and adjusting for age at diagnosis, study center, and sex. P values were generated using the bootstrap method by creating a distribution of 10 000 F statistics derived by resampling the residuals from the null hypothesis model of no association between the mRNAs and miRNAs using the boot package in R. Associations were considered significant if the false discovery rate (FDR)– adjusted P values were less than .05.23

Results

Most of the study population was men, and the average age of study participants was 65.1 years (Table 1). The major molecular tumor phenotypes for the colon cancer study participants were 18.5% MSI, 42.7% with a mutated TP53 gene, 28% with a mutation in KRAS gene, and 27.4% with a CpG island methylator phenotype high tumor. The distribution of targeted genes per miRNA shows that 117 of the 254 miRNAs we were able to evaluate had fewer than 100 target genes, 60 miRNAs had 100 to 200 identified target genes, 51 miRNAs had between 200 and 500 targeted genes, and 26 miRNAs had more than 500 database-identified target genes (Figure 2).
Table 1

Description of the study population.

NO. (%)
AgeMean (SD)65.1 (10.2)
SexMale88 (56.1)
Female69 (44.0)
AJCC stage136 (23.1)
251 (32.7)
350 (32.1)
419 (12.2)
Tumor instabilityMSS128 (81.5)
MSI29 (18.5)
TP53Nonmutated90 (57.3)
Mutated67 (42.7)
KRASNonmutated113 (72.0)
Mutated44 (28.0)
CIMPLow114 (72.6)
High43 (27.4)

Abbreviations: AJCC, American Joint Committee on Cancer; CIMP, CpG island methylator phenotype; MSI, microsatellite instability; MSS, microsatellite stable.

Figure 2

Distribution of database-validated target genes by individual microRNAs (miRNAs).

Replication phase

Of the 121 miRNAs that had at least 1 targeted gene identified by mRNA gene expression methods, 97.9% were expressed in normal colonic mucosa (Table 2 shows 50 of the 121 miR-NAs; Supplemental Table 1 shows all 121 miRNAs). Of the 8622 target miRNA-mRNA associations identified in the database, 2658 (30.2%) were associated with gene expression in normal colonic mucosa after adjusting for multiple comparisons; prior to adjustment for multiple comparisons, 42.9% of database-identified associations were detected. For these miRNAs that had at least 1 targeted gene identified by a gene expression method, 31 399 targeted genes were database validated by non-mRNA methods; of these, 98.3% were expressed in normal colonic mucosa. Of the database-identified target genes identified by non-mRNA expression methods, 48.0% were associated with the targeted gene in our data prior to adjustment for multiple comparisons and 37.6% were significant when an FDR of 0.05 was applied.
Table 2

Comparison of miRNA-validated targeted genes with mRNA expression to colon RNA-Seq data.

MIRNAMRNA DATABASE–IDENTIFIED TARGET GENESNON-MRNA DATABASE–IDENTIFIED TARGET GENES
TOTAL, NEXPRESSED IN COLON TISSUE, NVALIDATED BY RNA-SEQ PUNADJ < .05VALIDATED BY RNA-SEQ FDR < 0.05, NTOTAL, NEXPRESSED IN COLON TISSUE, NVALIDATED BY RNA-SEQ PUNADJ < .05, NVALIDATED BY RNA-SEQ FDR < 0.05, N
Total targets880886223782265831 39930 85515 08011 795
hsa-let-7a-5p37361710528522341257
hsa-let-7e-5p12124151951116587
hsa-let-7f-5p2020119325318208173
hsa-let-7g-5p1717118274269198163
hsa-let-7i-5p5433278273242226
hsa-miR-1-3p419411389424421287
hsa-miR-10a-5p111165408400229179
hsa-miR-15a-5p7978371457356617956
hsa-miR-16-5p10810896891339132011591077
hsa-miR-17-5p4040272210501034772661
hsa-miR-19b-3p1212316586509619
hsa-miR-20a-5p33332316953939674537
hsa-miR-20b-5p111165838824617488
hsa-miR-21-3p333262613835
hsa-miR-21-5p4384343653191201189379
hsa-miR-23a-3p23211613184181146122
hsa-miR-23b-3p131253282278155102
hsa-miR-24-3p259255217194509492415397
hsa-miR-25-3p16161212401394337312
hsa-miR-26a-5p26251917376374298261
hsa-miR-26b-5p14651383267912672666122
hsa-miR-27a-3p38373023336328250202
hsa-miR-27b-3p2524116345336185135
hsa-miR-28-5p220069685650
hsa-miR-29a-3p6464474014614010797
hsa-miR-29b-3p686824121501446850
hsa-miR-29c-3p40401061721675334
hsa-miR-30a-5p1515116586535515
hsa-miR-30b-5p16165136836611453
hsa-miR-30c-5p2524434544512311
hsa-miR-30d-5p5532337335275243
hsa-miR-30e-5p7700314313328
hsa-miR-31-5p37361013212720
hsa-miR-34a-5p67635043511491452425
hsa-miR-92a-3p343431301222120511441124
hsa-miR-92b-3p22006035936538
hsa-miR-93-5p2828131211141099868753
hsa-miR-98-5p47046122511123523010356
hsa-miR-99a-5p1010311151144019
hsa-miR-99b-5p33214242177
hsa-miR-100-5p23238820920711565
hsa-miR-101-3p23230030029410
hsa-miR-106b-5p797514193492020558
hsa-miR-124-3p12321209105891812311610593
hsa-miR-125b-5p70705038305303250218
hsa-miR-127-3p7732131354
hsa-miR-129-5p1212119353344291267
hsa-miR-130a-3p202011933432515783
hsa-miR-130b-3p990050749751

Abbreviations: FDR, false discovery rate; mRNA, messenger RNA; miRNA, microRNA.

Evaluation of the 133 miRNAs with database-identified target genes only by non-mRNA expression methods showed that 11 850 of the 12 191 target genes were expressed in normal colonic mucosa (97.2%) (Table 3 shows 50 of the miRNAs; Supplemental Table 2 shows all 133 miRNAs). Of those expressed in normal colonic mucosa, 3770 (30.9%) miRNA-mRNA were associated in normal colonic mucosa using RNA-Seq data prior to adjustment for multiple comparisons. After adjustment for multiple comparisons, 2416 miRNA-mRNA associations remained significant (19.8%); this compares with 30.2% of miRNA:mRNA associations being detected when the original detection was a gene expression method.
Table 3

Comparison of miRNA targets for 50 miRNAs with non-mRNA expression methods was used to RNA-Seq in colon tissue.

MIRNATOTAL, NEXPRESSED IN COLON TISSUE, NVALIDATED BY RNA-SEQ PUNADJ < .05, NVALIDATED BY RNA-SEQ DR < 0.05, N
Total target genes12 19111 85037702416
hsa-miR-30c-1-3p421410206
hsa-miR-139-3p706610
hsa-miR-145-3p454454
hsa-miR-151a-3p73714640
hsa-miR-151b242462
hsa-miR-192-3p121117135
hsa-miR-204-3p949010
hsa-miR-324-5p27627212145
hsa-miR-361-3p1211188173
hsa-miR-378b272600
hsa-miR-378d272610
hsa-miR-378g575731
hsa-miR-378i272610
hsa-miR-425-3p38362216
hsa-miR-455-3p42641662
hsa-miR-466157155179
hsa-miR-501-3p625843
hsa-miR-513c-3p17216912480
hsa-miR-532-3p200196124
hsa-miR-550a-3-5p8177124
hsa-miR-550b-2-5p868200
hsa-miR-583133129168
hsa-miR-62321220110
hsa-miR-652-3p1301299561
hsa-miR-654-5p89847473
hsa-miR-659-5p23231713
hsa-miR-662161600
hsa-miR-664a-3p847900
hsa-miR-664a-5p108105136
hsa-miR-664b-3p1381359970
hsa-miR-664b-5p131386
hsa-miR-769-3p145140129119
hsa-miR-877-5p210207158118
hsa-miR-892b545472
hsa-miR-934171711
hsa-miR-939-5p125120276
hsa-miR-1183777510
hsa-miR-1203242330
hsa-miR-1207-3p67632510
hsa-miR-1225-5p32302414
hsa-miR-1228-5p262690
hsa-miR-1229-5p4240187
hsa-miR-1233-5p12312242
hsa-miR-1271-5p676640
hsa-miR-1288-3p2525106
hsa-miR-13051761738457
hsa-miR-191311211000
hsa-miR-19735543
hsa-miR-21175048205
hsa-miR-2392106104195

Abbreviations: FDR, false discovery rate; mRNA, messenger RNA; miRNA, microRNA.

Discovery phase

Examination of those miRNAs that had fewer than 20 database-identified target genes along with miR-21-5p, miR-215-5p, and miR-124-3p which all have more than 500 database-identified target genes with all genes expressed in normal colonic mucosa gave us an indication as to how the databases compared at both ends of the mRNA target gene spectrum. We showed a large number of miRNA-mRNA associations for that had not previously linked to the miRNA (Table 4). Of the more than 80 000 miRNA-mRNA associations we detected using RNA-Seq data, 15.6% and 14.8% had seed matches for CRCh38 and CRCh37, respectively, supporting the hypothesis that seed matches would increase the likelihood of a direct association given the higher propensity for binding.
Table 4

miRNA-mRNA associations in colon tissue using RNA-Seq data and matching seed region.

MIRNADATABASE-VALIDATED TARGET GENES, NVALIDATED TARGET GENES EXPRESSING IN COLON TISSUE, NDATA BASE-VALIDATED TARGET GENES BY MRNA EXPRESSION METHODS, NMIRNA-MRNA ASSOCIATIONS IDENTIFIED IN COLON TISSUE (PUNADJ < .05), NMIRNA-MRNA ASSOCIATIONS IDENTIFIED IN COLON TISSUE (FDR < 0.05), NNEW MIRNA-MRNA ASSOCIATIONS IDENTIFIED (FDR < 0.01) USING CRCH 38 TO IDENTIFY SEED MATCH, NNEW MIRNA-MRNA ASSOCIATIONS IDENTIFIED (FDR < 0.01) USING CRCH37 TO IDENTIFY SEED MATCH
hsa-miR-3677-3p44013 36313 0591039966
hsa-miR-606844098056834486450
hsa-miR-197355069181891231216
hsa-miR-318154011 98311 268765688
hsa-miR-4315141407280N/AN/A
hsa-miR-664b-5p1313011 80310 47218761757
hsa-miR-47301515087964884564527
hsa-miR-36211616010 1007869657589
hsa-miR-662161604610N/AN/A
hsa-miR-6717-5p15150598213005856
hsa-miR-4681171701230N/AN/A
hsa-miR-5721717161780N/AN/A
hsa-miR-934171707850N/AN/A
hsa-miR-127-3p2020769088444136
hsa-miR-4787-5p2020010 883961611421068
hsa-miR-21-5p5585524348715664618511754
hsa-miR-215-5p71370767026640N/AN/A
hsa-miR-124-3p13551325120913 37012 86049614815

Abbreviations: FDR, false discovery rate; mRNA, messenger RNA; miRNA, microRNA.

Discussion

Evaluation of 254 miRNAs previously associated with colon cancer, MSI tumor phenotype, or with survival after diagnosis with colorectal cancer showed a great deal of variability in the number of targeted genes for each miRNA in miRTarBase. A major finding is the documentation of the incompleteness of the target genes for many miRNAs. Although our study was limited to colon cancer and could be considered a partial database with respect to miRTarBase, we identified numerous genes whose expression was associated with miRNA expression in colon tissue that were not identified in miRTarBase by similar gene expression methods. We believe that this variability is indicative of the extent to which miRNAs have been studied than actual differences in the number of targeted genes by specific miRNAs. However, the implications for determining functionality and pathways associated with genes targeted by miRNAs resulting from this discrepancy are many; pathways are driven by those miRNAs which have been researched the most. Second, although our partial database is restricted to only colon tissue and gene expression, it points out other limitations. First, our inability to identify target genes incorporated in the database from similar gene expression studies suggests that tissue specificity is important in determine disease-specific pathways. Pathways that encumber non–site-specific genes could obviously be irrelevant for the disease and tissue of interest. In short, although miRNA-mRNA associations may exist in some tissues, they may not be relevant in other tissue types. Our data suggest that 2% to 3% of targeted genes are not expressed in colon tissue and of those miRNA-mRNA associations reported in databases that are expressed in colon tissue, and less than 50% of targeted genes previously identified with mRNA validation methods could be validated with RNA-Seq data in our colon cancer samples. Lack of specificity can limit the accuracy of projected pathways generated by existing databases. The variability in the number of targeted genes identified in the existing database with the 254 miRNAs that we evaluated is considerable (Figure 2). Although some miRNAs such as miR-21-5p had more than 500 previously identified targeted genes, 15 miRNAs had fewer than 20 targeted genes identified. The effect of this disparity is evident in current pathway tools. For instance, when hsa-miR-21-5p, which had 558 validated targets, is entered into miRPath24 (http://snf-515788.vm.okeanos.grnet.gr), an updated miRNA pathway tool, 34 pathways associated with an FDR correction applied and when using TarBase-validated target genes. When hsa-miR-3677-3p, which had 4 validated targets, is entered using the same parameters, 1 significant pathway is returned. Development of pathways associated with dysregulated miRNAs is thus heavily influenced by those miRNAs that have many genes previously identified and is only minimally represented by many miRNAs that have been examined in less detail and are shown to be associated with diseases. Pathways used to determine functionality of miRNAs or to identify therapeutic targets are incomplete and are dominated by a subset of the total miRNAs associated. Tissue specificity of miRNA expression, while a concern, appears to have a less impact in terms of pathway identification because 97% to 98% of mRNAs were expressed in our targeted colon tissue. However, a greater concern is that although miR-NAs and mRNAs may be expressed in the tissue, they may not have the same regulatory impact. We saw that less than half of the mRNAs previously identified by gene expression methods were actually associated with their identified target in our data. This lack of reproducibility could be from tissue specificity. However, other reasons for the lack of association could also exist. Because we were looking at gene expression from RNA-Seq, we adjusted for more comparisons than perhaps some of the previously reported studies. It should be kept in mind that the microarray and next-generation sequencing methods are considered less reliable and our lack of confirmation of some of these associations could stem from the level of confidence in data such as these. Validation of miRNA target genes includes methods that have varying degrees of their strength of evidence as well as what they are validating. Western blot and reporter assays that detect protein levels along with qPCR which measures mRNA expression levels have been considered stronger evidence than microarrays or next-generation sequencing techniques such as RNA-Seq or CLIP-Seq.11 Methods that can validate protein expression and are target-specific miRNAs to determine whether changing miRNA alters mRNA expression level or protein expression are ideal.25–27 However, these techniques do not lend themselves to more widespread exploration of targeted genes. Microarrays and RNA-Seq, both methods that can more broadly curate gene expression, detect mRNA expression levels and do not evaluate protein expression. Because miRNAs function through posttranscriptional regulation to effect protein expression, these techniques could fail to identify associations with target genes that might exist. MicroRNAs can also affect mRNA expression through degradation by partial complementarity binding of target sequences. Target gene validation techniques, such as RNA-Seq, would detect variation in mRNA that could be correlated with miRNA expression. It has been stated that RNA-Seq provides “an alternative to microarray gene expression analysis allowing a deeper analysis to provide a larger list of inferring miRNA targets in comparable over-expression studies.”28 Although we have shown that some miRNAs are associated with thousands of mRNAs, we have attempted to identify mRNAs that are more directly associated with the miRNA of interest. It is most likely that many mRNAs are not directly associated with the miRNA of interest, but rather dysregulation is seen secondary to other directly targeted genes.27 To identify direction interactions, we used seed matches between the 5′ region of the mature miRNA from nucleotides 2 to 8, or the seed region, and the 3′ UTR of the mRNA.21 A seed region match suggests a more functional binding between the miRNA and the targeted gene. It also has been suggested by Thomson et al28 that seed matches can provide a mechanism of enriching for direct miRNA targets over indirect or secondary effects. Evaluation of seed region matches between significant miRNA-mRNA associations showed that there were a considerable number of previously unreported target genes (Table 4). This would suggest that many of the associations are indirectly related, but that the databases are incomplete for most miR-NAs when it comes to identifying targeted genes and subsequent pathways constructed from those genes. Another important consideration when interpreting miRNA and their targeted pathways is the complexity of miR-NAs and their associations with gene expression. MicroRNAs regulate hundreds if not thousands of genes, and individual genes are regulated by many miRNAs. Furthermore, it is unlikely that miRNA-mRNA associations apply to all tissues. The lack of specificity in these associations adds complexity to constructing pathways because it is difficult to know which miRNA-mRNA–targeted pairs are most relevant for the condition being studied. The study has several strengths. First is our depth of data that includes individuals with both miRNA and RNA-Seq data. This allows us to attempt colon cancer–specific functionality assessment. In addition, using RNA-Seq data to evaluate mRNA expression, we are able to consider many more genes expressed in colon tissue than other methods that are more labor-intensive and tissue-intensive. This allows us to undertake a broader discovery of new miRNA-mRNA associations. However, there are limitations in that associations were detected by non-mRNA methods that may imply translational miRNA mechanisms that would not be detected when looking at gene expression. Availability of protein data would enhance our analysis but is unavailable. We adjusted for multiple comparisons and through this adjustment could have missed previously identified associations. Although we have applied rigid QC and compared subsets of both our miRNA and mRNA data to qPCR with good results,2,29 there is potential for inaccuracies in these techniques. There are several other considerations when constructing functional pathways associated with miRNAs. Our data suggest that miRNA-targeted gene databases are incomplete; pathways derived from these databases have similar deficiencies. Although we know a lot about several miRNAs, we know very little about other miRNAs in terms of what genes they target. It appears that for most miRNAs, the information is incomplete in terms of validated targeted genes and that tissue-specific associations exist.

Conclusions

Existing databases of miRNA-targeted genes have limitations both in terms of coverage for specific miRNAs and tissue-specific miRNA-mRNA associations. We encourage others to use their data to continue to further identify and validate miRNA-targeted genes to improve the likelihood that research conducted on miR-NAs will help translate to improve medical care.
  27 in total

Review 1.  Experimental validation of miRNA targets.

Authors:  Donald E Kuhn; Mickey M Martin; David S Feldman; Alvin V Terry; Gerard J Nuovo; Terry S Elton
Journal:  Methods       Date:  2008-01       Impact factor: 3.608

2.  Energy balance and colon cancer--beyond physical activity.

Authors:  M L Slattery; J Potter; B Caan; S Edwards; A Coates; K N Ma; T D Berry
Journal:  Cancer Res       Date:  1997-01-01       Impact factor: 12.701

Review 3.  miRNA and vascular cell movement.

Authors:  Junming Yue
Journal:  Adv Drug Deliv Rev       Date:  2011-01-15       Impact factor: 15.470

4.  Expression Profiles of miRNA Subsets Distinguish Human Colorectal Carcinoma and Normal Colonic Mucosa.

Authors:  Daniel F Pellatt; John R Stevens; Roger K Wolff; Lila E Mullany; Jennifer S Herrick; Wade Samowitz; Martha L Slattery
Journal:  Clin Transl Gastroenterol       Date:  2016-03-10       Impact factor: 4.488

5.  miR-221 affects multiple cancer pathways by modulating the level of hundreds messenger RNAs.

Authors:  Laura Lupini; Cristian Bassi; Manuela Ferracin; Nenad Bartonicek; Lucilla D'Abundo; Barbara Zagatti; Elisa Callegari; Gentian Musa; Farzaneh Moshiri; Laura Gramantieri; Fernando J Corrales; Anton J Enright; Silvia Sabbioni; Massimo Negrini
Journal:  Front Genet       Date:  2013-04-25       Impact factor: 4.599

6.  Differential Gene Expression in Colon Tissue Associated With Diet, Lifestyle, and Related Oxidative Stress.

Authors:  Martha L Slattery; Daniel F Pellatt; Lila E Mullany; Roger K Wolff
Journal:  PLoS One       Date:  2015-07-31       Impact factor: 3.240

7.  miRBase: annotating high confidence microRNAs using deep sequencing data.

Authors:  Ana Kozomara; Sam Griffiths-Jones
Journal:  Nucleic Acids Res       Date:  2013-11-25       Impact factor: 16.971

8.  MicroRNA Target Identification-Experimental Approaches.

Authors:  Aida Martinez-Sanchez; Chris L Murphy
Journal:  Biology (Basel)       Date:  2013-01-25

9.  MicroRNA profiles in colorectal carcinomas, adenomas and normal colonic mucosa: variations in miRNA expression and disease progression.

Authors:  Martha L Slattery; Jennifer S Herrick; Daniel F Pellatt; John R Stevens; Lila E Mullany; Erica Wolff; Michael D Hoffman; Wade S Samowitz; Roger K Wolff
Journal:  Carcinogenesis       Date:  2016-01-06       Impact factor: 4.944

10.  Site-specific associations between miRNA expression and survival in colorectal cancer cases.

Authors:  Martha L Slattery; Jennifer S Herrick; Daniel F Pellatt; Lila E Mullany; John R Stevens; Erica Wolff; Michael D Hoffman; Roger K Wolff; Wade Samowitz
Journal:  Oncotarget       Date:  2016-09-13
View more
  8 in total

1.  The p53-signaling pathway and colorectal cancer: Interactions between downstream p53 target genes and miRNAs.

Authors:  Martha L Slattery; Lila E Mullany; Roger K Wolff; Lori C Sakoda; Wade S Samowitz; Jennifer S Herrick
Journal:  Genomics       Date:  2018-06-01       Impact factor: 5.736

2.  The NF-κB signalling pathway in colorectal cancer: associations between dysregulated gene and miRNA expression.

Authors:  Martha L Slattery; Lila E Mullany; Lori Sakoda; Wade S Samowitz; Roger K Wolff; John R Stevens; Jennifer S Herrick
Journal:  J Cancer Res Clin Oncol       Date:  2017-11-29       Impact factor: 4.553

Review 3.  Precision Medicine for CRC Patients in the Veteran Population: State-of-the-Art, Challenges and Research Directions.

Authors:  Shyam S Mohapatra; Surinder K Batra; Srinivas Bharadwaj; Michael Bouvet; Bard Cosman; Ajay Goel; Wilma Jogunoori; Michael J Kelley; Lopa Mishra; Bibhuti Mishra; Subhra Mohapatra; Bhaumik Patel; Joseph R Pisegna; Jean-Pierre Raufman; Shuyun Rao; Hemant Roy; Maren Scheuner; Satish Singh; Gitanjali Vidyarthi; Jon White
Journal:  Dig Dis Sci       Date:  2018-03-23       Impact factor: 3.199

4.  Dysregulated genes and miRNAs in the apoptosis pathway in colorectal cancer patients.

Authors:  Martha L Slattery; Lila E Mullany; Lori C Sakoda; Roger K Wolff; Wade S Samowitz; Jennifer S Herrick
Journal:  Apoptosis       Date:  2018-04       Impact factor: 4.677

5.  Expression of Wnt-signaling pathway genes and their associations with miRNAs in colorectal cancer.

Authors:  Martha L Slattery; Lila E Mullany; Lori C Sakoda; Wade S Samowitz; Roger K Wolff; John R Stevens; Jennifer S Herrick
Journal:  Oncotarget       Date:  2017-12-23

6.  The TGFβ-signaling pathway and colorectal cancer: associations between dysregulated genes and miRNAs.

Authors:  Andrew J Pellatt; Lila E Mullany; Jennifer S Herrick; Lori C Sakoda; Roger K Wolff; Wade S Samowitz; Martha L Slattery
Journal:  J Transl Med       Date:  2018-07-09       Impact factor: 5.531

7.  The PI3K/AKT signaling pathway: Associations of miRNAs with dysregulated gene expression in colorectal cancer.

Authors:  Martha L Slattery; Lila E Mullany; Lori C Sakoda; Roger K Wolff; John R Stevens; Wade S Samowitz; Jennifer S Herrick
Journal:  Mol Carcinog       Date:  2017-11-19       Impact factor: 4.784

8.  The MAPK-Signaling Pathway in Colorectal Cancer: Dysregulated Genes and Their Association With MicroRNAs.

Authors:  Martha L Slattery; Lila E Mullany; Lori C Sakoda; Roger K Wolff; Wade S Samowitz; Jennifer S Herrick
Journal:  Cancer Inform       Date:  2018-03-26
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.