Literature DB >> 32128468

Single nucleotide alterations in MicroRNAs and human cancer-A not fully explored field.

Dan Zhao1.   

Abstract

MicroRNAs are ~20 nt long small noncoding RNAs that are processed from stem-looped precursors and function mainly as posttranscriptional regulators of protein coding genes through binding to 3'-untranslated regions of messenger RNAs to inhibit the translation or cause RNA degradation. It is predicted microRNAs could regulate up to half of all human genes and are proved to play important roles in human diseases including cancer. They bind to target mRNAs based on complementary binding which is dominated by the so-called "seed" region which are the 5' 2-8 bases of the microRNA. Due to the small size in nature, even a single nucleotide variation in the precursor region especially those located in the seed regions could show big influence. Here, I summarized and reviewed the current knowledge of these single nucleotide alterations in microRNAs in human cancer including (i) common SNPs in the precursor region, (ii) isomiRs, (iii) somatic mutations of microRNAs. Briefly, this is an underexploited field and clearly, warrants further studies to reveal their biological and clinical significances. I believe they will be key to advancing personalized medicine.
© 2020 Production and hosting by Elsevier B.V. on behalf of KeAi Communications Co., Ltd.

Entities:  

Keywords:  Isomirs; SNPs; Single nucleotide alterations; Somatic mutations; microRNAs

Year:  2020        PMID: 32128468      PMCID: PMC7044681          DOI: 10.1016/j.ncrna.2020.02.003

Source DB:  PubMed          Journal:  Noncoding RNA Res        ISSN: 2468-0540


Introduction

Since the discovery of the first microRNA in 1993, this type of small non-coding RNAs has attracted the attention of scientists in nearly all fields of researches and has proved themselves as a superstar [1,2]. They have been reported to participate in various biological development processes and cell activities and are implicated in human diseases with diagnostic and therapeutic potential [3,4]. According to the microRNA registry miRbase (mirbase.org), the number of annotated human microRNAs has been expanded from 56 in miRBase release 1.0 in December 2002 to 2656 in the current release and is expected to be again growing in the future. They are scattered on all chromosomes except the Y chromosome in humans and locates in intergenic, intronic or exonic regions as either single microRNA or miR clusters [5]. They are briefly small 20–24 nucleotide noncoding RNAs derived from primary microRNA transcripts and processed by a two-step cleavage involving enzymes like Drosha and Dicer. The mature microRNAs are believed to function mainly as, although not exclusively, negative regulators of target genes through RNA decay (which is major) or translation inhibition [6]. Canonically, they bind to 3′-UTR of target genes through a complementary sequence of the 2–8 nucleotides in their 5′ end (so-called seed region) [7]. With the development of many microRNA target prediction algorithms, it is estimated that nearly half or even more of all human genes are directly regulated by microRNAs [8]. As such interesting and important players in cellular processes, microRNAs have been implicated in nearly all types of human cancers [[9], [10], [11]]. Considering the relatively small size of microRNAs and especially the small size of seed regions which are critical for microRNA target recognition, even a single nucleotide variation in the pre-miRNA regions particularly those in the mature and seed regions may greatly influence either the maturation, the binding affinity/stability or even target selectivity of microRNAs. Here, I summarized the current knowledge of microRNA biogenesis, targets recognition through canonical and non-canonical binding, and microRNAs in human cancers and gave a focused review on common SNPs in pre-microRNAs, isomiRs and somatic mutations of microRNAs. I found this is still by far a not fully explored field and warrants further investigations. Considering the great potential of both microRNAs and genome variations in precision medicine [[12], [13], [14], [15], [16], [17]], I believe future studies in this field are of great importance and significance.

MicroRNA biogenesis and target recognition

MicroRNAs are small noncoding RNAs that are 19–24 nt long. They are transcribed mainly through the RNA polymerase II and following the transcription the canonical biogenesis of microRNAs is a two-step process (cropping and dicing) which includes one cleavage event in the nucleus by RNase III enzyme Drosha and the other in the cytoplasm by another RNase III enzyme Dicer [18]. Drosha process the pri-microRNA into a ~60–100 nt stem-loop structured microRNA precursor (pre-microRNA). Then pre-microRNAs are exported into the cytoplasm with the help of exportins and then Dicer will cleave the pre-microRNA into a double stranded RNAs. Except this Drosha-dependent and Dicer-dependent canonical pathway, other non-canonical pathways of microRNA biogenesis have also been reported [19,20]. For example, some microRNAs are Drosha independent or Dicer independent. Good example of Drosha-independent microRNAs are mirtrons which are microRNAs derived from the introns of mRNAs and produced directly through RNA splicing. m7G (7-methylguanosine)-capped pre-microRNAs are another example which does not need Drosha cleavage but could directly be exported to the cytoplasm for Dicer processing. Also, it has been found that biogenesis of miR-451 is one rare event that requires Drosha but not Dicer and was cleaved by Ago2 instead in the cytoplasm [20]. After mature microRNAs are formed, they are incorporated into the RISC complex. Most microRNAs target the 3′-UTR regions of mRNAs through complementary Watson-Crick pairing of their seed region which is the 5′ 2–8 base of the mature microRNA. They either inhibits the translation or cause mRNA decapping and degradation. Many microRNA target prediction algorithms based on this theory have been developed and widely accepted including TargetScan, Pictar, RNA22, PITA etc [8]. However, it was found that all these algorithms showed a high false positive prediction rates and it was reported that up to 50% of the predicted targets are false positives [21]. This indicates that microRNAs may function through other direct or indirect mechanisms. Interestingly, it was found that some microRNAs could bind to the 5′-UTR, the CDS or the promoter regions to even upregulate transcription or enhance translation [22]. miR10a that binds to the 5′UTR, miR373 binding to promoter and miR34a binding to both 5′-UTR and 3′-UTR are all good examples for these nonclassical binding [[23], [24], [25]]. What is more, some argue that microRNAs may target transcription factors and exert their function through a two-layer miR-TF-mRNA axis [[26], [27], [28], [29], [30]]. All in all, microRNAs in human cells play important roles and they regulate gene expression through several different types of regulatory mechanisms.

Common single nucleotide polymorphisms (SNPs) in pre-microRNAs

Single nucleotide polymorphisms frequently called SNPs, are the most common type of genetic variations in human. It was estimated that there are more than 100 million SNPs in human genome and about 7.5 million of them are common SNPs (SNPs with minor allele frequency (MAF) no less than 5%) [31]. With limitations of samples size and statistical power, past case-control studies have also revealed the biological significance and functionality of common SNPs but not rare SNPs. To begin with, I first obtained all SNP data located in the pre-microRNAs from Ensemble and ranked them with MAF and number of related publications. In brief, there are in total 322 SNPs located in the microRNA precursor regions including 37 in the seed regions, 65 in the mature regions and the rest 220 in the precursor regions (Supplementary Table 1). Surprisingly, only 16 of these common SNPs were well studied with more than 10 related publication up to date (Table 1), with only 1 located in the seed regions and 3 in the mature regions and 12 in precursors.
Table 1

List of well-studied SNPs located in the pre-microRNA regions.

Gene nameVariant nameMinor allele frequencySNP positions# of related publications
MIR196A2rs116149130.332668in mature425
MIR499Ars37464440.183506in seed289
MIR499Brs37464440.183506in mature289
MIR149rs22928320.386581in precursor157
MIR27Ars8958190.363818in precursor136
MIR608rs49195100.363818in mature82
MIR423rs65051620.497804in precursor77
MIR1908rs1745610.279553in precursor31
MIR492rs22890300.114417in precursor25
MIR605rs20435560.259585in precursor24
MIR149rs714284390.14397in precursor15
MIR449Brs100611330.122204in mature13
MIR604rs23683920.323083in precursor12
MIR1307rs79114880.303714in precursor12
MIR618rs26828180.242412in precursor12
MIR202rs123558400.31889in precursor11
List of well-studied SNPs located in the pre-microRNA regions.

miR196A2 rs11614913

rs11614913 located in the mature miR-196a-3p is the most widely studied microRNA SNP so far. It was first discovered as a genetic variant impacting patients’ survival in non-small cell lung cancer [32] and breast cancer [33]. Since then, it has been implicated in various human cancers including breast, lung, prostate, colorectal, gastric, head and neck, hepatocellular carcinomas, etc [[34], [35], [36], [37], [38], [39], [40], [41]]. Scientists found this SNP influenced the pre-microRNA processing and maturation. Also, this position underwent somatic mutation in breast cancer in 14% of all patients and correlates with higher expression of miR196a [42]. However, conflicting results from large case-control studies have been observed for this SNP and it seems that this SNP has population and cancer type specific effect. For example, C allele is correlated with high risk of hepatocellular carcinoma [43] while T allele is correlated with high risk of Glioma, prostate cancer and colorectal carcinoma [[44], [45], [46]]. In a meta-analysis of 46 studies involving 20673 cases and 25143 controls, it was discovered that C allele showed significant association with cancer risk in Asians but not in Caucasians [38]. In another study, it was shown that rs11614913 T allele frequency is significantly low in African population compared to Non-African population [47].

miR499 rs3746444

The miR499 locus encodes two microRNAs miR499a (transcribing from the forward strand) and miR499b (transcribing from the reverse strand). rs3746444 is located in the seed region of miR499a-3p while in the mature region of miR499b-5p. It was first discovered together with miR196a2 SNP rs11614913 from a case-control study as risk variants for breast cancer involving 1009 cases and 1093 controls [33]. It also shows a population and caner type specific effects. In Chen 2014, T allele predicts risk for esophageal cancer while C allele for all other cancers [48]. Mi et al. found that C allele is associated with increased cancer risk in Asian but not in Caucasians [46].

miR149 SNPs rs2292832 and rs71428439

These two SNPs are the only common SNPs (MAF = 0.39 for rs2292832 and 0.14 for rs71428439) located in a single microRNA locus which are both well studied up to date. They are only 2 base pair away from each other and locates in the 3’ end of the pre-miR149. Rs71428439 G allele showed reduced level of maturation of miR149 compared with the A allele [49] and predicts high risk for lung cancer and hepatocellular carcinomas [50,51]. rs2292832 was found to be related to risk of multiple cancers including colorectal cancer, liver cancer and breast cancer [[52], [53], [54], [55]] and in a recent study [56], it was shown that this SNP may have a population specific effect in gastric cancer patients. While T allele carriers in Caucasians displayed decreased risk of gastric cancer, they have increased risk among Asians.

miR423 rs6505162

rs6505162 is located in the precursor region of miR423 and was discovered to influence the maturation and expression of miR423 [57,58]. It ranks #1 in minor allele frequency (49.78%, see Table 1) of all the 322 SNPs located in the pre-microRNA regions, which indicates its high prevalence in human diseases related to miR423. Up to date, miR423 SNP rs6505162 has been implicated in multiple cancers including esophageal, breast, ovarian and colorectal cancer [[59], [60], [61]]. Interestingly, there is a significant interaction between rs6505162 and age in esophageal cancers and the protective effect was more than twice among younger patients than older ones [59]. Except for the above summarized four well studied miR SNPs and several others, biological and clinical significance for the majority of these genetic variants located in microRNA precursor regions remain unclear. Most interesting SNPs among these include multiple common SNPs located in a single microRNA locus (e.g. SNPs for miR6891, miR5189, miR4719, miR7150, miR6839, miR548AJ2 and miR5700) and SNPs showing high minor allele frequencies (e.g. rs2663345, rs71974827). Whether they influenced the maturation, expression of microRNAs or targets selection need to be revealed. From past studies, we could imagine these SNPs may also show population, gender, age and cancer type specific effects and researches on these could lead to significant progression in precision medicine.

Isomirs

microRNAs were typically annotated as a single defined sequence before Landgraf et al. [62] have found that the same microRNA could have several variants which differ in length and sequence. The development of deep sequencing has enabled the identification of isomirs, which are microRNA variants that are heterogeneous in length or sequence. Importantly and interestingly, isomirs are not rare and was estimated to contribute to half of the miRnome in human cells and for each microRNA the contribution of isomirs to the expression levels varies from 0% to nearly 100% [63,64]. Some isomirs may affect stability of microRNAs and also target selection [65]. However, up to date, the functionality and biological and clinical significance of majority of these isomirs remain unclear and is yet to be revealed in the future. Isomirs could be classified to 4 main classes: 5′ isomiRs, 3′ isomiRs, polymorphic isomiRs and mixed type isomiRs and they were generated through multiple Drosha or Dicer cleavage, 5′ or 3′ trimming, nucleotide addition, removal or RNA editing [66]. It was found that [67] some isomiRs expression are more abundant and show superior performance as biomarkers than their canonical forms in cancer. Scientists then found that isomer expression could be used to classify breast cancer subtypes and some isomirs could be used as biomarkers in breast cancer and even showed improved performances compared with the traditional gene expression profiling [68]. Recently, scientists used 50 5′ isomir as a panel and showed the expression of these isomirs could effectively classify tumors from 32 different tumor types with high sensitivity [69]. Individual isomir have been validated to be involved in cancer progression and also have diagnostic values in cancer. For example, 5’ isomir of miR140-3p suppress breast cancer proliferation and migration [70], while miR196a isomirs are significantly different between cancer and normal tissues and is diagnostic in laryngeal cancer patients [71]. Isomirs also showed gender or race specific expression patterns [72]. One study from triple negative breast cancer revealed that a total of 21 isomiRs are differentially expressed between white patients and black patients including miR-183-5p isomiRs and they then verified each isomir has distinct effect in cancer cell through microarray analyses of the transcriptome [73]. Later, they performed a more through analyses of isomirs and corresponding mRNA targets in triple negative breast cancer and discovered that genes from key metastasis linked pathways associated with isomirs in a race dependent manner [74]. In another study which analyzed isomirs in prostate cancer TCGA cohort that contains 562 samples from 472 patients, scientists detected a total of more than 3000 isomiRs, of which 524 are differential between tumor and normal and a large portion (1519, ~50%) were specifically abundant in white patients [75]. Aberrant expression of isomirs could be potential cancer biomarkers. In one study, it was reported that some isomirs could be predictive of prostate cancer metastasis [76] and in another, isomirs expression in the urine extracellular vesicles could be used as a non-invasive method to detect prostate cancer [77].

Somatic mutations in microRNAs

Somatic mutations also lead to single nucleotide variations in microRNAs. At present, there is one database SomamiR which included data from more than 600 samples in 50 different type of cancers and mapped ~2500 somatic mutations in microRNAs including ~1800 located in pre-microRNA regions, 644 in mature regions and 181 in seed regions [78,79]. Up to date, there is still limited understanding of these mutations which could be due to overall low mutation rate and there are still no case control studies which contain enough number of samples to reach statistical significance for these mutations. One recently published paper in lung cancer [80] analyzing the TCGA cohort which contain ~500 lung adenocarcinomas and ~500 lung squamous cell carcinomas identified more than 1000 mutation affecting ~500 microRNAs. They identified 10 significantly over mutated hotspot microRNA genes including miR379. Future studies which include more samples or focus on other cancer types will lead to more comprehensive understanding of these somatic mutations in microRNAs.

Concluding remarks

In this review, I summarized the current knowledge of single nucleotide variations in human microRNAs including the single nucleotide polymorphisms with minor allele frequencies higher than 5%, the isomiRs, and somatic mutations in microRNA genes. A diagrammatic summary was shown in Fig. 1. In general, this field is not fully explored with too much emphasis on only several star SNPs. Majority of the common SNPs in microRNA regions are not studied, while it is only at the starting phase for the studies on isomiRs and microRNA somatic mutations. Considering the important role of microRNAs in gene regulation and their significance in human diseases, further studies are in need. This review also provided some hint for future studies in this field. For example, researches in the future should take into consideration whether there is age, gender, race/ethnic, tissue/cancer type or other relative factor(s) specific role for a given variation. Knowledge obtained from these studies will surely improve the understanding of microRNAs in human diseases and aid in further development of precision medicine.
Fig. 1

Diagrammatic summary of single nucleotide variations (SNVs) in microRNAs. SNVs can be classified into three categories: single nucleotide polymorphisms (SNPs), somatic mutations (somamiRs) and isomiRs. SNPs in pre-miRs, mature miRs or within the seed sequences may (1) influence the biogenesis/maturation of the mature miRs and cause change in 5p/3p ratios, (2) strengthen or reduce the binding affinity for the mature miRs to bind to their targets or (3) change miRs' targetome through loss or gain of binding sites. SomamiRs, which are somatically mutated microRNAs, could be considered as a combination of all possible SNPs. IsomiRs are bona fide microRNA variants or isoforms that result from either addition, depletion/trimming or microRNA editing and usually show race, gender, population, and disease subtype dependencies. All of these SNVs modulate the intracellular microRNA-mRNA interaction network.

Diagrammatic summary of single nucleotide variations (SNVs) in microRNAs. SNVs can be classified into three categories: single nucleotide polymorphisms (SNPs), somatic mutations (somamiRs) and isomiRs. SNPs in pre-miRs, mature miRs or within the seed sequences may (1) influence the biogenesis/maturation of the mature miRs and cause change in 5p/3p ratios, (2) strengthen or reduce the binding affinity for the mature miRs to bind to their targets or (3) change miRs' targetome through loss or gain of binding sites. SomamiRs, which are somatically mutated microRNAs, could be considered as a combination of all possible SNPs. IsomiRs are bona fide microRNA variants or isoforms that result from either addition, depletion/trimming or microRNA editing and usually show race, gender, population, and disease subtype dependencies. All of these SNVs modulate the intracellular microRNA-mRNA interaction network.

Declaration of competing interest

There is no competing interest.
  80 in total

1.  New class of microRNA targets containing simultaneous 5'-UTR and 3'-UTR interaction sites.

Authors:  Inhan Lee; Subramanian S Ajay; Jong In Yook; Hyun Sil Kim; Su Hyung Hong; Nam Hee Kim; Saravana M Dhanasekaran; Arul M Chinnaiyan; Brian D Athey
Journal:  Genome Res       Date:  2009-03-31       Impact factor: 9.043

Review 2.  IsomiRs--the overlooked repertoire in the dynamic microRNAome.

Authors:  Corine T Neilsen; Gregory J Goodall; Cameron P Bracken
Journal:  Trends Genet       Date:  2012-08-08       Impact factor: 11.639

3.  Association between the pre-miR-196a2 rs11614913 polymorphism and gastric cancer susceptibility in a Chinese population.

Authors:  M Li; R J Li; H Bai; P Xiao; G J Liu; Y W Guo; J Z Mei
Journal:  Genet Mol Res       Date:  2016-04-28

4.  Association Between miR-149 Gene rs2292832 Polymorphism and Risk of Gastric Cancer.

Authors:  Lili Zhang; Qingna Liu; Fan Wang
Journal:  Arch Med Res       Date:  2018-09-28       Impact factor: 2.235

Review 5.  The interplay between transcription factors and microRNAs in genome-scale regulatory networks.

Authors:  Natalia J Martinez; Albertha J M Walhout
Journal:  Bioessays       Date:  2009-04       Impact factor: 4.345

6.  The Association Between Three Genetic Variants in MicroRNAs (Rs11614913, Rs2910164, Rs3746444) and Prostate Cancer Risk.

Authors:  Yuanyuan Mi; Kewei Ren; Jiangang Zou; Yu Bai; Lifeng Zhang; Li Zuo; Atsushi Okada; Takahiro Yasui
Journal:  Cell Physiol Biochem       Date:  2018-07-12

7.  MicroRNAs associated with metastatic prostate cancer.

Authors:  Akira Watahiki; Yuwei Wang; James Morris; Kristopher Dennis; Helena M O'Dwyer; Martin Gleave; Peter W Gout; Yuzhuo Wang
Journal:  PLoS One       Date:  2011-09-30       Impact factor: 3.240

8.  Beyond the one-locus-one-miRNA paradigm: microRNA isoforms enable deeper insights into breast cancer heterogeneity.

Authors:  Aristeidis G Telonis; Phillipe Loher; Yi Jing; Eric Londin; Isidore Rigoutsos
Journal:  Nucleic Acids Res       Date:  2015-09-22       Impact factor: 16.971

9.  Tumor classification and biomarker discovery based on the 5'isomiR expression level.

Authors:  Shengqin Wang; Zhihong Zheng; Peichao Chen; Mingjiang Wu
Journal:  BMC Cancer       Date:  2019-02-07       Impact factor: 4.430

10.  High-resolution analysis of the human retina miRNome reveals isomiR variations and novel microRNAs.

Authors:  Marianthi Karali; Maria Persico; Margherita Mutarelli; Annamaria Carissimo; Mariateresa Pizzo; Veer Singh Marwah; Concetta Ambrosio; Michele Pinelli; Diego Carrella; Stefano Ferrari; Diego Ponzin; Vincenzo Nigro; Diego di Bernardo; Sandro Banfi
Journal:  Nucleic Acids Res       Date:  2016-01-26       Impact factor: 16.971

View more
  6 in total

1.  AM22, a novel synthetic microRNA, inhibits the proliferation of colorectal cancer cells by targeting core binding factor subunit β (CBFB).

Authors:  Fanyi Meng; Jiawei Li; Yajing Qiu; Haiyang Zhang; Hongjian Zhang; Weipeng Wang
Journal:  Invest New Drugs       Date:  2022-01-05       Impact factor: 3.651

2.  IsoMiRmap-fast, deterministic, and exhaustive mining of isomiRs from short RNA-seq datasets.

Authors:  Phillipe Loher; Nestoras Karathanasis; Eric Londin; Paul Bray; Venetia Pliatsika; Aristeidis G Telonis; Isidore Rigoutsos
Journal:  Bioinformatics       Date:  2021-01-20       Impact factor: 6.937

Review 3.  SNPs in miRNAs and Target Sequences: Role in Cancer and Diabetes.

Authors:  Yogita Chhichholiya; Aman Kumar Suryan; Prabhat Suman; Anjana Munshi; Sandeep Singh
Journal:  Front Genet       Date:  2021-12-01       Impact factor: 4.599

Review 4.  An Insight into miR-1290: An Oncogenic miRNA with Diagnostic Potential.

Authors:  Małgorzata Guz; Witold Jeleniewicz; Marek Cybulski
Journal:  Int J Mol Sci       Date:  2022-01-22       Impact factor: 5.923

5.  Mutational Signature and Integrative Genomic Analysis of Human Papillomavirus-Associated Penile Squamous Cell Carcinomas from Latin American Patients.

Authors:  Luisa Matos Canto; Jenilson Mota da Silva; Patrícia Valèria Castelo-Branco; Ingrid Monteiro da Silva; Leudivan Nogueira; Carlos Eduardo Fonseca-Alves; André Khayat; Alexander Birbrair; Silma Regina Pereira
Journal:  Cancers (Basel)       Date:  2022-07-20       Impact factor: 6.575

6.  Circulating miR-19a-3p and miR-19b-3p characterize the human aging process and their isomiRs associate with healthy status at extreme ages.

Authors:  Cristina Morsiani; Lucia Terlecki-Zaniewicz; Susanna Skalicky; Maria Giulia Bacalini; Salvatore Collura; Maria Conte; Federica Sevini; Paolo Garagnani; Stefano Salvioli; Matthias Hackl; Johannes Grillari; Claudio Franceschi; Miriam Capri
Journal:  Aging Cell       Date:  2021-06-23       Impact factor: 9.304

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.