| Literature DB >> 32542110 |
Tingming Liang1, Leng Han2, Li Guo3.
Abstract
Classical microRNA (miRNA) has been so far believed as a single sequence, but it indeed contains multiple miRNA isoforms (isomiR) with various sequences and expression patterns. It is not clear whether these diverse isomiRs have potential relationships and whether they contribute to miRNA:mRNA interactions. Here, we aimed to reveal the potential evolutionary and functional relationships of multiple isomiRs based on let-7 and miR-10 gene families that are prone to clustering together on chromosomes. Multiple isomiRs within gene families showed similar functions to their canonical miRNAs, indicating selection of the predominant sequence. IsomiRs containing novel seed regions showed increased/decreased biological function depending on whether they had more/less specific target mRNAs than their annotated seed. Few gene ontology(GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were shared among the target genes of the annotated seeds and the novel seeds. Various let-7 isomiRs with novel seed regions may cause opposing drug responses despite the fact that they are generated from the same miRNA locus and have highly similar sequences. IsomiRs, especially the dominant isomiRs with shifted seeds, may disturb the coding-non-coding RNA regulatory network. These findings provide insight into the multiple isomiRs and isomiR-mediated control of gene expression in the pathogenesis of cancer.Entities:
Keywords: ACC, adrenocortical carcinoma; BLCA, bladder urothelial carcinoma; BRCA, breast invasive carcinoma; CESC, cervical squamous cell carcinoma and endocervical adenocarcinoma; CHOL, cholangiocarcinoma; COAD, colon adenocarcinoma; ESCA, esophageal carcinoma; Function; GBM, glioblastoma multiforme; HNSC, head and neck squamous cell carcinoma; IsomiR; KICH, kidney chromophobe; KIRC, kidney renal clear cell carcinoma; KIRP, kidney renal papillary cell carcinoma; LAML, acute myeloid leukemia; LGG, brain Lower grade glioma; LIHC, liver hepatocellular carcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; Let-7; MESO, mesothelioma; MicroRNA (miRNA); Network; OV, ovarian serous cystadenocarcinoma; PAAD, pancreatic adenocarcinoma; PCPG, pheochromocytoma and paraganglioma; PRAD, prostate adenocarcinoma; READ, rectum adenocarcinoma; SARC, sarcoma; SKCM, skin cutaneous melanoma; STAD, stomach adenocarcinoma; TGCT, testicular germ cell tumors; THCA, thyroid carcinoma; THYM, thymoma; TSG, tumor suppressor gene; UCEC, uterine corpus endometrial carcinoma; UCS, uterine carcinosarcoma; UVM, uveal melanoma; miR-10
Year: 2020 PMID: 32542110 PMCID: PMC7280754 DOI: 10.1016/j.csbj.2020.05.001
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1IsomiR repertoires of the let-7 gene family and related family miR-10 in 16 cancers. (A) Expression patterns of selected abnormal isomiRs from the two gene families across 16 cancer types (isomiRs with an RPM value not less than 50 and a baseMean value not less than 500 in at least one cancer type were selected). The log2FC distribution was estimated based on the original log2FC values. The numbers of deregulated isomiRs are also presented (number (sig.)). (B) Correlations among related transcription factors (TF) and miRNAs from the two gene families in BRCA patients (n = 1,207). (C) Correlations among related transcription factors (TF) and miRNA from the two gene families in LUAD patients (n =567).
Fig. 2Correlations between internal isomiRs and external miRNAs, and phylogenetic relationships among isomiRs. (A) Length distributions of canonical miRNAs and their isomiRs from the two gene families, and length distributions of human isomiRs compared with mature miRNAs across animal species. (B) Sequence features of isomiRs and canonical miRNAs from the two gene families across animal species (5p and 3p were estimated). (C) Phylogenetic networks and trees of the most highly expressed isomiRs from 5p. IsomiRs with a RPM value ≥ 500 in any one sample were selected.
Fig. 3Seed variation among isomiRs. (A) Seed variations (annotated seed ± 5 nucleotides) and their deregulated expression patterns based on each miRNA locus. For multiple miRNAs, miRNAs are selected only if they had more isomiR types to avoid repeated estimation. Deregulated expression is estimated based on all relevant cancer types. If an isomiR shows decreased expression level, it will be removed from this analysis. miRNA loci are eliminated if no variations in the seed sequence are detected. (B) Detailed screening of deregulated seed sequences from the two miRNA gene families (a seed was is included if the corresponding isomiR is abnormally expressed in at least one cancer type), and distributions of predicted target mRNAs of annotated and novel seeds.
Fig. 4Target validation and analysis of function enrichment. (A) Correlation analysis of isomiRs and their targets. Interactions with a negative correlation (r < 0, FDR < 0.05) are selected for further analysis. (B) Distribution of potential differences between shifted seed sequences and annotated seeds as determined by hypergeometric distribution test, and Venn diagram showing detailed distributions among these novel seeds. (C) Bubble figure showing differences in enriched GO terms between annotated and novel seeds (FDR < 0.05). Biological process (1–14): 1, activation of phospholipase C activity; 2, axon guidance; 3, cell adhesion; 4, cell-cell signaling; 5, chemical synaptic transmission; 6, collagen catabolic process; 7, extracellular matrix organization; 8, homophilic cell adhesion via plasma membrane adhesion molecules; 9, nervous system development; 10, outflow tract morphogenesis; 11, platelet degranulation; 12, positive regulation of angiogenesis; 13, potassium ion transport; 14, smooth muscle contraction. Cellular component (15–37): 15, apical plasma membrane; 16, basolateral plasma membrane; 17, cell junction; 18, cell surface; 19, collagen trimer; 20, dendrite; 21, endocytic vesicle membrane; 22, endoplasmic reticulum lumen; 23, external side of plasma membrane; 24, extracellular region; 25, extracellular space; 26, integral component of membrane; 27, integral component of plasma membrane; 28, neuron projection; 29, plasma membrane; 30, platelet alpha granule lumen; 31, postsynaptic density; 32, postsynaptic membrane; 33, proteinaceous extracellular matrix; 34, receptor complex; 35, synapse; 36, voltage-gated potassium channel complex; 37, Z-disc. Molecular function (38–43): 38, calcium ion binding; 39, cholesterol binding; 40, extracellular matrix structural constituent; 41, heparin binding; 42, receptor activity; 43, transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding. (D) Bubble figure showing differences in enriched KEGG pathways between annotated and novel seeds (Bonferroni, p < 0.05 and FDR < 0.2). (E) Survival analysis based on different isomiR types.
Fig. 5Rewired regulatory networks of novel seeds compared with annotated seeds. (A) Distribution of target genes with functional annotations between novel and annotated seeds, including actionable, CGC, core essential, oncogene, and TSG genes. (B) Rewired isomiR-mRNA interaction network based on target genes associated with cancer hallmarks among seeds from the let-7-5p locus. These genes are further classified according to their functional annotations, including actionable, CGC, core essential, oncogene, and TSG genes. Frequencies of genes associated with cancer hallmarks are also presented for specific targets of each involved seed. (C) Drug responses for six isomiRs (two seeds) of the let-7a-5p locus. A total of 48 drugs are shown, and a significant difference in drug response is observed for each in at least one cancer type. Data for six cancer types are presented (at least five points with significant difference), and other data can be found in Supplementary Fig. S5. Asterisks (*) indicate drugs for which a significant difference was observed between the abnormal expression and normal expression groups (DR > 0.1 or DR < − 0.1 and p < 0.05). Among the six isomiRs, only four were detected in CHOL. “DR” indicates a difference in the values of the shifted seed group and the annotated seed group. “Freq” indicates frequency. “Sig/total” indicates the ratio of the number of significant results and the total number of isomiRs (pie distribution, left) or the total number of drugs (pie distribution, right). The location annotation provides a detailed locus for each isomiR. For example, 51–73 indicates the location hg38:chr22:46112751–46112773:+.