| Literature DB >> 26389959 |
Wei-Ting Kuo1,2, Ming-Wei Su3,4, Yungling Leo Lee5,6, Chien-Hsiun Chen7, Chew-Wun Wu8, Wen-Liang Fang9, Kuo-Hung Huang10, Wen-Chang Lin11,12.
Abstract
MicroRNAs (miRNAs) play important roles in cellular functions and developmental processes. They are also implicated in oncogenesis mechanisms and could serve as potential cancer biomarkers. Using high-throughput miRNA sequencing information, expression of both the 5p-arm and 3p-arm mature miRNAs were demonstrated and generated from the single miRNA hairpin precursor. However, current miRNA annotations lack comprehensive 5p-arm/3p-arm feature annotations. Among known human mature miRNAs, only half of them are annotated with arm features. This generated ambiguous results in many miRNA-Sequencing (miRNA-Seq) studies. In this report, we have interrogated the TCGA (the Cancer Genome Atlas) miRNA expression datasets with an improved, fully annotated human 5p-arm and 3p-arm miRNA reference list. By utilizing this comprehensive miRNA arm-feature annotations, enhanced determinations and clear annotations were achieved for the miRNA isoforms (isomiRs) recognized from the sequencing reads. In the gastric cancer (STAD) dataset, as an example, 32 5p-arm/3p-arm OPEN ACCESS J. Clin. Med. 2015, 4 1799 specific miRNAs were found to be down-regulated and 24 5p-arm/3p-arm specific miRNAs were found to be up-regulated. We have further extended miRNA biomarker discoveries to additional TCGA miRNA-Seq datasets and provided extensive expression information on 5p-arm/3p-arm miRNAs across multiple cancer types. Our results identified several miRNAs that could be potential common biomarkers for human cancers.Entities:
Keywords: 3p-arm; 5p-arm; TCGA; bioinformatics; biomarker; microRNA
Year: 2015 PMID: 26389959 PMCID: PMC4600160 DOI: 10.3390/jcm4091798
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Figure 1Next Generation Sequencing (NGS) reads alignments of hsa-let-7a-1 miRNA isoforms (isomiRs). A small RNA NGS library from a gastric cancer cell line (AGS cells) was prepared and sequenced with Illumina Solexa platform. The NGS reads were aligned to the hsa-let-7a-1 miRNA genomic locus using Bowtie mapping program following adaptor trimming. We allowed no mismatch at the mapping procedure using standard Bowtie parameter. We trimmed the last 3′ end mismatch one by one until the mapping perfect-match reads were at least 18 nucleotides in length [25]. Here, the hsa-let-7a-1 miRNA precursor sequences and genomic coordinates are displayed on the top section. The NGS reads are aligned and their sequences, read counts and rpm (reads per million) values are displayed. The miRBase annotated hsa-let-7a-1-5p (MIMAT0000062) and hsa-let-7a-1-3p (MIMAT0004481) are marked with 5p and 3p in red, respectively.
Figure 2Expression level distribution of miRNA isoform (isomiR) reads. The read counts of each isomiR were tabulated and ranked by their expression percentage within each miRNA gene loci, i.e., read counts of each isomiR divided by the total read counts of all isomiRs in the miRNA gene loci. The top ten expressed isomiRs percentages are displayed.
Figure 3Distribution of the top ten expressed miRNAs from the TCGA (the Cancer Genome Atlas) normal gastric tissue dataset. miRNA expression data of 38 normal samples in TCGA stomach adenocarcinoma (STAD) libraries were re-analyzed using the new arm feature list. (a) The rpm (reads per million) values of each miRNAs were tabulated by combining the 5p-arm and 3p-arm together to represent the miRNA loci expression; (b) 5p-arm and 3p-arm expression levels were tabulated separately and displayed. miR-143 is the most expressed miRNA in the TCGA clinical normal samples.
Figure 4Distribution of highly expressed miRNAs in 261 gastric cancer samples from TCGA (The Cancer Genome Atlas) datasets. miRNA expression data of TCGA stomach adenocarcinoma (STAD) were retrieved and re-analyzed using the comprehensive arm feature annotated miRNA list. (a) The rpm values of each miRNAs were tabulated by combining the 5p-arm and 3p-arm to represent the miRNA loci expression; (b) 5p-arm and 3p-arm expression levels were tabulated separately and displayed. miR-21 is the most expressed miRNAs in the TCGA clinical cancer samples.
Figure 5Volcano plot of combined miRNA expression and separate 5p-arm/3p-arm miRNA expression in TCGA (The Cancer Genome Atlas) gastric cancer tissues. miRNA expression data of TCGA stomach adenocarcinoma (STAD) were retrieved and re-analyzed using the comprehensive arm feature annotated miRNA list. The miRNA expression information of gastric cancer tissues is illustrated here by calculating the mean expression level from TCGA samples. Following analysis of variance (ANOVA) in the Partek software package, the volcano plot is displayed for selecting differentially expressed miRNA genes. (a) Precursor miRNAs gene loci expression by combining the 5p-arm and 3p-arm miRNAs together for analysis; (b) 5p-arm and 3p-arm miRNAs tabulated separately for their expression and analyzed.
Dys-regulated miRNAs in TCGA (the Cancer Genome Atlas) STAD gastric cancer tissues.
| TCGA STAD (Stomach Adenocarcinoma) | |||
|---|---|---|---|
| Overall miRNA Precursors | Separate 5p-arm and 3p-arm miRNAs | ||
| Down regulated | Up regulated | Down regulated | Up regulated |
| hsa-miR-139 | hsa-miR-21 | hsa-miR-139-5p | hsa-miR-21-5p |
| hsa-miR-29c | hsa-miR-196a-1 | hsa-miR-139-3p | hsa-miR-196a-1-5p |
| hsa-miR-486 | hsa-miR-146b | hsa-miR-29c-3p | hsa-miR-146b-5p |
| hsa-miR-133b | hsa-miR-196b | hsa-miR-29c-5p | hsa-miR-146b-3p |
| hsa-miR-145 | hsa-miR-135b | hsa-miR-29b-2-5p | hsa-miR-196b-5p |
| hsa-miR-133a-1 | hsa-miR-183 | hsa-miR-486-5p | hsa-miR-141-5p |
| hsa-miR-204 | hsa-miR-501 | hsa-miR-133b-3p | hsa-miR-135b-5p |
| hsa-miR-1-2 | hsa-miR-18a | hsa-miR-145-5p | hsa-miR-183-5p |
| hsa-miR-378a | hsa-miR-200a | hsa-miR-145-3p | hsa-miR-200a-5p |
| hsa-miR-30a | hsa-miR-141 | hsa-miR-133a-1-3p | hsa-miR-501-3p |
| hsa-miR-129-1 | hsa-miR-200b | hsa-miR-204-5p | hsa-miR-18a-5p |
| hsa-miR-129-2 | hsa-miR-194-2 | hsa-miR-378a-5p | hsa-miR-200b-3p |
| hsa-miR-378c | hsa-miR-194-1 | hsa-miR-1-2-3p | hsa-miR-194-2-5p |
| hsa-miR-195 | hsa-miR-182 | hsa-miR-195-3p | hsa-miR-194-1-5p |
| hsa-miR-144 | hsa-miR-200c | hsa-miR-30a-3p | hsa-miR-335-3p |
| hsa-miR-143 | hsa-miR-192 | hsa-miR-143-5p | hsa-miR-182-5p |
| hsa-miR-490 | hsa-miR-335 | hsa-miR-30a-5p | hsa-miR-200a-3p |
| hsa-miR-363 | hsa-miR-429 | hsa-miR-378a-3p | hsa-miR-200c-3p |
| hsa-miR-9-2 | hsa-miR-129-1-5p | hsa-miR-708-5p | |
| hsa-miR-9-1 | hsa-miR-129-2-5p | hsa-miR-192-5p | |
| hsa-miR-149 | hsa-miR-378c-5p | hsa-miR-141-3p | |
| hsa-miR-187 | hsa-miR-195-5p | hsa-miR-429-3p | |
| hsa-miR-30c-2-3p | hsa-miR-194-2-3p | ||
| hsa-miR-144-5p | hsa-miR-192-3p | ||
| hsa-miR-143-3p | |||
| hsa-miR-490-3p | |||
| hsa-miR-144-3p | |||
| hsa-miR-363-3p | |||
| hsa-miR-9-2-5p | |||
| hsa-miR-9-1-5p | |||
| hsa-miR-149-5p | |||
| hsa-miR-187-3p | |||
Figure 6Hierarchical clustering analysis on significantly dys-regulated miRNAs in 13 TCGA (the Cancer Genome Atlas) cancer types. 233 arm-specific miRNAs were expressed in all 13 cancer types and further selected for hierarchical clustering analysis to reveal their significance in each cancer type.