| Literature DB >> 25333330 |
Xinmei Wang1, Mingyue Li2, Yingchun Yin1, Liang Li3, Yuqian Tao2, Dengguo Chen4, Jianzhao Li1, Hongmei Han1, Zhenbo Hou1, Baohua Zhang1, Xinyun Wang1, Yu Ding2, Haiyan Cui1, Hengming Zhang1.
Abstract
Breast cancer (BC) is a leading cause of cancer-related mortality in females and is recognized as a molecularly heterogeneous disease. Previous studies have suggested that alternative messenger RNA (mRNA) processing, particularly alternative polyadenylation [poly(A)] (APA), can be a powerful molecular biomarker with prognostic potential. Therefore, in the present study, we profiled APA sites in the luminal B subtype of BC by sequencing APA sites (SAPAS) method, in order to assess the relation of these APA site-switching events to the recognized molecular subtypes of BC, and to discover novel candidate genes and pathways in BC. Through comprehensive analysis, the trend of APA site-switching events in the 3' untranslated regions (3'UTRs) in the luminal B subtype of BC were found to be the same as that in MCF7 cell lines. Among the genes involved in the events, a significantly greater number of genes was found with shortened 3'UTRs in the samples, which were samples of primary cancer with relatively low proliferation. These findings may provide novel information for the clinical diagnosis and prognosis on a molecular level. Several potential biomarkers with significantly differential tandem 3'UTRs and expression were found and validated. The related biological progresses and pathways involved were partly confirmed by other studies. In conclusion, this study provides new insight into the diagnosis and prognosis of BC from the APA site profile aspect.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25333330 PMCID: PMC4249744 DOI: 10.3892/ijmm.2014.1973
Source DB: PubMed Journal: Int J Mol Med ISSN: 1107-3756 Impact factor: 4.101
Figure 1Histological analysis showing the type of breast cancer in one Chinese patient with the luminal B subtype; ×200 magnification.
Clinical and immunohistochemical characteristics of the 9 patients with luminal B type BC.
| Case | Age (years) | ER | PR | c-erbB-2 | Ki-67 (%) | Stage |
|---|---|---|---|---|---|---|
| Case 0 | 29 | +++ | + | ++ | 15 | II |
| Case 1 | 36 | + | + | +++ | 60 | II–III |
| Case 2 | 58 | + | − | ++ | 40 | III |
| Case 3 | 50 | ++ | + | ++ | 70 | I |
| Case 4 | 30 | + | + | +++ | 30 | III |
| Case 5 | 44 | ++ | ++ | 0 | 55 | II–III |
| Case 6 | 46 | ++ | +++ | + | 20 | II–III |
| Case 7 | 41 | + | + | +++ | 80 | III |
| Case 8 | 47 | + | +++ | +++ | 25 | III |
Samples for SAPAS;
Samples for RT-qPCR validation.
BC, breast cancer; ER, estrogen receptor; PR, progesterone receptor; SAPAS, sequencing alternative polyadenylation sites.
Primers used in RT-qPCR.
| Genes | Forward primer | Reverse primer |
|---|---|---|
| RT-qPCR primers for genes with switched APA sites | ||
| ATCTACTTGCTTAAATTGTGGGC | GGTTGACATTTTCCATAACAGGT | |
| GCCAGTCTCATTTTCATCTTCTT | ATGCTTTATTTCATTTTTTTCACAA | |
| ACATAAAGCAAGTGAGCGACC | CCTCTACCCCTGGAACGAC | |
| CTTTCGGGGGAGAGGGTA | CAGGCTGGACACAACACACAT | |
| TTTTGGGTAAAGGAGAGTTGAGC | AAGGGAAAGGAAGTGGAGGTAG | |
| ACTACCACAAGAACCAACACTGAG | GGGTCACATCCAGCTAGTACATTT | |
| ACTCAGTGGGGCTGGCAT | AAGGTTTCTGTTGGTCAAGATTTA | |
| GCCCCCTCTTTGTAGCTCCT | CAGCAACACTTTGACTTTTCCTCT | |
| TGTGTTTGGACTTGGGGTT | AGTTAAATGAAGGGACTGGCT | |
| CAACCACCACCAATGCACA | TAGTCAAGGGATAACCAGGTAAGA | |
| TGTGGATGTGATGTGAGCAGG | GACAAAGTAAAAAAGACGGACGG | |
| GTAGTCGCCTCCAATAGCCAT | GGTTAGTAGGGTTAGGGGTCTGAA | |
| RT-qPCR primers for differentially expressed genes | ||
| CCTTCATATCCCCTCAGCAGA | GCAGCATTGTATCCATAAACCA | |
| CCTTGGGGACAGTAGTGGTTG | GTGATGTTTCCGTGGGGCT | |
| CATCGTTGAGGAGTGCTGTTTC | ACTGCTTCCAGGTGTCATATTG | |
| GATGCGACCCCAGTCATAAAC | TTCCTCTTCTTCCTCCTTCTCT | |
| GCATCTACACCGACAACTCCA | TTGTTCTCCTCCGCCTCTG | |
S, primer for shortened region of 3′UTRs; L, primer for lengthened region of 3′UTRs; APA, alternative polyadenylation.
Summary statistics of SAPAS data from Illumina GAIIx sequencing.
| s1 | s2 | |
|---|---|---|
| Raw reads | 27,392,854 | 26,620,195 |
| Clean reads, n (%) | 27,277,251 (99.6) | 26,510,992 (99.6) |
| Mapped to genome, n (%) | 25,188,415 (92.0) | 24,492,603 (92.0) |
| Uniquely mapped to genome, n (%) | 18,928,605 (69.1) | 18,042,088 (67.8) |
| Mapped to nuclear genome, n (%) | 16,684,903 (60.9) | 15,181,455 (57.0) |
| Passed internal priming filter, n (%) | 11,452,320 (41.8) | 10,866,946 (40.8) |
| Genes sampled by reads | 18,448 | 18,970 |
| Poly(A) sites | 206,585 | 263,911 |
| Known poly(A) sites sampled | 28,651 | 29,338 |
| Putative novel poly(A) sites | 177,934 | 234,573 |
| Genes sampled by poly(A) sites | 16,880 | 17,432 |
SAPAS, sequencing alternative polyadenylation [poly(A)] sites.
Figure 2The characteristics of the sequencing alternative polyadenylation (polyA) sites (SAPAS) data. Genomic locations of reads that were uniquely mapped to the nuclear genome after internal priming filtering. Inter, Intergenic; non, non-coding gene; <1 knt, <1 knt downstream; Tian, Tian polyA DB; UCSC, UCSC TTS; CDS, coding DNA sequence.
Figure 3Characteristics of the polyadenylation [poly(A)] site. (A) Distribution of numbers of poly(A) sites per gene. (B) Genomic locations of the poly(A) sites in all genes. Tian, Tian polyA DB; UCSC, UCSC TTS; CDS, coding DNA sequence.
Figure 4Alternative polyadenylation [poly(A)] (APA), site switching and gene expression levels of paracancerous (normal) and carcinoma tissues. Tandem APA sites switch index (TSI) is plotted against the logarithm of the expression level ratios between the carcinoma and paracancerous tissues. The x-axis denotes TSI, a larger positive value indicates that longer tandem untranslated regions (UTRs) are prone to be used in the carcinoma tissues. Genes with significant switching to longer (blue) or shorter (red) tandem UTRs in carcinoma tissues (FDR <0.01) are colored. The y-axis denotes the logarithm of the expression level of genes from the carcinoma tissues relative to the paracancerous tissues.
Enrichment of genes with tandem 3′UTR involved in various important GO and KEGG pathways.
| GO category | ID | Name | Counts | Fold | P-value |
|---|---|---|---|---|---|
| Shortened genes (n=97) | |||||
| KEGG_PATHWAY | hsa03040 | Spliceosome | 4 | 6.46 | 2.06E-02 |
| GOTERM_BP_FAT | GO:0016055 | Wnt receptor signaling pathway | 4 | 5.98 | 2.82E-02 |
| Lengthened genes (n=56) | |||||
| KEGG_PATHWAY | hsa04510 | Focal adhesion | 4 | 5.95 | 2.33E-02 |
| KEGG_PATHWAY | hsa05215 | Prostate cancer | 3 | 10.08 | 3.10E-02 |
| KEGG_PATHWAY | hsa04630 | Jak-STAT signaling pathway | 3 | 5.79 | 8.38E-02 |
3′UTR, 3′ untranslated region; GO, gene ontology.
Eleven genes enriched in GO terms associated with the negative regulation of macromolecule metabolic process.
| ucscID | Gene symbol | Description | Pearson’s r |
|---|---|---|---|
| uc010wbb.1 | PHD finger protein 12 isoform 1 | −0.18 | |
| uc011ebr.1 | Hairy/enhancer-of-split related with YRPW motif | −0.36 | |
| uc001aqk.1 | Catenin, β interacting protein 1 | −0.21 | |
| uc010ikt.2 | α-synuclein isoform NACP112 | −0.38 | |
| uc003anb.2 | Tissue inhibitor of metalloproteinase 3 | −0.12 | |
| uc003geu.1 | MAD4 | −0.19 | |
| uc001aja.3 | v-ski sarcoma viral oncogene homolog | −0.17 | |
| uc003oql.2 | Forkhead box P4 isoform 1 | −0.23 | |
| uc003bqf.2 | Basic helix-loop-helix family, member e40 | −0.25 | |
| uc002vyg.2 | Ankyrin repeat and SOCS box-containing protein | −0.12 | |
| uc003ccx.3 | Thyroid hormone receptor, β | −0.24 |
GO, gene ontology.
Four genes enriched in GO terms associated with focal adhesion.
| ucscID | Gene symbol | Description | Pearson’s r |
|---|---|---|---|
| uc002lit.1 | B-cell lymphoma protein 2α isoform | 0.21 | |
| uc003ung.1 | α2 type I collagen precursor | 0.18 | |
| uc011cir.1 | Platelet-derived growth factor C precursor | 0.16 | |
| uc003jva.2 | Phosphoinositide-3-kinase, regulatory subunit 1 | 0.14 |
GO, gene ontology.
Six tandem 3′UTR genes validated by qRT-PCR.
| ucscID | Gene symbol | Description | Pearson_r |
|---|---|---|---|
| uc003ung.1 | COL1A2 | α2 type I collagen precursor | 0.18 |
| uc010deh.2 | DDX5 | DEAD (Asp-Glu-Ala-Asp) box polypeptide 5 | −0.14 |
| uc002svt.2 | SNRNP200 | Activating signal cointegrator 1 complex subunit | −0.16 |
| uc001aqk.1 | CTNNBIP1 | Catenin, β interacting protein 1 | −0.21 |
| uc001xew.3 | PPM1A | Protein phosphatase 1A isoform 3 | −0.13 |
| uc003fms.2 | DVL3 | Dishevelled 3 | −0.21 |
3′UTR, 3′ untranslated region.
Figure 5Relative changes in expression level of tandem 3′ untranslated regions (3′UTR) genes in 9 patients detected by RT-qPCR. (A) COL1A2. (B) DDX5. (C) SNRNP200. (D) CTNNBIP1. (E) PPM1A. (F) DVL3. cUTR/eUTR, constitutive UTR/extended UTR, the expression ratios of the shortened region to the lengthened region. *P<0.05.
Figure 6Relative changes in expression level of differentially expressed genes in 9 patients detected by RT-qPCR. (A) MGP. (B) TGFBR3. (C) IGF2. (D) RCAN1. (E) CCND1. cUTR/eUTR, constitutive UTR/extended UTR, the expression ratios of the shortened region to the lengthened region. *P<0.05.