| Literature DB >> 34917903 |
Samuel Valentini1, Caterina Marchioretti1,2, Alessandra Bisio1, Annalisa Rossi1, Sara Zaccara1,3, Alessandro Romanel1, Alberto Inga1.
Abstract
Few studies have explored the association between SNPs and alterations in mRNA translation potential. We developed an approach to identify SNPs that can mark allele-specific protein expression levels and could represent sources of inter-individual variation in disease risk. Using MCF7 cells under different treatments, we performed polysomal profiling followed by RNA sequencing of total or polysome-associated mRNA fractions and designed a computational approach to identify SNPs showing a significant change in the allelic balance between total and polysomal mRNA fractions. We identified 147 SNPs, 39 of which located in UTRs. Allele-specific differences at the translation level were confirmed in transfected MCF7 cells by reporter assays. Exploiting breast cancer data from TCGA we identified UTR SNPs demonstrating distinct prognosis features and altering binding sites of RNA-binding proteins. Our approach produced a catalog of tranSNPs, a class of functional SNPs associated with allele-specific translation and potentially endowed with prognostic value for disease risk.Entities:
Keywords: Computational bioinformatics; Molecular mechanism of gene regulation; Transcriptomics
Year: 2021 PMID: 34917903 PMCID: PMC8666669 DOI: 10.1016/j.isci.2021.103531
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Identification of SNPs allelic imbalance across different RNA fractions
(A) Schematic representation of the approach developed to identify RNA fraction-specific SNP allelic imbalances. RNA-seq-based SNP allelic fraction variability is estimated both in total and polysomal RNA fractions. Then variability extended SNP allelic fractions are compared and only non-overlapping total versus polysomal imbalances are retained as tranSNPs. In the example, SNP2 satisfies the condition and is hence nominated as tranSNP. AF, allelic fraction; V, mean AF variability among replicates.
(B) Venn diagram showing private and shared tranSNPs identified across the three analyzed conditions.
(C) Allelic imbalance distribution of condition-associated tranSNPs is shown across the different conditions. Aggregate distribution is shown using boxplots, whereas single SNPs distribution is shown using a heatmap, where red intensity represents the level of imbalance. In the boxplot, the imbalance is shown as absolute log2 ratio of allelic fraction in polysomal RNA and allelic fraction in total RNA. In the heatmap, red intensity is proportional to this value; gray represents no value.
Summary table of MCF7 heterozygous SNPs
| MCF7 heterozygous SNPs | Analyzable SNPs | ||
|---|---|---|---|
| Number | 11,544 | 3,974 | 147 |
| Average Global MAF | 0.26 | 0.25 | 0.25 |
| Average EUR MAF | 0.28 | 0.28 | 0.27 |
| Overlapping genes | 5,493 | 2,532 | 139 |
| Intronic | 4,724 | 675 | 26 |
| Coding | 6,161 | 3,007 | 108 |
| UTR | 2,553 | 1,182 | 39 |
| LD blocks | 8,656 | 3,202 | 143 |
Summary table presenting the characteristics of tranSNPs together with same features measured from the set of the SNPs heterozygous in the MCF7 cell line initially considered, and the set of SNPs analyzable from the RNA-seq data by our approach. Average Global and EUR MAF (Minor Allele Frequency) were retrieved from 1,000 Genomes Project data. Details on how the LD blocks were defined are provided in the text.
Figure 2TranSNPs results in functionally distinct alleles
(A) MCF7 cells were transiently transfected with pGL4.13-based vectors containing BRI3BP 3′ UTR fragments differing for the indicated BRI3BP SNP allele, and the control pRLSV40 Renilla vector. After 24 h of transfection, cells were treated with Nutlin for 24 h before performing dual-luciferase assays. Firefly luciferase signals were normalized to Renilla to control for transfection efficiency and to relative Firefly mRNA levels to take into account differences in reporter's transcript levels. Individual values from independently transfected wells are plotted.
(B) Same as (A), except that the p21-5′ UTR was cloned in the low-expression pGL3-basic vector. ∗∗p value < 0.01; ∗∗∗∗p value < 0.0001, adjusted p value based on a two-way ANOVA with Sidak's multiple comparison test. Data are represented as mean ± SD.
Figure 3Prognostic significance of tranSNPs in breast cancer
(A) Progression-Free Interval analysis of BRI3BP-related tranSNP. Kaplan-Meyer curves along with summary statistics are reported.
(B–D) Examples of tranSNPs presenting prognostic significance. Kaplan-Meyer curves along with summary statistics are reported.
Figure 4Haplotype structure and allelic imbalance along the ATF6 gene and impact of UTR TranSNPs
(A) RNA-seq-based allelic fractions of ATF6 heterozygous SNPs are reported for both coding and 3′ UTR (in red) SNPs. On the top we show the distribution observed in the first biological replicate, and on the bottom we show the distribution observed in the second biological replicate.
(B) Significance of ATF6 3′ UTR SNPs allelic imbalance (red line) versus distribution of sequential random SNPs imbalances. On top using heterozygous SNPs data from the first biological replicate, and on the bottom using data from the second biological replicate.
(C) Dual-luciferase assays in MCF7 cells transiently transfected with reporter vectors containing ATF6 3′ UTR SNP alleles. Experiments were developed as described in Figure 2. ∗∗∗∗p value < 0.0001, adjusted p value based on a two-way ANOVA with Sidak's multiple comparison test. Data are represented as mean ± SD.
(D) RIP experiment probing the interaction of PABPC1 with the ATF6 transcript. Bars plot the average fold enrichment relative to the input sample. Individual average values from three biological replicates are also shown. Results obtained with an IgG control antibody are included. ∗p value < 0.05, two-tailed, unpaired t test. Data are represented as mean ± SD.
| REAGENT RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Anti-PABP | Abcam | Cat# bb6125 [10E10] |
| Normal mouse IgG | Merck Millipore | Cat# 12-371 |
| E. coli strain | laboratory stock | DH5alpha |
| TRIzol | Thermo Fisher Scientific | Cat# 15596026 |
| Nutlin | Cayman chemicals | Cat#10004372 |
| Doxorubicin | Sigma Aldrich | Cat#25316-40-9 |
| Xba I | New England Biolabs | Cat# R0145S |
| Nco I | New England Biolabs | Cat# R3193S |
| T4 DNA ligase | New England Biolabs | Cat# M0202S |
| Protein A Magnetic Beads | Thermo Fisher Scientific | Cat# 88845 |
| TruSeq RNA Library preparation kit v2 | Illumina | Cat#RS-122-2001 |
| Agilent RNA 6000 Nano kit | Agilent | Cat#5067-1511 |
| RevertAid RT Kit | Thermo Fisher Scientific | Cat# K1691 |
| Dual Luciferase Reporter assay system | Promega | Cat# E1910 |
| Sequencing of total and polysomal RNA from normal and treated MCF7 cells | This paper | Bioproject: PRJNA693005 |
| TCGA SNP 6.0 array calls | Genomic Data Commons | |
| TCGA clinical information | ||
| MCF7 heterozygous SNPs (array data) | GEO | |
| MCF7 heterozygous SNPs (wes data) | ||
| RBPDB | ||
| MCF7 | ICLC | N/A |
| BRI3BP_F | Metabion | aatctagaAGGTCAGCCGGCCGGGCGGGTCCAC |
| BRI3BP_R | Metabion | aatctagaAACTTGACTCAATCTGCCTTTATTA |
| ATF6_852F | Metabion | aatctagaTTCATCCCTCGATTCCCAGC |
| ATF6_852R | Metabion | aatctagaGCAACCCCCAAAAGGCAATC |
| ATF6_848F | Metabion | aatctagaGGACACAGCTTCATTAGAGTGTT |
| ATF6_848R | Metabion | aatctagaGGCTGTGAAAGCAAAAGTGGT |
| p21_UTR_ref_F | Metabion | CATGGGGTGGCTATTTTGTCCTTGGGCTGCCT |
| p21_UTR_ref_R | Metabion | CATGGGGCGCCTGAACAGAAGAAATCCCTGTGGT |
| p21_UTR_ALT_F | Metabion | CATGGGGTGGCTATTTTGTCCTTGGGCTGCCTGT |
| p21_UTR_ALT_R | Metabion | CATGGGGCGCCTGAACAGAAGAAACCCCTGTGGT |
| GAPDH_F | Metabion | TCCAAAATCAAGTGGGGCGA |
| GAPDH_R | Metabion | AGTAGAGGCAGGGATGATGT |
| ATF6_F | Metabion | CCCGTATTCTTCAGGGTGCT |
| ATF6_R | Metabion | TCACTCCCTGAGTTCCTGCT |
| ATF6_UTR_F | Metabion | GAACCTTCCTCCCCTGTGTG |
| ATF6_UTR_R | Metabion | CAGAGTGAAAGGGGGCATCA |
| ASEQ | ||
| Ensembl rest API | ||
| SHAPEIT | ||
| IMPUTE2 | ||
| Survival R package | ||
| TESS | ||
| VariantAnnotation R package | ||