| Literature DB >> 27756200 |
Elena Grassi1, Elisa Mariella2, Antonio Lembo2, Ivan Molineris2, Paolo Provero2,3.
Abstract
BACKGROUND: Post-transcriptional regulation is a complex mechanism that plays a central role in defining multiple cellular identities starting from a common genome. Modifications in the length of 3'UTRs have been found to play an important role in this context, since alternative 3' UTRs could lead to differences for example in regulation by microRNAs and cellular localization of the transcripts thus altering their fate.Entities:
Keywords: 3’ UTR; Bioconductor; Polyadenylation; RNA-sequencing; Software
Mesh:
Substances:
Year: 2016 PMID: 27756200 PMCID: PMC5069797 DOI: 10.1186/s12859-016-1254-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1APASdb reported sites across tissues. a - mean (+/- sem) of the fraction of reads assigned to the two sites with more reads for every gene with at least two overlapping sites in APASdb across different tissues. For “alltogether” we put together sites annotations for all the 22 human normal tissues, normalizing reads with respect to the total number of reads found in that tissue and considering the sites supported by more than 2 normalized reads. b - percentages of sites found in different tissues that are found in other N tissues: on average 29 % of sites are found in all tissues. 50 % of the sites are found in at least 17 tissues
Fig. 2Pipeline. a - how we define gene structures starting from different transcripts. We obtain “melted genes” collapsing together the structures of all the transcripts assigned to a gene. aPA: alternative polyadenylation site. cPa: canonical polyadenylation site. Thicker blue rectangles represent coding exons, while the others depict untranslated regions. b - an example of how roar works with the single APA annotation: in sample #1 the shorter isoform is more expressed than the longer one with respect to sample #2. Blue wavy shapes represent aligned mRNAseq reads. c - how transcript fragments are defined in multiple APA analyses to efficiently count reads for all the possible APA choices. aPA1-2-3 are three different APA sites reported for this sample gene
Fig. 3Example of read density and corresponding m/M values. a- Sashimi plot produced with IGV of two alignments for representative samples for testes and brain over the PRE and POST portions that we consider for CAMSAP1, one of the genes with the strongest shortening signal in testes versus brain. Read density is clearly lower in testes on the POST portion. CAMSAP1 is on the negative strand and the PRE fragment overlaps with the coding portion and the beginning of the 3’UTR of its last exon. b- Dot plot representing the m/M values obtained for the two testes and six brain samples. The larger m/M values for testes reflect the preferential expression of the short isoform in that tissue
Fig. 4Venn diagrams of overlaps between roar results and a standard approach and between two different annotations for roar. a - MCF7 vs MCF10: overlap between shortened genes for roar and [13] b - MCF7 vs MCF10: overlap between shortened genes for roar using PolyA_DB or APASdb