Literature DB >> 25617413

3USS: a web server for detecting alternative 3'UTRs from RNA-seq experiments.

Loredana Le Pera1, Mariagiovanna Mazzapioda2, Anna Tramontano3.   

Abstract

UNLABELLED: Protein-coding genes with multiple alternative polyadenylation sites can generate mRNA 3'UTR sequences of different lengths, thereby causing the loss or gain of regulatory elements, which can affect stability, localization and translation efficiency. 3USS is a web-server developed with the aim of giving experimentalists the possibility to automatically identify alternative 3 ': UTRs (shorter or longer with respect to a reference transcriptome), an option that is not available in standard RNA-seq data analysis procedures. The tool reports as putative novel the 3 ': UTRs not annotated in available databases. Furthermore, if data from two related samples are uploaded, common and specific alternative 3 ': UTRs are identified and reported by the server.
AVAILABILITY AND IMPLEMENTATION: 3USS is freely available at http://www.biocomputing.it/3uss_server.
© The Author 2015. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25617413      PMCID: PMC4443675          DOI: 10.1093/bioinformatics/btv035

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

The 3′ untranslated regions (3′UTRs) of mRNAs play a crucial role in post-transcriptional regulation. In fact, they contain cis-regulatory elements, such as AU-rich elements (ARE), microRNA binding sites, etc. (Khabar, 2005; Bartel, 2009), which can affect transcript stability, degradation, subcellular localization and translation. Through alternative polyadenylation, a widespread mechanism in eukaryotic organisms, different isoforms of the same gene can acquire alternative (longer or shorter) 3′ untranslated regions (Elkon ). The phenomenon seems to be different in different tissues (Lianoglou ), during development (Ji ) and, recently, it has also been observed in cancer cells (Mayr and Bartel 2009). With the emergence of next-generation sequencing technologies, it is now possible to sequence entire transcriptomes in one experiment. In standard RNA-seq data analysis protocols the reconstructed transcripts are compared with an annotated Reference transcriptome. The most commonly used methods [e.g. Cuffcompare and Cuffmerge (Trapnell )] distinguish known from putative novel isoforms, but they rely on the comparison of the intronic structure of the transcripts, thereby do not take into account reads that might support the existence of alternative 3′UTRs and therefore the existence of putative novel transcripts with different 3′UTR is not reported to the user. The great interest for these important regulatory regions is leading to the development of more sophisticated RNA sequencing library preparation protocols (the more powerful will be the possibility to entirely sequence the untranslated region, the more accurate will the reconstruction of putative novel 3'UTRs be). At the same time, different computational tools to capture [Lu ; Wang ; github.com/vodkatad/roar (R-package)] and store (compbio.dundee.ac.uk/polyADB/, mosas.sysu.edu.cn/utr) the 3′ ends of polyadenylated transcripts have been recently developed. Nevertheless, the identification of 3′UTR alternative regions still remains a challenge. 3USS (3′ UTR Sequence Seeker) is the only available web-server developed to automatically identify the assembled transcripts with alternative 3′UTRs with respect to the ones in Reference transcriptomes and provides the user with their nucleotide sequence together with their genomic coordinates and other information, such as their difference in length and the corresponding gene. Furthermore, when data from two related experiments are provided, the reconstructed 3′UTRs of each biological sample are identified and compared.

2 Description

The server starts the analysis from a transcriptome assembly file obtained by standard computational methods for RNA-seq transcript reconstruction, such as Cufflinks (Trapnell ), Scripture (Guttman ) and others. Indeed, the 3USS procedure is independent from the transcriptome reconstruction method used to obtain the RNA-seq assembly file as long as the data have been compared with a Reference annotation [such as UCSC (Rhead ), NCBI (Pruitt ), Ensembl (Flicek ), Gencode (Harrow ), etc.] using Cuffcompare or Cuffmerge (see the Help page for full information). 3USS also permits to easily retrieve the genomic coordinates and sequences of transcripts already annotated in public databases.

2.1 Input

For identifying RNA-seq assembled transcripts with alternative 3′UTRs with respect to a Reference, the user needs to provide: the organism and related genome version; the Reference annotation transcriptome and one or two RNA-seq transcriptome assembly files (as above described). An example using public data from GEO database (Barrett ) is accessible from the 3USS input page.

2.2 Results

Each assembled transcript is processed by selecting the isoforms sharing the same intron chain of known protein-coding transcripts in the Reference. The reconstructed 3′UTRs are consequently identified as the regions located immediately after the stop-codon and can be directly compared, detecting possible differences in their 3′UTR lengths. The alternative 3′UTRs are then compared with already annotated ones in other databases [the latest available data from iGenomes (http://cufflinks.cbcb.umd.edu/igenomes.html) and Gencode (Harrow )] to identify putative novel (not yet annotated) 3′UTRs. The most informative output files that the 3USS server produces are: a fasta file with the alternative 3′UTR nucleotide sequences and other information in their headers (gene Id, coding strand, 3′UTR genomic coordinates and length); a list of the transcripts with their 3′UTR length difference and a list of the alternative not yet annotated 3′UTRs (putative novel) in other databases as well as the annotated ones. The user can also easily visualize the transcripts in a graphical environment through the UCSC Genome browser (Kent ; see examples in the Supplementary Material). If an already annotated transcriptome is uploaded as input, the 3′UTR sequences and genomic coordinates of all the protein-coding transcripts are returned. If the user submits input files derived from two different related RNA-seq experiments, the 3USS server compares the alternative 3′UTRs, reporting how they are distributed across the two biological samples (Fig. 1).
Fig. 1.

Some results in case of two RNA-seq experiments (see Help page and Suppl. mat. for full information). A Venn diagram illustrates how the transcripts with alternative 3′UTRs are distributed across the two experiments. An example of the graphical view of the assembled transcripts (in dark green) with specific 3′UTRs in different experiments is obtained through the links to the UCSC Genome Browser (http://genome.ucsc.edu)

Some results in case of two RNA-seq experiments (see Help page and Suppl. mat. for full information). A Venn diagram illustrates how the transcripts with alternative 3′UTRs are distributed across the two experiments. An example of the graphical view of the assembled transcripts (in dark green) with specific 3′UTRs in different experiments is obtained through the links to the UCSC Genome Browser (http://genome.ucsc.edu) Each transcript with an alternative 3′UTR is associated to its assembled transcript id (e.g. TCONS_0000X, for Cufflinks transcriptome assembly files), and through it, to all the corresponding analysis results, so that useful information (read coverage, expression abundance, etc.) can be directly retrieved from the complete RNA-seq data analysis results.

3 Conclusion

To the best of our knowledge, 3USS is the only available web-server that can automatically detect the presence of isoforms with alternative 3′UTRs in the RNA-seq experiment under investigation and output their genomic coordinates and nucleotide sequences for further analysis. Moreover, we believe that a protocol independent tool, such as the one described here, is useful and not only to retrieve information on novel 3′UTRs, but also to easily compare the results of different methods and approaches. Through an easy to use interface, it also allows the comparison of the 3′UTRs of the sequenced transcripts in different biological conditions. This allows researchers to further examine genes affected by 3′UTR shortening or lengthening and, for example identify their biological relevance using public tools such as FIDEA (D'Andrea ) or others, as well as directly analyze the novel 3′UTR nucleotide sequences to detect differences in the presence or absence of regulatory motifs and/or binding sites.
  17 in total

1.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

Review 2.  The AU-rich transcriptome: more than interferons and cytokines, and its role in disease.

Authors:  Khalid S A Khabar
Journal:  J Interferon Cytokine Res       Date:  2005-01       Impact factor: 2.607

3.  Progressive lengthening of 3' untranslated regions of mRNAs by alternative polyadenylation during mouse embryonic development.

Authors:  Zhe Ji; Ju Youn Lee; Zhenhua Pan; Bingjun Jiang; Bin Tian
Journal:  Proc Natl Acad Sci U S A       Date:  2009-04-16       Impact factor: 11.205

4.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.

Authors:  Cole Trapnell; Adam Roberts; Loyal Goff; Geo Pertea; Daehwan Kim; David R Kelley; Harold Pimentel; Steven L Salzberg; John L Rinn; Lior Pachter
Journal:  Nat Protoc       Date:  2012-03-01       Impact factor: 13.491

5.  Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs.

Authors:  Mitchell Guttman; Manuel Garber; Joshua Z Levin; Julie Donaghey; James Robinson; Xian Adiconis; Lin Fan; Magdalena J Koziol; Andreas Gnirke; Chad Nusbaum; John L Rinn; Eric S Lander; Aviv Regev
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

6.  Widespread shortening of 3'UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells.

Authors:  Christine Mayr; David P Bartel
Journal:  Cell       Date:  2009-08-21       Impact factor: 41.582

Review 7.  MicroRNAs: target recognition and regulatory functions.

Authors:  David P Bartel
Journal:  Cell       Date:  2009-01-23       Impact factor: 41.582

8.  GENCODE: the reference human genome annotation for The ENCODE Project.

Authors:  Jennifer Harrow; Adam Frankish; Jose M Gonzalez; Electra Tapanari; Mark Diekhans; Felix Kokocinski; Bronwen L Aken; Daniel Barrell; Amonida Zadissa; Stephen Searle; If Barnes; Alexandra Bignell; Veronika Boychenko; Toby Hunt; Mike Kay; Gaurab Mukherjee; Jeena Rajan; Gloria Despacio-Reyes; Gary Saunders; Charles Steward; Rachel Harte; Michael Lin; Cédric Howald; Andrea Tanzer; Thomas Derrien; Jacqueline Chrast; Nathalie Walters; Suganthi Balasubramanian; Baikang Pei; Michael Tress; Jose Manuel Rodriguez; Iakes Ezkurdia; Jeltje van Baren; Michael Brent; David Haussler; Manolis Kellis; Alfonso Valencia; Alexandre Reymond; Mark Gerstein; Roderic Guigó; Tim J Hubbard
Journal:  Genome Res       Date:  2012-09       Impact factor: 9.043

9.  Ensembl's 10th year.

Authors:  Paul Flicek; Bronwen L Aken; Benoit Ballester; Kathryn Beal; Eugene Bragin; Simon Brent; Yuan Chen; Peter Clapham; Guy Coates; Susan Fairley; Stephen Fitzgerald; Julio Fernandez-Banet; Leo Gordon; Stefan Gräf; Syed Haider; Martin Hammond; Kerstin Howe; Andrew Jenkinson; Nathan Johnson; Andreas Kähäri; Damian Keefe; Stephen Keenan; Rhoda Kinsella; Felix Kokocinski; Gautier Koscielny; Eugene Kulesha; Daniel Lawson; Ian Longden; Tim Massingham; William McLaren; Karine Megy; Bert Overduin; Bethan Pritchard; Daniel Rios; Magali Ruffier; Michael Schuster; Guy Slater; Damian Smedley; Giulietta Spudich; Y Amy Tang; Stephen Trevanion; Albert Vilella; Jan Vogel; Simon White; Steven P Wilder; Amonida Zadissa; Ewan Birney; Fiona Cunningham; Ian Dunham; Richard Durbin; Xosé M Fernández-Suarez; Javier Herrero; Tim J P Hubbard; Anne Parker; Glenn Proctor; James Smith; Stephen M J Searle
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

10.  The UCSC Genome Browser database: update 2010.

Authors:  Brooke Rhead; Donna Karolchik; Robert M Kuhn; Angie S Hinrichs; Ann S Zweig; Pauline A Fujita; Mark Diekhans; Kayla E Smith; Kate R Rosenbloom; Brian J Raney; Andy Pohl; Michael Pheasant; Laurence R Meyer; Katrina Learned; Fan Hsu; Jennifer Hillman-Jackson; Rachel A Harte; Belinda Giardine; Timothy R Dreszer; Hiram Clawson; Galt P Barber; David Haussler; W James Kent
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

View more
  14 in total

1.  An integrative model for alternative polyadenylation, IntMAP, delineates mTOR-modulated endoplasmic reticulum stress response.

Authors:  Jae-Woong Chang; Wei Zhang; Hsin-Sung Yeh; Meeyeon Park; Chengguo Yao; Yongsheng Shi; Rui Kuang; Jeongsik Yong
Journal:  Nucleic Acids Res       Date:  2018-07-06       Impact factor: 16.971

2.  TAPAS: tool for alternative polyadenylation site analysis.

Authors:  Ashraful Arefeen; Juntao Liu; Xinshu Xiao; Tao Jiang
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

3.  Computational Detection of MicroRNA Targets.

Authors:  Pedro Gabriel Nachtigall; Luiz Augusto Bovolenta
Journal:  Methods Mol Biol       Date:  2022

Review 4.  Analyses of alternative polyadenylation: from old school biochemistry to high-throughput technologies.

Authors:  Hsin-Sung Yeh; Wei Zhang; Jeongsik Yong
Journal:  BMB Rep       Date:  2017-04       Impact factor: 4.778

Review 5.  Discovery of alternative polyadenylation dynamics from single cell types.

Authors:  Congting Ye; Juncheng Lin; Qingshun Q Li
Journal:  Comput Struct Biotechnol J       Date:  2020-04-20       Impact factor: 7.271

6.  Alternative polyadenylation of single cells delineates cell types and serves as a prognostic marker in early stage breast cancer.

Authors:  Nayoung Kim; Woosung Chung; Hye Hyeon Eum; Hae-Ock Lee; Woong-Yang Park
Journal:  PLoS One       Date:  2019-05-17       Impact factor: 3.240

7.  Alternative polyadenylation drives oncogenic gene expression in pancreatic ductal adenocarcinoma.

Authors:  Swati Venkat; Arwen A Tisdale; Johann R Schwarz; Abdulrahman A Alahmari; H Carlo Maurer; Kenneth P Olive; Kevin H Eng; Michael E Feigin
Journal:  Genome Res       Date:  2020-02-06       Impact factor: 9.043

Review 8.  The Detection and Bioinformatic Analysis of Alternative 3' UTR Isoforms as Potential Cancer Biomarkers.

Authors:  Nitika Kandhari; Calvin A Kraupner-Taylor; Paul F Harrison; David R Powell; Traude H Beilharz
Journal:  Int J Mol Sci       Date:  2021-05-18       Impact factor: 5.923

9.  Roar: detecting alternative polyadenylation with standard mRNA sequencing libraries.

Authors:  Elena Grassi; Elisa Mariella; Antonio Lembo; Ivan Molineris; Paolo Provero
Journal:  BMC Bioinformatics       Date:  2016-10-18       Impact factor: 3.169

10.  ExUTR: a novel pipeline for large-scale prediction of 3'-UTR sequences from NGS data.

Authors:  Zixia Huang; Emma C Teeling
Journal:  BMC Genomics       Date:  2017-11-06       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.