Literature DB >> 22537006

PIntron: a fast method for detecting the gene structure due to alternative splicing via maximal pairings of a pattern and a text.

Yuri Pirola1, Raffaella Rizzi, Ernesto Picardi, Graziano Pesole, Gianluca Della Vedova, Paola Bonizzoni.   

Abstract

BACKGROUND: A challenging issue in designing computational methods for predicting the gene structure into exons and introns from a cluster of transcript (EST, mRNA) sequences, is guaranteeing accuracy as well as efficiency in time and space, when large clusters of more than 20,000 ESTs and genes longer than 1 Mb are processed. Traditionally, the problem has been faced by combining different tools, not specifically designed for this task.
RESULTS: We propose a fast method based on ad hoc procedures for solving the problem. Our method combines two ideas: a novel algorithm of proved small time complexity for computing spliced alignments of a transcript against a genome, and an efficient algorithm that exploits the inherent redundancy of information in a cluster of transcripts to select, among all possible factorizations of EST sequences, those allowing to infer splice site junctions that are largely confirmed by the input data. The EST alignment procedure is based on the construction of maximal embeddings, that are sequences obtained from paths of a graph structure, called embedding graph, whose vertices are the maximal pairings of a genomic sequence T and an EST P. The procedure runs in time linear in the length of P and T and in the size of the output.The method was implemented into the PIntron package. PIntron requires as input a genomic sequence or region and a set of EST and/or mRNA sequences. Besides the prediction of the full-length transcript isoforms potentially expressed by the gene, the PIntron package includes a module for the CDS annotation of the predicted transcripts.
CONCLUSIONS: PIntron, the software tool implementing our methodology, is available at http://www.algolab.eu/PIntron under GNU AGPL. PIntron has been shown to outperform state-of-the-art methods, and to quickly process some critical genes. At the same time, PIntron exhibits high accuracy (sensitivity and specificity) when benchmarked with ENCODE annotations.

Entities:  

Mesh:

Year:  2012        PMID: 22537006      PMCID: PMC3358663          DOI: 10.1186/1471-2105-13-S5-S2

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  26 in total

1.  ECgene: genome-based EST clustering and gene modeling for alternative splicing.

Authors:  Namshin Kim; Seokmin Shin; Sanghyuk Lee
Journal:  Genome Res       Date:  2005-04       Impact factor: 9.043

Review 2.  Computational methods for alternative splicing prediction.

Authors:  Paola Bonizzoni; Raffaella Rizzi; Graziano Pesole
Journal:  Brief Funct Genomic Proteomic       Date:  2006-02-20

3.  Optimal spliced alignments of short sequence reads.

Authors:  Fabio De Bona; Stephan Ossowski; Korbinian Schneeberger; Gunnar Rätsch
Journal:  Bioinformatics       Date:  2008-08-15       Impact factor: 6.937

4.  Comprehensive splice-site analysis using comparative genomics.

Authors:  Nihar Sheth; Xavier Roca; Michelle L Hastings; Ted Roeder; Adrian R Krainer; Ravi Sachidanandam
Journal:  Nucleic Acids Res       Date:  2006-08-12       Impact factor: 16.971

5.  ASPIC: a web resource for alternative splicing prediction and transcript isoforms characterization.

Authors:  Tiziana Castrignanò; Raffaella Rizzi; Ivano Giuseppe Talamo; Paolo D'Onorio De Meo; Anna Anselmo; Paola Bonizzoni; Graziano Pesole
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

6.  Exogean: a framework for annotating protein-coding genes in eukaryotic genomic DNA.

Authors:  Sarah Djebali; Franck Delaplace; Hugues Roest Crollius
Journal:  Genome Biol       Date:  2006-08-07       Impact factor: 13.583

Review 7.  EGASP: the human ENCODE Genome Annotation Assessment Project.

Authors:  Roderic Guigó; Paul Flicek; Josep F Abril; Alexandre Reymond; Julien Lagarde; France Denoeud; Stylianos Antonarakis; Michael Ashburner; Vladimir B Bajic; Ewan Birney; Robert Castelo; Eduardo Eyras; Catherine Ucla; Thomas R Gingeras; Jennifer Harrow; Tim Hubbard; Suzanna E Lewis; Martin G Reese
Journal:  Genome Biol       Date:  2006-08-07       Impact factor: 13.583

8.  ASPIC: a novel method to predict the exon-intron structure of a gene that is optimally compatible to a set of transcript sequences.

Authors:  Paola Bonizzoni; Raffaella Rizzi; Graziano Pesole
Journal:  BMC Bioinformatics       Date:  2005-10-05       Impact factor: 3.169

9.  Automated generation of heuristics for biological sequence comparison.

Authors:  Guy St C Slater; Ewan Birney
Journal:  BMC Bioinformatics       Date:  2005-02-15       Impact factor: 3.169

10.  A space-efficient and accurate method for mapping and aligning cDNA sequences onto genomic sequence.

Authors:  Osamu Gotoh
Journal:  Nucleic Acids Res       Date:  2008-03-15       Impact factor: 16.971

View more
  4 in total

1.  Modeling alternative splicing variants from RNA-Seq data with isoform graphs.

Authors:  Stefano Beretta; Paola Bonizzoni; Gianluca Della Vedova; Yuri Pirola; Raffaella Rizzi
Journal:  J Comput Biol       Date:  2013-11-07       Impact factor: 1.479

2.  RCDA: a highly sensitive and specific alternatively spliced transcript assembly tool featuring upstream consecutive exon structures.

Authors:  Xiaolu H Sturgeon; Katheleen J Gardiner
Journal:  Genomics       Date:  2012-08-20       Impact factor: 5.736

3.  RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application.

Authors:  Mattia D'Antonio; Paolo D'Onorio De Meo; Matteo Pallocca; Ernesto Picardi; Anna Maria D'Erchia; Raffaele A Calogero; Tiziana Castrignanò; Graziano Pesole
Journal:  BMC Genomics       Date:  2015-06-01       Impact factor: 3.969

4.  Global Investigation of Cytochrome P450 Genes in the Chicken Genome.

Authors:  Junxiao Ren; Liyu Yang; Quanlin Li; Qinghe Zhang; Congjiao Sun; Xiaojun Liu; Ning Yang
Journal:  Genes (Basel)       Date:  2019-08-14       Impact factor: 4.096

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.