August Guang, Mark Howison, Felipe Zapata, Charles Lawrence, Casey W Dunn.
Abstract
A common transcriptome assembly error is to mistake different transcripts of the same gene for transcripts from multiple closely related genes. This error is difficult to identify during assembly, but in a phylogenetic analysis such errors can be diagnosed from gene phylogenies, where they appear as clades of tips from the same species with improbably short branch lengths. treeinform is a method that uses phylogenetic information across species to refine transcriptome assemblies within species. It identifies transcripts of the same gene that were incorrectly assigned to multiple genes and reassigns them as transcripts of the same gene. The treeinform method is implemented in Agalma, available at https://bitbucket.org/caseywdunn/agalma, and the general approach is relevant in a variety of other contexts.
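The diagnostic described in the abstract can be illustrated with a minimal sketch. This is not the Agalma implementation: the `Node` class, the function names, the use of summed subtree branch length as the measure of "improbably short", and the threshold value are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """Hypothetical gene-tree node; tips carry a species and a transcript name."""
    length: float = 0.0              # branch length to the parent
    species: Optional[str] = None    # set on tips only
    name: Optional[str] = None       # transcript identifier, set on tips only
    children: List["Node"] = field(default_factory=list)


def tips(node: Node) -> List[Node]:
    """Collect all tips below (or at) a node."""
    if not node.children:
        return [node]
    found = []
    for child in node.children:
        found.extend(tips(child))
    return found


def subtree_length(node: Node) -> float:
    """Sum of all branch lengths below a node (its own branch is excluded)."""
    return sum(c.length + subtree_length(c) for c in node.children)


def candidate_clades(node: Node, threshold: float) -> List[List[str]]:
    """Return the largest clades whose tips all come from one species and whose
    summed branch length is below the (assumed) threshold."""
    if not node.children:
        return []
    clade_tips = tips(node)
    one_species = len({t.species for t in clade_tips}) == 1
    if one_species and subtree_length(node) < threshold:
        return [[t.name for t in clade_tips]]
    found = []
    for child in node.children:
        found.extend(candidate_clades(child, threshold))
    return found


# Toy gene tree: ((A_t1:0.001,A_t2:0.002):0.1,(B_t1:0.2,A_t3:0.3):0.1)
root = Node(children=[
    Node(length=0.1, children=[
        Node(length=0.001, species="A", name="A_t1"),
        Node(length=0.002, species="A", name="A_t2"),
    ]),
    Node(length=0.1, children=[
        Node(length=0.2, species="B", name="B_t1"),
        Node(length=0.3, species="A", name="A_t3"),
    ]),
])

# Transcripts in each returned clade would be reassigned to a single gene.
merged = candidate_clades(root, threshold=0.01)
```

In this toy tree, `A_t1` and `A_t2` form a same-species clade with very short branches and are flagged as transcripts of one gene, while `A_t3` is left alone because its sister tip belongs to a different species.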
Year: 2021 PMID: 33434218 PMCID: PMC7802918 DOI: 10.1371/journal.pone.0244202
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240