| Literature DB >> 33289887 |
Osamu Gotoh1,2.
Abstract
Gene-structure-aware multiple sequence alignment (GSA-MSA) is conventionally used as a tool for analyzing evolutionary changes in gene structure, i.e., gain and loss of introns during the course of evolution of homologous eukaryotic genes. Recently, however, it has become apparent that GSA-MSA is a powerful tool for detecting and remedying gene-prediction errors prevalent in genome annotations produced by various genome projects. Unfortunately, the construction of GSA-MSAs has so far required tedious procedures, thereby preventing researchers from enjoying the potential benefits of GSA-MSAs. In this chapter, we introduce a straightforward way for constructing GSA-MSAs when one or more genomic sequences and a set of transcript sequences (protein or full-length cDNAs/CDSs) are given. Our method requires no external tool or extra data, such as annotation files, although a supplementary script can generate a gene-structure-informed (GSI) transcript sequence file from annotation files.Entities:
Keywords: Annotation error; Gene prediction; Gene structure; Genome mapping; Multiple sequence alignment; Phylogenetic analysis; Spliced alignment
Mesh:
Substances:
Year: 2021 PMID: 33289887 DOI: 10.1007/978-1-0716-1036-7_5
Source DB: PubMed Journal: Methods Mol Biol ISSN: 1064-3745