Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Identification and Correction of Erroneous Protein Sequences in Public Databases.

Literature DB >> 27115633

Identification and Correction of Erroneous Protein Sequences in Public Databases.

Abstract

Correct prediction of the structure of protein-coding genes of higher eukaryotes is a difficult task therefore public sequence databases incorporating predicted sequences are increasingly contaminated with erroneous sequences. The high rate of misprediction has serious consequences since it significantly affects the conclusions that may be drawn from genome-scale sequence analyses.Here we describe the MisPred and FixPred approaches that may help the identification and correction of erroneous sequences. The rationale of these approaches is that a protein sequence is likely to be erroneous if some of its features conflict with our current knowledge about proteins.

Keywords: Gene prediction; Genome annotation; Genome assembly; Misannotation; Misassembly; Misprediction; Protein-coding genes; Proteins; Sequencing errors

Mesh：

Substances：
Proteins

Year: 2016 PMID： 27115633 DOI： 10.1007/978-1-4939-3572-7_9

Source DB: PubMed Journal: Methods Mol Biol ISSN： 1064-3745

Keyword Cloud
Cited

1 in total

1. Cooperation of Spaln and Prrn5 for Construction of Gene-Structure-Aware Multiple Sequence Alignment.

Authors: Osamu Gotoh
Journal: Methods Mol Biol Date: 2021

1 in total