Literature DB >> 18384072

Functional annotation by sequence-weighted structure alignments: statistical analysis and case studies from the Protein 3000 structural genomics project in Japan.

Daron M Standley1, Hiroyuki Toh, Haruki Nakamura.   

Abstract

A method to functionally annotate structural genomics targets, based on a novel structural alignment scoring function, is proposed. In the proposed score, position-specific scoring matrices are used to weight structurally aligned residue pairs to highlight evolutionarily conserved motifs. The functional form of the score is first optimized for discriminating domains belonging to the same Pfam family from domains belonging to different families but the same CATH or SCOP superfamily. In the optimization stage, we consider four standard weighting functions as well as our own, the "maximum substitution probability," and combinations of these functions. The optimized score achieves an area of 0.87 under the receiver-operating characteristic curve with respect to identifying Pfam families within a sequence-unique benchmark set of domain pairs. Confidence measures are then derived from the benchmark distribution of true-positive scores. The alignment method is next applied to the task of functionally annotating 230 query proteins released to the public as part of the Protein 3000 structural genomics project in Japan. Of these queries, 78 were found to align to templates with the same Pfam family as the query or had sequence identities > or = 30%. Another 49 queries were found to match more distantly related templates. Within this group, the template predicted by our method to be the closest functional relative was often not the most structurally similar. Several nontrivial cases are discussed in detail. Finally, 103 queries matched templates at the fold level, but not the family or superfamily level, and remain functionally uncharacterized. 2008 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2008        PMID: 18384072     DOI: 10.1002/prot.22015

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  3 in total

1.  SeSAW: balancing sequence and structural information in protein functional mapping.

Authors:  Daron M Standley; Reiko Yamashita; Akira R Kinjo; Hiroyuki Toh; Haruki Nakamura
Journal:  Bioinformatics       Date:  2010-03-17       Impact factor: 6.937

2.  A single polymorphic amino acid on Toxoplasma gondii kinase ROP16 determines the direct and strain-specific activation of Stat3.

Authors:  Masahiro Yamamoto; Daron M Standley; Seiji Takashima; Hiroyuki Saiga; Megumi Okuyama; Hisako Kayama; Emi Kubo; Hiroshi Ito; Mutsumi Takaura; Tadashi Matsuda; Dominique Soldati-Favre; Kiyoshi Takeda
Journal:  J Exp Med       Date:  2009-11-09       Impact factor: 14.307

Review 3.  Genomes to hits in silico - a country path today, a highway tomorrow: a case study of chikungunya.

Authors:  Anjali Soni; Khushhali M Pandey; Pratima Ray; B Jayaram
Journal:  Curr Pharm Des       Date:  2013       Impact factor: 3.116

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.