
Statistical significance of threading scores.

Afshin Fayyaz Movaghar, Guillaume Launay, Sophie Schbath, Jean-François Gibrat, François Rodolphe.

Abstract

We present a general method for assessing threading score significance. The threading score of a protein sequence, threaded onto a given structure, should be compared with the threading score distribution of a random amino-acid sequence of the same length threaded onto the same structure; small p-values indicate significantly high scores. We claim that, due to general protein contact map properties, this reference distribution is a Weibull extreme value distribution whose parameters depend on the threading method, the structure, the length of the query, and the random sequence simulation model used. These parameters can be estimated off-line from simulated sequence samples for different sequence lengths. They can then be interpolated at the exact length of a query, enabling quick computation of the p-value.
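
The workflow in the abstract (fit the reference distribution off-line at several lengths, interpolate the parameters at the query length, then read off a p-value) can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: it assumes a two-parameter Weibull on positive scores where higher means better, uses linear interpolation (the abstract does not say which interpolation scheme is used), and replaces the real threading of random sequences with a hypothetical `simulate_scores` stand-in. All function names are invented for this example.

```python
import math
import random


def fit_weibull(samples, lo=0.05, hi=50.0, iters=60):
    """Maximum-likelihood fit of a two-parameter Weibull (shape, scale).

    Assumes all samples are strictly positive. The shape is found by
    bisection on the profile-likelihood equation, which is increasing in k.
    """
    logs = [math.log(x) for x in samples]
    mean_log = sum(logs) / len(logs)
    max_log = max(logs)

    def weights(k):
        # x**k rescaled by max(x)**k to avoid overflow for large k.
        return [math.exp(k * (l - max_log)) for l in logs]

    def g(k):
        w = weights(k)
        return sum(wi * li for wi, li in zip(w, logs)) / sum(w) - 1.0 / k - mean_log

    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if g(mid) > 0:
            hi = mid
        else:
            lo = mid
    shape = 0.5 * (lo + hi)
    w = weights(shape)
    scale = math.exp(max_log) * (sum(w) / len(w)) ** (1.0 / shape)
    return shape, scale


def interpolate_params(table, length):
    """Linearly interpolate (shape, scale) between tabulated lengths (assumption)."""
    lengths = sorted(table)
    if not lengths[0] <= length <= lengths[-1]:
        raise ValueError("length outside the tabulated range")
    for a, b in zip(lengths, lengths[1:]):
        if a <= length <= b:
            t = (length - a) / (b - a)
            return tuple((1 - t) * pa + t * pb for pa, pb in zip(table[a], table[b]))
    return table[lengths[0]]  # only one tabulated length


def p_value(score, shape, scale):
    """P(random score >= observed score) under the fitted Weibull."""
    if score <= 0:
        return 1.0
    return math.exp(-((score / scale) ** shape))


def simulate_scores(length, n, rng):
    """Hypothetical stand-in for threading n random sequences of this length.

    A real pipeline would generate random sequences and thread each one
    onto the structure; here scores are simply drawn from a Weibull whose
    scale grows with length.
    """
    return [rng.weibullvariate(10.0 + 0.1 * length, 3.0) for _ in range(n)]


if __name__ == "__main__":
    rng = random.Random(0)
    # Off-line step: one fit per tabulated sequence length.
    table = {n: fit_weibull(simulate_scores(n, 2000, rng)) for n in (50, 100, 150, 200)}
    # On-line step: interpolate at the query length and compute the p-value.
    shape, scale = interpolate_params(table, 120)
    print(p_value(35.0, shape, scale))
```

The split between `fit_weibull` and `interpolate_params` mirrors the off-line/on-line division in the abstract: the expensive simulation runs once per structure and length, while scoring a query needs only a table lookup and one exponential.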

Year: 2011; PMID: 22149633; PMCID: PMC3244815; DOI: 10.1089/cmb.2011.0236

Source DB: PubMed; Journal: J Comput Biol; ISSN: 1066-5277; Impact factor: 1.479

