Literature DB >> 16218944

Protein database searches using compositionally adjusted substitution matrices.

Stephen F Altschul1, John C Wootton, E Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A Schäffer, Yi-Kuo Yu.   

Abstract

Almost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long-standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal. To address this problem, we have recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions. Such adjusted matrices yield, on average, improved alignments and alignment scores when applied to the comparison of proteins with markedly biased compositions. Here we review the application of compositionally adjusted matrices and consider whether they may also be applied fruitfully to general purpose protein sequence database searches, in which related sequence pairs do not necessarily have strong compositional biases. Although it is not advisable to apply compositional adjustment indiscriminately, we describe several simple criteria under which invoking such adjustment is on average beneficial. In a typical database search, at least one of these criteria is satisfied by over half the related sequence pairs. Compositional substitution matrix adjustment is now available in NCBI's protein-protein version of blast.

Mesh:

Substances:

Year:  2005        PMID: 16218944      PMCID: PMC1343503          DOI: 10.1111/j.1742-4658.2005.04945.x

Source DB:  PubMed          Journal:  FEBS J        ISSN: 1742-464X            Impact factor:   5.542


  33 in total

1.  The estimation of statistical parameters for local alignment score distributions.

Authors:  S F Altschul; R Bundschuh; R Olsen; T Hwa
Journal:  Nucleic Acids Res       Date:  2001-01-15       Impact factor: 16.971

2.  A global compositional complexity measure for biological sequences: AT-rich and GC-rich genomes encode less complex proteins.

Authors:  H Wan; J C Wootton
Journal:  Comput Chem       Date:  2000-01

3.  PHAT: a transmembrane-specific substitution matrix. Predicted hydrophobic and transmembrane.

Authors:  P C Ng; J G Henikoff; S Henikoff
Journal:  Bioinformatics       Date:  2000-09       Impact factor: 6.937

Review 4.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors:  A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  2001-07-15       Impact factor: 16.971

5.  Non-symmetric score matrices and the detection of homologous transmembrane proteins.

Authors:  T Müller; S Rahmann; M Rehmsmeier
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

6.  ASTRAL compendium enhancements.

Authors:  John-Marc Chandonia; Nigel S Walker; Loredana Lo Conte; Patrice Koehl; Michael Levitt; Steven E Brenner
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

7.  Modeling amino acid replacement.

Authors:  T Müller; M Vingron
Journal:  J Comput Biol       Date:  2000       Impact factor: 1.479

8.  The compositional adjustment of amino acid substitution matrices.

Authors:  Yi-Kuo Yu; John C Wootton; Stephen F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  2003-12-08       Impact factor: 11.205

9.  Use of receiver operating characteristic (ROC) analysis to evaluate sequence matching.

Authors:  M Gribskov; N L Robinson
Journal:  Comput Chem       Date:  1996-03

10.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

View more
  398 in total

1.  Coenzyme A-dependent aerobic metabolism of benzoate via epoxide formation.

Authors:  Liv J Rather; Bettina Knapp; Wolfgang Haehnel; Georg Fuchs
Journal:  J Biol Chem       Date:  2010-05-07       Impact factor: 5.157

Review 2.  Denitrification by plant roots? New aspects of plant plasma membrane-bound nitrate reductase.

Authors:  Manuela Eick; Christine Stöhr
Journal:  Protoplasma       Date:  2011-12-09       Impact factor: 3.356

3.  Sequence TTKF ↓ QE defines the site of proteolytic cleavage in Mhp683 protein, a novel glycosaminoglycan and cilium adhesin of Mycoplasma hyopneumoniae.

Authors:  Daniel R Bogema; Nichollas E Scott; Matthew P Padula; Jessica L Tacchi; Benjamin B A Raymond; Cheryl Jenkins; Stuart J Cordwell; F Chris Minion; Mark J Walker; Steven P Djordjevic
Journal:  J Biol Chem       Date:  2011-10-03       Impact factor: 5.157

4.  Distinct class of DNA-binding domains is exemplified by a master regulator of phenotypic switching in Candida albicans.

Authors:  Matthew B Lohse; Rebecca E Zordan; Christopher W Cain; Alexander D Johnson
Journal:  Proc Natl Acad Sci U S A       Date:  2010-07-26       Impact factor: 11.205

Review 5.  Autophagy in protists.

Authors:  Michael Duszenko; Michael L Ginger; Ana Brennand; Melisa Gualdrón-López; María Isabel Colombo; Graham H Coombs; Isabelle Coppens; Bamini Jayabalasingham; Gordon Langsley; Solange Lisboa de Castro; Rubem Menna-Barreto; Jeremy C Mottram; Miguel Navarro; Daniel J Rigden; Patricia S Romano; Veronika Stoka; Boris Turk; Paul A M Michels
Journal:  Autophagy       Date:  2011-02-01       Impact factor: 16.016

6.  Compositional adjustment of Dirichlet mixture priors.

Authors:  Xugang Ye; Yi-Kuo Yu; Stephen F Altschul
Journal:  J Comput Biol       Date:  2010-12       Impact factor: 1.479

7.  Expression of SbGSTU (tau class glutathione S-transferase) gene isolated from Salicornia brachiata in tobacco for salt tolerance.

Authors:  Bhavanath Jha; Anubha Sharma; Avinash Mishra
Journal:  Mol Biol Rep       Date:  2010-12-07       Impact factor: 2.316

8.  Phosphate transporter genes as reliable gene markers for the identification and discrimination of arbuscular mycorrhizal fungi in the genus glomus.

Authors:  Serge Sokolski; Yolande Dalpé; Yves Piché
Journal:  Appl Environ Microbiol       Date:  2010-12-30       Impact factor: 4.792

9.  Virtual screening and evaluation of Ketol-Acid Reducto-Isomerase (KARI) as a putative drug target for Aspergillosis.

Authors:  Vivek K Morya; Shalini Kumari; Eun-Ki Kim
Journal:  Clin Proteomics       Date:  2012-02-03       Impact factor: 3.988

10.  Vibrio fischeri Biofilm Formation Prevented by a Trio of Regulators.

Authors:  Cecilia M Thompson; Anne E Marsden; Alice H Tischler; Jovanka Koo; Karen L Visick
Journal:  Appl Environ Microbiol       Date:  2018-09-17       Impact factor: 4.792

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.