Literature DB >> 2176240

Weighting aligned protein or nucleic acid sequences to correct for unequal representation.

P R Sibbald1, P Argos.   

Abstract

Aligned sequences from the same family (e.g. the haemoglobins) are seldom representative of the entire family. This is because (1) the sequence databases are heavily skewed toward a small number of organisms and (2) only a minute fraction of all the different family members have been sequenced. For many applications, such as using alignments or profiles to perform database searches for distantly related family members, such unequal representation requires correction. An algorithm to perform appropriate weighting of individual sequences is presented along with examples illustrating its efficacy.

Mesh:

Substances:

Year:  1990        PMID: 2176240     DOI: 10.1016/S0022-2836(99)80003-5

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  18 in total

Review 1.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors:  A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  2001-07-15       Impact factor: 16.971

2.  An assessment of substitution scores for protein profile-profile comparison.

Authors:  Xugang Ye; Guoli Wang; Stephen F Altschul
Journal:  Bioinformatics       Date:  2011-10-13       Impact factor: 6.937

3.  The relative inefficiency of sequence weights approaches in determining a nucleotide position weight matrix.

Authors:  Lee A Newberg; Lee Ann McCue; Charles E Lawrence
Journal:  Stat Appl Genet Mol Biol       Date:  2005-06-01

4.  Improving the sensitivity of the sequence profile method.

Authors:  R Lüthy; I Xenarios; P Bucher
Journal:  Protein Sci       Date:  1994-01       Impact factor: 6.725

Review 5.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

6.  Charting the landscape of tandem BRCT domain-mediated protein interactions.

Authors:  Nicholas T Woods; Rafael D Mesquita; Michael Sweet; Marcelo A Carvalho; Xueli Li; Yun Liu; Huey Nguyen; C Eric Thomas; Edwin S Iversen; Sylvia Marsillac; Rachel Karchin; John Koomen; Alvaro N A Monteiro
Journal:  Sci Signal       Date:  2012-09-18       Impact factor: 8.192

7.  Weighting in sequence space: a comparison of methods in terms of generalized sequences.

Authors:  M Vingron; P R Sibbald
Journal:  Proc Natl Acad Sci U S A       Date:  1993-10-01       Impact factor: 11.205

8.  Maximum diversity weighting for biomarkers with application in HIV-1 vaccine studies.

Authors:  Zonglin He; Youyi Fong
Journal:  Stat Med       Date:  2019-06-19       Impact factor: 2.373

9.  A phylogenetic approach for weighting genetic sequences.

Authors:  Nicola De Maio; Alexander V Alekseyenko; William J Coleman-Smith; Fabio Pardi; Marc A Suchard; Asif U Tamuri; Jakub Truszkowski; Nick Goldman
Journal:  BMC Bioinformatics       Date:  2021-05-28       Impact factor: 3.169

10.  The construction and use of log-odds substitution scores for multiple sequence alignment.

Authors:  Stephen F Altschul; John C Wootton; Elena Zaslavsky; Yi-Kuo Yu
Journal:  PLoS Comput Biol       Date:  2010-07-15       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.