Literature DB >> 27010337

Finding Protein and Nucleotide Similarities with FASTA.

William R Pearson1.   

Abstract

The FASTA programs provide a comprehensive set of rapid similarity searching tools (fasta36, fastx36, tfastx36, fasty36, tfasty36), similar to those provided by the BLAST package, as well as programs for slower, optimal, local, and global similarity searches (ssearch36, ggsearch36), and for searching with short peptides and oligonucleotides (fasts36, fastm36). The FASTA programs use an empirical strategy for estimating statistical significance that accommodates a range of similarity scoring matrices and gap penalties, improving alignment boundary accuracy and search sensitivity. The FASTA programs can produce "BLAST-like" alignment and tabular output, for ease of integration into existing analysis pipelines, and can search small, representative databases, and then report results for a larger set of sequences, using links from the smaller dataset. The FASTA programs work with a wide variety of database formats, including mySQL and postgreSQL databases. The programs also provide a strategy for integrating domain and active site annotations into alignments and highlighting the mutational state of functionally critical residues. These protocols describe how to use the FASTA programs to characterize protein and DNA sequences, using protein:protein, protein:DNA, and DNA:DNA comparisons.
Copyright © 2016 John Wiley & Sons, Inc.

Entities:  

Keywords:  E()-value; alignment annotation; expectation; homology; scoring matrices; similarity

Mesh:

Substances:

Year:  2016        PMID: 27010337      PMCID: PMC5072362          DOI: 10.1002/0471250953.bi0309s53

Source DB:  PubMed          Journal:  Curr Protoc Bioinformatics        ISSN: 1934-3396


  21 in total

1.  Estimating amino acid substitution models: a comparison of Dayhoff's estimator, the resolvent approach and a maximum likelihood method.

Authors:  Tobias Müller; Rainer Spang; Martin Vingron
Journal:  Mol Biol Evol       Date:  2002-01       Impact factor: 16.240

2.  Empirical determination of effective gap penalties for sequence comparison.

Authors:  J T Reese; W R Pearson
Journal:  Bioinformatics       Date:  2002-11       Impact factor: 6.937

3.  Performance evaluation of a new algorithm for the detection of remote homologs with sequence comparison.

Authors:  Maricel G Kann; Richard A Goldstein
Journal:  Proteins       Date:  2002-08-01

4.  Getting more from less: algorithms for rapid protein identification with multiple short peptide sequences.

Authors:  Aaron J Mackey; Timothy A J Haystead; William R Pearson
Journal:  Mol Cell Proteomics       Date:  2002-02       Impact factor: 5.911

5.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

6.  The rapid generation of mutation data matrices from protein sequences.

Authors:  D T Jones; W R Taylor; J M Thornton
Journal:  Comput Appl Biosci       Date:  1992-06

7.  Effective protein sequence comparison.

Authors:  W R Pearson
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

8.  Maximum-likelihood estimation of the statistical distribution of Smith-Waterman local sequence similarity scores.

Authors:  R Mott
Journal:  Bull Math Biol       Date:  1992-01       Impact factor: 1.758

9.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

10.  HMMER web server: interactive sequence similarity searching.

Authors:  Robert D Finn; Jody Clements; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2011-05-18       Impact factor: 16.971

View more
  19 in total

1.  The mole genome reveals regulatory rearrangements associated with adaptive intersexuality.

Authors:  Francisca M Real; Stefan A Haas; Paolo Franchini; Peiwen Xiong; Oleg Simakov; Heiner Kuhl; Robert Schöpflin; David Heller; M-Hossein Moeinzadeh; Verena Heinrich; Thomas Krannich; Annkatrin Bressin; Michaela F Hartmann; Stefan A Wudy; Dina K N Dechmann; Alicia Hurtado; Francisco J Barrionuevo; Magdalena Schindler; Izabela Harabula; Marco Osterwalder; Michael Hiller; Lars Wittler; Axel Visel; Bernd Timmermann; Axel Meyer; Martin Vingron; Rafael Jiménez; Stefan Mundlos; Darío G Lupiáñez
Journal:  Science       Date:  2020-10-09       Impact factor: 47.728

2.  PDBspheres: a method for finding 3D similarities in local regions in proteins.

Authors:  Adam T Zemla; Jonathan E Allen; Dan Kirshner; Felice C Lightstone
Journal:  NAR Genom Bioinform       Date:  2022-10-10

3.  Retrogene Duplication and Expression Patterns Shaped by the Evolution of Sex Chromosomes in Malaria Mosquitoes.

Authors:  Duncan Miller; Jianhai Chen; Jiangtao Liang; Esther Betrán; Manyuan Long; Igor V Sharakhov
Journal:  Genes (Basel)       Date:  2022-05-28       Impact factor: 4.141

4.  No Evidence for Orthohepevirus C in Archived Human Samples in Germany, 2000-2020.

Authors:  Mirko Faber; Jürgen J Wenzel; Monika Erl; Klaus Stark; Mathias Schemmerer
Journal:  Viruses       Date:  2022-03-31       Impact factor: 5.818

5.  Query-seeded iterative sequence similarity searching improves selectivity 5-20-fold.

Authors:  William R Pearson; Weizhong Li; Rodrigo Lopez
Journal:  Nucleic Acids Res       Date:  2017-04-20       Impact factor: 16.971

6.  MicroRNA duplication accelerates the recruitment of new targets during vertebrate evolution.

Authors:  Junjie Luo; Yirong Wang; Jian Yuan; Zhilei Zhao; Jian Lu
Journal:  RNA       Date:  2018-03-06       Impact factor: 4.942

7.  Type IV CRISPR-Cas systems are highly diverse and involved in competition between plasmids.

Authors:  Rafael Pinilla-Redondo; David Mayo-Muñoz; Jakob Russel; Roger A Garrett; Lennart Randau; Søren J Sørensen; Shiraz A Shah
Journal:  Nucleic Acids Res       Date:  2020-02-28       Impact factor: 16.971

8.  Genome-Wide Analysis of Known and Potential Tetraspanins in Entamoeba histolytica.

Authors:  Kentaro Tomii; Herbert J Santos; Tomoyoshi Nozaki
Journal:  Genes (Basel)       Date:  2019-11-03       Impact factor: 4.096

9.  Rhodopsin 7-The unusual Rhodopsin in Drosophila.

Authors:  Pingkalai R Senthilan; Charlotte Helfrich-Förster
Journal:  PeerJ       Date:  2016-09-06       Impact factor: 2.984

10.  iMusta4SLC: Database for the structural property and variations of solute carrier transporters.

Authors:  Akiko Higuchi; Naoki Nonaka; Kei Yura
Journal:  Biophys Physicobiol       Date:  2018-04-27
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.