R C Graul1, W Sadée. 1. Department of Biopharmaceutical Sciences, University of California San Francisco 94143-0446, USA.
Abstract
PURPOSE: Searching the existing databases for homologous sequences is essential to understanding a protein's structure and function. For a query sequence, its nearest neighbors can be identified by BLAST (basic local alignment search tool). However, a single query sequence is sufficient to define the entire neighborhood of related sequences, and multiple BLAST queries are needed. We describe here a program which permits automated and iterative BLAST analysis of an entire neighborhood of sequences and apply this to search for homologs of the bacteriorhodopsins outside the archaea phylum. METHODS: We have developed a Java program, 'Iterative Neighborhood Cluster Analysis' (INCA), which performs iterative BLAST searches, beginning with a single starter sequence, and proceeding with any other sequence achieving a predefined minimum alignment score. This results in a cluster of sequences where each sequence is related to at least one other sequence by the cutoff score, additional lists of more distantly related sequences for each member of cluster. RESULTS: Bacteriorhodopsins had not been previously aligned with any other protein family with scores indicative of probable homology. Using INCA, we identified a probable homolog in yeast, YRO2_YEAST, also containing seven putative transmembrane domains. A finding of probable homology was supported by additional alignment strategies. CONCLUSIONS: INCA is a useful tool to assess complete protein neighborhoods. With an increasing database, INCA can serve to detect the emergence of evolutionary links between even the most distantly related protein families. Identifying a homolog of the bacteriorhodopsins in yeast illustrates this approach but at the same time highlights the vast evolutionary distances between polytopic membrane proteins, such as the bacteriorhodopsins.
PURPOSE: Searching the existing databases for homologous sequences is essential to understanding a protein's structure and function. For a query sequence, its nearest neighbors can be identified by BLAST (basic local alignment search tool). However, a single query sequence is sufficient to define the entire neighborhood of related sequences, and multiple BLAST queries are needed. We describe here a program which permits automated and iterative BLAST analysis of an entire neighborhood of sequences and apply this to search for homologs of the bacteriorhodopsins outside the archaea phylum. METHODS: We have developed a Java program, 'Iterative Neighborhood Cluster Analysis' (INCA), which performs iterative BLAST searches, beginning with a single starter sequence, and proceeding with any other sequence achieving a predefined minimum alignment score. This results in a cluster of sequences where each sequence is related to at least one other sequence by the cutoff score, additional lists of more distantly related sequences for each member of cluster. RESULTS: Bacteriorhodopsins had not been previously aligned with any other protein family with scores indicative of probable homology. Using INCA, we identified a probable homolog in yeast, YRO2_YEAST, also containing seven putative transmembrane domains. A finding of probable homology was supported by additional alignment strategies. CONCLUSIONS: INCA is a useful tool to assess complete protein neighborhoods. With an increasing database, INCA can serve to detect the emergence of evolutionary links between even the most distantly related protein families. Identifying a homolog of the bacteriorhodopsins in yeast illustrates this approach but at the same time highlights the vast evolutionary distances between polytopic membrane proteins, such as the bacteriorhodopsins.
Authors: Katherine A Borkovich; Lisa A Alex; Oded Yarden; Michael Freitag; Gloria E Turner; Nick D Read; Stephan Seiler; Deborah Bell-Pedersen; John Paietta; Nora Plesofsky; Michael Plamann; Marta Goodrich-Tanrikulu; Ulrich Schulte; Gertrud Mannhaupt; Frank E Nargang; Alan Radford; Claude Selitrennikoff; James E Galagan; Jay C Dunlap; Jennifer J Loros; David Catcheside; Hirokazu Inoue; Rodolfo Aramayo; Michael Polymenis; Eric U Selker; Matthew S Sachs; George A Marzluf; Ian Paulsen; Rowland Davis; Daniel J Ebbole; Alex Zelter; Eric R Kalkman; Rebecca O'Rourke; Frederick Bowring; Jane Yeadon; Chizu Ishii; Keiichiro Suzuki; Wataru Sakai; Robert Pratt Journal: Microbiol Mol Biol Rev Date: 2004-03 Impact factor: 11.056
Authors: Alexander Adam; Stephan Deimel; Javier Pardo-Medina; Jorge García-Martínez; Tilen Konte; M Carmen Limón; Javier Avalos; Ulrich Terpitz Journal: Int J Mol Sci Date: 2018-01-11 Impact factor: 5.923