An artificial intelligence approach to motif discovery in protein sequences: application to steroid dehydrogenases.

T L Bailey, M E Baker, C P Elkan.

Abstract

MEME (Multiple Expectation-maximization for Motif Elicitation) is a unique new software tool that uses artificial intelligence techniques to discover motifs shared by a set of protein sequences in a fully automated manner. This paper is the first detailed study of the use of MEME to analyse a large, biologically relevant set of sequences, and to evaluate the sensitivity and accuracy of MEME in identifying structurally important motifs. For this purpose, we chose the short-chain alcohol dehydrogenase superfamily because it is large and phylogenetically diverse, providing a test of how well MEME can work on sequences with low amino acid similarity. Moreover, this dataset contains enzymes of biological importance, and because several enzymes have known X-ray crystallographic structures, we can test the usefulness of MEME for structural analysis. The first six motifs from MEME map onto structurally important alpha-helices and beta-strands on Streptomyces hydrogenans 20beta-hydroxysteroid dehydrogenase. We also describe MAST (Motif Alignment Search Tool), which conveniently uses output from MEME for searching databases such as SWISS-PROT and Genpept. MAST provides statistical measures that permit a rigorous evaluation of the significance of database searches with individual motifs or groups of motifs. A database search of Genpept90 by MAST with the log-odds matrix of the first six motifs obtained from MEME yields a bimodal output, demonstrating the selectivity of MAST. We show for the first time, using primary sequence analysis, that bacterial sugar epimerases are homologs of short-chain dehydrogenases. MEME and MAST will be increasingly useful as genome sequencing provides large datasets of phylogenetically divergent sequences of biomedical interest.
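The abstract refers to searching a database with a log-odds matrix built from motifs. As a rough illustration of that general idea only, the sketch below builds a log-odds matrix from a few aligned motif instances and slides it along a sequence to find the best-scoring window. It is not MEME's or MAST's actual algorithm or scoring scheme: the function names, the uniform background frequency, the pseudocount, and the toy sequences are all assumptions made for this example.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def log_odds_matrix(instances, pseudocount=1.0):
    """Build a log-odds matrix (one dict per column) from equal-length motif instances.

    Assumption: a uniform background frequency of 1/20 per residue.
    """
    width = len(instances[0])
    background = 1.0 / len(AMINO_ACIDS)
    matrix = []
    for col in range(width):
        # Start every residue at the pseudocount so unseen residues get a finite score.
        counts = {aa: pseudocount for aa in AMINO_ACIDS}
        for inst in instances:
            counts[inst[col]] += 1
        total = sum(counts.values())
        matrix.append(
            {aa: math.log2((counts[aa] / total) / background) for aa in AMINO_ACIDS}
        )
    return matrix


def best_match(sequence, matrix):
    """Return (start, score) of the highest-scoring window of the sequence."""
    width = len(matrix)
    best_pos, best_score = -1, float("-inf")
    for start in range(len(sequence) - width + 1):
        score = sum(matrix[i][sequence[start + i]] for i in range(width))
        if score > best_score:
            best_pos, best_score = start, score
    return best_pos, best_score


# Hypothetical toy data, not taken from the paper.
toy_instances = ["GASSGIG", "GAGSGIG", "GGSSGLG"]
toy_matrix = log_odds_matrix(toy_instances)
position, score = best_match("MKLTGASSGIGKAV", toy_matrix)
print(position, round(score, 2))
```

A real search tool would also attach a statistical significance to each score, which this sketch does not attempt.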

Year: 1997    PMID: 9366496    DOI: 10.1016/s0960-0760(97)00013-7

Source DB: PubMed    Journal: J Steroid Biochem Mol Biol    ISSN: 0960-0760    Impact factor: 4.292

