Exploration of novel motifs derived from mouse cDNA sequences.

Hideya Kawaji, Christian Schönbach, Yo Matsuo, Jun Kawai, Yasushi Okazaki, Yoshihide Hayashizaki, Hideo Matsuda.

Abstract

We performed a systematic maximum density subgraph (MDS) detection of conserved sequence regions to discover new, biologically relevant motifs from a set of 21,050 conceptually translated mouse cDNA (FANTOM1) sequences. A total of 3202 candidate sequences, which shared similar regions over >20 amino acid residues, were screened against known conserved regions listed in Pfam, ProDom, and InterPro. The filtering procedure resulted in 139 FANTOM1 sequences belonging to 49 new motif candidates. Using annotations and multiple sequence alignment information, we removed by visual inspection 42 candidates whose members were found to be false positives because of sequence redundancy, alternative splicing, low complexity, transcribed retroviral repeat elements contained in the region of the predicted open reading frame, and reports in the literature. The remaining seven motifs have been expanded by hidden Markov model (HMM) profile searches of SWISS-PROT/TrEMBL from 28 FANTOM1 sequences to 164 members and analyzed in detail at the sequence and structure levels to elucidate the possible functions of the motifs and their members. The novel and conserved motif MDS00105 is specific for the mammalian inhibitor of growth (ING) family. Three submotifs, MDS00105.1-3, are specific for the ING1/ING1L, ING1-homolog, and ING3 subfamilies. The motif MDS00105 together with a PHD finger domain constitutes a module for ING proteins. Structural motif MDS00113 represents a leucine zipper-like motif. Conserved motif MDS00145 is a novel 1-acyl-sn-glycerol-3-phosphate acyltransferase (AGPAT) submotif containing a transmembrane domain that distinguishes AGPAT3 and AGPAT4 from all other acyltransferase domain-containing proteins. Functional motif MDS00148 overlaps with the Kazal-type serine protease inhibitor domain but has been detected only in an extracellular loop region of solute carrier 21 (SLC21) (organic anion transporters) family members, which may regulate the specificity of anion uptake. Our motif discovery not only aided in the functional characterization of new mouse orthologs for potential drug targets but also allowed us to predict that at least 16 other new motifs are waiting to be discovered from the current SWISS-PROT/TrEMBL database.
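
The abstract names maximum density subgraph detection without spelling out the procedure. As a rough illustration only, the Python sketch below finds a dense subgraph with a generic greedy peeling approximation; it is not the authors' algorithm. It assumes a hypothetical similarity graph in which vertices are sequences and an edge joins two sequences that share a similar region. The function name, the edge list, and the density definition (edges divided by vertices) are all assumptions made for this example.

```python
from collections import defaultdict


def densest_subgraph_greedy(edges):
    """Greedy peeling: repeatedly drop the lowest-degree vertex and keep
    the densest intermediate subgraph (density = edges / vertices).
    This is a generic approximation, not the method used in the paper."""
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-similarity
            adj[u].add(v)
            adj[v].add(u)

    remaining = set(adj)
    n_edges = sum(len(nbrs) for nbrs in adj.values()) // 2
    best_density, best_nodes = 0.0, set()

    while remaining:
        density = n_edges / len(remaining)
        if density > best_density:
            best_density, best_nodes = density, set(remaining)
        # Remove the vertex with the fewest neighbours (ties broken by name).
        victim = min(remaining, key=lambda x: (len(adj[x]), x))
        for nbr in adj[victim]:
            adj[nbr].discard(victim)
        n_edges -= len(adj[victim])
        remaining.discard(victim)
        del adj[victim]

    return best_nodes, best_density


# Hypothetical similarity graph: four mutually similar sequences (a-d)
# plus two loosely attached ones (e, f).
example_edges = [
    ("a", "b"), ("a", "c"), ("a", "d"),
    ("b", "c"), ("b", "d"), ("c", "d"),
    ("d", "e"), ("e", "f"),
]
nodes, density = densest_subgraph_greedy(example_edges)
print(sorted(nodes), density)
```

In this toy graph the four mutually similar sequences form the densest group, which loosely mirrors the idea of a motif candidate as a set of sequences sharing one conserved region.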

Year:  2002        PMID: 11875024      PMCID: PMC155289          DOI: 10.1101/gr.193702

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


