Literature DB >> 10592221

EID: the Exon-Intron Database-an exhaustive database of protein-coding intron-containing genes.

S Saxonov1, I Daizadeh, A Fedorov, W Gilbert.   

Abstract

To aid studies of molecular evolution and to assist in gene prediction research, we have constructed an Exon-Intron Database (EID) in FASTA format. Currently, the database is derived from GenBank release 112, and it contains 51 289 protein-coding genes (287 209 exons) that harbor introns, along with extensive descriptions of each gene and its DNA and protein sequences, as well as splice motif information. There is 17% redundancy inherited from GenBank-a purge at the 99% identity level reduced the database to 42 460 genes (243 589 exons). We have created subdatabases of genes whose intron positions have been experimentally determined. One such database, constructed by comparing genomic and mRNA sequences, contains 11 242 genes (62 474 exons). A larger database of 22 196 genes (105 595 exons) was constructed by selecting on keywords to eliminate computer-predicted genes. By examining the two nucleotides adjacent to the intron boundary, we infer that there is a 2% rate of errors or other deviations from the standard GTellipsisAG motif in nuclear genes. This criterion can be used to eliminate 4921 genes from the overall database. Various tools are provided to enable generation of user-specific subsets of the EID. The EID distribution can be obtained from http://mcb.harvard.edu/gilbert/EID

Mesh:

Substances:

Year:  2000        PMID: 10592221      PMCID: PMC102483          DOI: 10.1093/nar/28.1.185

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  16 in total

1.  Introns and protein revolution--an analysis of the exon/intron organisation of actin genes.

Authors:  S Bagavathi; R Malathi
Journal:  FEBS Lett       Date:  1996-08-19       Impact factor: 4.124

2.  Relationship between "proto-splice sites" and intron phases: evidence from dicodon analysis.

Authors:  M Long; S J de Souza; C Rosenberg; W Gilbert
Journal:  Proc Natl Acad Sci U S A       Date:  1998-01-06       Impact factor: 11.205

3.  Introns and reading frames: correlation between splicing sites and their codon positions.

Authors:  M Tomita; N Shimizu; D L Brutlag
Journal:  Mol Biol Evol       Date:  1996-11       Impact factor: 16.240

4.  Intron distribution in ancient paralogs supports random insertion and not random loss.

Authors:  G Cho; R F Doolittle
Journal:  J Mol Evol       Date:  1997-06       Impact factor: 2.395

5.  Exon/intron structure of aldehyde dehydrogenase genes supports the "introns-late" theory.

Authors:  A Rzhetsky; F J Ayala; L C Hsu; C Chang; A Yoshida
Journal:  Proc Natl Acad Sci U S A       Date:  1997-06-24       Impact factor: 11.205

6.  Arabidopsis consensus intron sequences.

Authors:  J W Brown; P Smith; C G Simpson
Journal:  Plant Mol Biol       Date:  1996-11       Impact factor: 4.076

Review 7.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

8.  Intron phase correlations and the evolution of the intron/exon structure of genes.

Authors:  M Long; C Rosenberg; W Gilbert
Journal:  Proc Natl Acad Sci U S A       Date:  1995-12-19       Impact factor: 11.205

9.  Requirement of U12 snRNA for in vivo splicing of a minor class of eukaryotic nuclear pre-mRNA introns.

Authors:  S L Hall; R A Padgett
Journal:  Science       Date:  1996-03-22       Impact factor: 47.728

10.  Why genes in pieces?

Authors:  W Gilbert
Journal:  Nature       Date:  1978-02-09       Impact factor: 49.962

View more
  53 in total

1.  Comparison of intron-containing and intron-lacking human genes elucidates putative exonic splicing enhancers.

Authors:  A Fedorov; S Saxonov; L Fedorova; I Daizadeh
Journal:  Nucleic Acids Res       Date:  2001-04-01       Impact factor: 16.971

2.  GeneSplicer: a new computational method for splice site prediction.

Authors:  M Pertea; X Lin; S L Salzberg
Journal:  Nucleic Acids Res       Date:  2001-03-01       Impact factor: 16.971

3.  Intron distribution difference for 276 ancient and 131 modern genes suggests the existence of ancient introns.

Authors:  A Fedorov; X Cao; S Saxonov; S J de Souza; S W Roy; W Gilbert
Journal:  Proc Natl Acad Sci U S A       Date:  2001-10-30       Impact factor: 11.205

4.  ExInt: an Exon Intron Database.

Authors:  M Sakharkar; F Passetti; J E de Souza; M Long; S J de Souza
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

5.  Regularities of context-dependent codon bias in eukaryotic genes.

Authors:  Alexei Fedorov; Serge Saxonov; Walter Gilbert
Journal:  Nucleic Acids Res       Date:  2002-03-01       Impact factor: 16.971

6.  Evolutionary relationships among Rel domains indicate functional diversification by recombination.

Authors:  I A Graef; J M Gastier; U Francke; G R Crabtree
Journal:  Proc Natl Acad Sci U S A       Date:  2001-05-08       Impact factor: 11.205

7.  Bioinformatics and clinical informatics: the imperative to collaborate.

Authors:  I S Kohane
Journal:  J Am Med Inform Assoc       Date:  2000 Sep-Oct       Impact factor: 4.497

8.  A novel beta-glucosidase in Uromyces fabae: feast or fight?

Authors:  Ariane C Haerter; Ralf T Voegele
Journal:  Curr Genet       Date:  2003-11-26       Impact factor: 3.886

9.  The effect of nonsense codons on splicing: a genomic analysis.

Authors:  Xiang Zhang; James Lee; Lawrence A Chasin
Journal:  RNA       Date:  2003-06       Impact factor: 4.942

10.  Mystery of intron gain.

Authors:  Alexei Fedorov; Scott Roy; Larisa Fedorova; Walter Gilbert
Journal:  Genome Res       Date:  2003-09-15       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.