Literature DB >> 10454621

Intron-exon structures of eukaryotic model organisms.

M Deutsch1, M Long.   

Abstract

To investigate the distribution of intron-exon structures of eukaryotic genes, we have constructed a general exon database comprising all available intron-containing genes and exon databases from 10 eukaryotic model organisms: Homo sapiens, Mus musculus, Gallus gallus, Rattus norvegicus, Arabidopsis thaliana, Zea mays, Schizosaccharomyces pombe, Aspergillus, Caenorhabditis elegans and Drosophila. We purged redundant genes to avoid the possible bias brought about by redundancy in the databases. After discarding those questionable introns that do not contain correct splice sites, the final database contained 17 102 introns, 21 019 exons and 2903 independent or quasi-independent genes. On average, a eukaryotic gene contains 3.7 introns per kb protein coding region. The exon distribution peaks around 30-40 residues and most introns are 40-125 nt long. The variable intron-exon structures of the 10 model organisms reveal two interesting statistical phenomena, which cast light on some previous speculations. (i) Genome size seems to be correlated with total intron length per gene. For example, invertebrate introns are smaller than those of human genes, while yeast introns are shorter than invertebrate introns. However, this correlation is weak, suggesting that other factors besides genome size may also affect intron size. (ii) Introns smaller than 50 nt are significantly less frequent than longer introns, possibly resulting from a minimum intron size requirement for intron splicing.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10454621      PMCID: PMC148551          DOI: 10.1093/nar/27.15.3219

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  21 in total

1.  Splicing signals in Drosophila: intron size, information content, and consensus sequences.

Authors:  S M Mount; C Burks; G Hertz; G D Stormo; O White; C Fields
Journal:  Nucleic Acids Res       Date:  1992-08-25       Impact factor: 16.971

2.  Analysis of nonuniformity in intron phase distribution.

Authors:  A Fedorov; G Suboch; M Bujakov; L Fedorova
Journal:  Nucleic Acids Res       Date:  1992-05-25       Impact factor: 16.971

Review 3.  The recent origins of introns.

Authors:  J D Palmer; J M Logsdon
Journal:  Curr Opin Genet Dev       Date:  1991-12       Impact factor: 5.578

4.  Genome size and intron size in Drosophila.

Authors:  E N Moriyama; D A Petrov; D L Hartl
Journal:  Mol Biol Evol       Date:  1998-06       Impact factor: 16.240

Review 5.  A survey on intron and exon lengths.

Authors:  J D Hawkins
Journal:  Nucleic Acids Res       Date:  1988-11-11       Impact factor: 16.971

6.  Using the FASTA program to search protein and DNA sequence databases.

Authors:  W R Pearson
Journal:  Methods Mol Biol       Date:  1994

7.  Structural organization of the human type VII collagen gene (COL7A1), composed of more exons than any previously characterized gene.

Authors:  A M Christiano; G G Hoffman; L C Chung-Honet; S Lee; W Cheng; J Uitto; D S Greenspan
Journal:  Genomics       Date:  1994-05-01       Impact factor: 5.736

8.  Comparative sequence analysis of the human and pufferfish Huntington's disease genes.

Authors:  S Baxendale; S Abdulla; G Elgar; D Buck; M Berks; G Micklem; R Durbin; G Bates; S Brenner; S Beck
Journal:  Nat Genet       Date:  1995-05       Impact factor: 38.330

9.  Extremely short 20-33 nucleotide introns are the standard length in Paramecium tetraurelia.

Authors:  C B Russell; D Fraga; R D Hinrichsen
Journal:  Nucleic Acids Res       Date:  1994-04-11       Impact factor: 16.971

10.  Dystrophin is transcribed in brain from a distant upstream promoter.

Authors:  F M Boyce; A H Beggs; C Feener; L M Kunkel
Journal:  Proc Natl Acad Sci U S A       Date:  1991-02-15       Impact factor: 11.205

View more
  147 in total

1.  The IDB and IEDB: intron sequence and evolution databases.

Authors:  N J Schisler; J D Palmer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  EID: the Exon-Intron Database-an exhaustive database of protein-coding intron-containing genes.

Authors:  S Saxonov; I Daizadeh; A Fedorov; W Gilbert
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Positional characterisation of false positives from computational prediction of human splice sites.

Authors:  T A Thanaraj
Journal:  Nucleic Acids Res       Date:  2000-02-01       Impact factor: 16.971

4.  Molecular characterisation of RecQ homologues in Arabidopsis thaliana.

Authors:  F Hartung; H Plchová; H Puchta
Journal:  Nucleic Acids Res       Date:  2000-11-01       Impact factor: 16.971

5.  ExInt: an Exon Intron Database.

Authors:  M Sakharkar; F Passetti; J E de Souza; M Long; S J de Souza
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

6.  Intron evolution as a population-genetic process.

Authors:  Michael Lynch
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-30       Impact factor: 11.205

7.  Analysis of similarity within 142 pairs of orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae.

Authors:  Colleen T Webb; Svetlana A Shabalina; Aleksey Yu Ogurtsov; Alexey S Kondrashov
Journal:  Nucleic Acids Res       Date:  2002-03-01       Impact factor: 16.971

8.  A phylogenetic hypothesis for passerine birds: taxonomic and biogeographic implications of an analysis of nuclear DNA sequence data.

Authors:  F Keith Barker; George F Barrowclough; Jeff G Groth
Journal:  Proc Biol Sci       Date:  2002-02-07       Impact factor: 5.349

9.  Requirements for intron-mediated enhancement of gene expression in Arabidopsis.

Authors:  Alan B Rose
Journal:  RNA       Date:  2002-11       Impact factor: 4.942

10.  Distribution and characterization of regulatory elements in the human genome.

Authors:  Jacek Majewski; Jurg Ott
Journal:  Genome Res       Date:  2002-12       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.