Literature DB >> 7826021

Computational DNA sequence analysis.

S Karlin1, L R Cardon.   

Abstract

This paper reviews several new developments in computer and statistical analysis of DNA and protein sequences. We present criteria and describe means for assessing and interpreting genomic inhomogeneities within and between sequences. These include: (a) characterizations of short oligonucleotide biases and general compositional tendencies; (b) molecular evolutionary reconstructions based on dinucleotide relative abundance distance measures and partial orderings; and (c) the application of r-scan statistics, quantile distributions, and score-based analyses to identify clustering, overdispersion, and excessive evenness in the distribution of a marker array along a sequence. These apply, for example, to restriction sites, microsatellite runs, regulatory motifs, and nucleosome placements. Furthermore, (d) the definition and determination of rare and frequent oligonucleotides and peptides provides another perspective on sequence heterogeneity, and (e) score methods are also applied in exon and gene locations. Most of the ideas and methods are illustrated with respect to bacteriophage genomes, to megabase amounts of several eukaryotic sequences, to a diverse collection of bacterial sets, to mitochondrial chromosomes, and to a broad assembly of viral genomes.

Entities:  

Mesh:

Substances:

Year:  1994        PMID: 7826021     DOI: 10.1146/annurev.mi.48.100194.003155

Source DB:  PubMed          Journal:  Annu Rev Microbiol        ISSN: 0066-4227            Impact factor:   15.500


  47 in total

Review 1.  SWORDS: a statistical tool for analysing large DNA sequences.

Authors:  Probal Chaudhuri; Sandip Das
Journal:  J Biosci       Date:  2002-02       Impact factor: 1.826

2.  Evolutionary implications of microbial genome tetranucleotide frequency biases.

Authors:  David T Pride; Richard J Meinersmann; Trudy M Wassenaar; Martin J Blaser
Journal:  Genome Res       Date:  2003-02       Impact factor: 9.043

3.  Frequent oligonucleotides and peptides of the Haemophilus influenzae genome.

Authors:  S Karlin; J Mrázek; A M Campbell
Journal:  Nucleic Acids Res       Date:  1996-11-01       Impact factor: 16.971

Review 4.  Statistical signals in bioinformatics.

Authors:  Samuel Karlin
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-12       Impact factor: 11.205

5.  Evidence for diversifying selection at the pyoverdine locus of Pseudomonas aeruginosa.

Authors:  Eric E Smith; Elizabeth H Sims; David H Spencer; Rajinder Kaul; Maynard V Olson
Journal:  J Bacteriol       Date:  2005-03       Impact factor: 3.490

6.  Different clustering of genomes across life using the A-T-C-G and degenerate R-Y alphabets: early and late signaling on genome evolution?

Authors:  V Kirzhner; A Paz; Z Volkovich; E Nevo; A Korol
Journal:  J Mol Evol       Date:  2007-03-19       Impact factor: 2.395

7.  Comparisons of eukaryotic genomic sequences.

Authors:  S Karlin; I Ladunga
Journal:  Proc Natl Acad Sci U S A       Date:  1994-12-20       Impact factor: 11.205

8.  Which bacterium is the ancestor of the animal mitochondrial genome?

Authors:  S Karlin; A M Campbell
Journal:  Proc Natl Acad Sci U S A       Date:  1994-12-20       Impact factor: 11.205

9.  Heterogeneity of genomes: measures and values.

Authors:  S Karlin; I Ladunga; B E Blaisdell
Journal:  Proc Natl Acad Sci U S A       Date:  1994-12-20       Impact factor: 11.205

10.  Analysis of the Escherichia coli genome VI: DNA sequence of the region from 92.8 through 100 minutes.

Authors:  V Burland; G Plunkett; H J Sofia; D L Daniels; F R Blattner
Journal:  Nucleic Acids Res       Date:  1995-06-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.