Literature DB >> 1641330

The application of Markov chain analysis to oligonucleotide frequency prediction and physical mapping of Drosophila melanogaster.

A J Cuticchia1, R Ivarie, J Arnold.   

Abstract

Here we compare several methods for predicting oligonucleotide frequencies in 691 kb of Drosophila melanogaster DNA. As in previous work on Escherichia coli and Saccharomyces cerevisiae, a relatively simple equation based on tetranucleotide frequencies can be used in predicting frequencies of higher order oligonucleotides. For example, the mean of observed/expected abundances of 4,096 hexamers was 1.07 with a sample standard deviation of .55. This simple predictor arises by considering each base on the sense strand of D. melanogaster to depend only on the three bases 5' to it (a 3rd order Markov chain) and is more accurate than the random predictor. This equation is useful in predicting restriction enzyme fragment sizes, selecting restriction enzymes that cut preferentially in coding vs noncoding regions, and in selecting probes to fingerprint clones in contig mapping. Once again, this equation well predicts the occurrence of higher order oligonucleotides, supporting our hypothesis that this predictor holds in evolutionarily diverse organisms. When ranked from highest to lowest abundance, the observed frequencies of oligomers of a given length are closely tracked by the predicted abundances of a 3rd order Markov chain. Through use of the dependence of oligomer frequencies on base composition, we report a list of oligomers that will be useful for the completion of a cosmid physical map of D. melanogaster. Presently, the library is such that it will be possible to construct large contigs using only 30 oligonucleotide probes to fingerprint cosmids.

Entities:  

Mesh:

Substances:

Year:  1992        PMID: 1641330      PMCID: PMC334014          DOI: 10.1093/nar/20.14.3651

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  19 in total

1.  Chromosomal region of the cystic fibrosis gene in yeast artificial chromosomes: a model for human genome mapping.

Authors:  E D Green; M V Olson
Journal:  Science       Date:  1990-10-05       Impact factor: 47.728

2.  Toward a physical map of the genome of the nematode Caenorhabditis elegans.

Authors:  A Coulson; J Sulston; S Brenner; J Karn
Journal:  Proc Natl Acad Sci U S A       Date:  1986-10       Impact factor: 11.205

3.  Prediction of the frequencies of restriction endonuclease recognition sequences using di- and mononucleotide frequencies.

Authors:  R C Peterson
Journal:  Biotechniques       Date:  1988-01       Impact factor: 1.993

4.  Multiplicity of satellite DNA sequences in Drosophila melanogaster.

Authors:  A R Lohe; D L Brutlag
Journal:  Proc Natl Acad Sci U S A       Date:  1986-02       Impact factor: 11.205

5.  The physical map of the whole E. coli chromosome: application of a new strategy for rapid analysis and sorting of a large genomic library.

Authors:  Y Kohara; K Akiyama; K Isono
Journal:  Cell       Date:  1987-07-31       Impact factor: 41.582

6.  Cloning of large segments of exogenous DNA into yeast by means of artificial chromosome vectors.

Authors:  D T Burke; G F Carle; M V Olson
Journal:  Science       Date:  1987-05-15       Impact factor: 47.728

7.  A comprehensive package for DNA sequence analysis in FORTRAN IV for the PDP-11.

Authors:  J Arnold; V K Eckenrode; K Lemke; G J Phillips; S W Schaeffer
Journal:  Nucleic Acids Res       Date:  1986-01-10       Impact factor: 16.971

8.  Mono- through hexanucleotide composition of the sense strand of yeast DNA: a Markov chain analysis.

Authors:  J Arnold; A J Cuticchia; D A Newsome; W W Jennings; R Ivarie
Journal:  Nucleic Acids Res       Date:  1988-07-25       Impact factor: 16.971

9.  Inverted repeat sequences in the Drosophila genome.

Authors:  C W Schmid; J E Manning; N Davidson
Journal:  Cell       Date:  1975-06       Impact factor: 41.582

10.  Use of high coverage reference libraries of Drosophila melanogaster for relational data analysis. A step towards mapping and sequencing of the genome.

Authors:  J D Hoheisel; G G Lennon; G Zehetner; H Lehrach
Journal:  J Mol Biol       Date:  1991-08-20       Impact factor: 5.469

View more
  5 in total

1.  Counting stem cells: methodological constraints.

Authors:  Leonid V Bystrykh; Evgenia Verovskaya; Erik Zwart; Mathilde Broekhuis; Gerald de Haan
Journal:  Nat Methods       Date:  2012-05-30       Impact factor: 28.547

Review 2.  Mechanisms and evolution of control logic in prokaryotic transcriptional regulation.

Authors:  Sacha A F T van Hijum; Marnix H Medema; Oscar P Kuipers
Journal:  Microbiol Mol Biol Rev       Date:  2009-09       Impact factor: 11.056

3.  On the consistency of a physical mapping method to reconstruct a chromosome in vitro.

Authors:  M Xiong; H J Chen; R A Prade; Y Wang; J Griffith; W E Timberlake; J Arnold
Journal:  Genetics       Date:  1996-01       Impact factor: 4.562

4.  Over- and underrepresentation of short DNA words in herpesvirus genomes.

Authors:  M Y Leung; G M Marsh; T P Speed
Journal:  J Comput Biol       Date:  1996       Impact factor: 1.479

5.  Atypical regions in large genomic DNA sequences.

Authors:  S Scherer; M S McPeek; T P Speed
Journal:  Proc Natl Acad Sci U S A       Date:  1994-07-19       Impact factor: 11.205

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.