Heuristic informational analysis of sequences.

J M Claverie, L Bougueleret.   

Abstract

Nucleotide or amino-acid sequences are interpreted as successions of words of length k (k-tuples), the frequencies of which are highly variable in different statistical populations of genes or proteins. After building k-tuple reference tables from coherent subsets or entire data banks, the local information content profile of individual sequences is drawn. Anomalous regions (peaks or depressions) of such a profile can lead to the discovery and identification of specific sequence patterns. Following the same principle, the simultaneous use of two reference statistical populations and the computation of an index combining the two information profiles lead to a general and powerful discriminant analysis method. The identification of a "signal" associated with gene conversion, intron/exon discrimination, and the location of function-specific patterns in proteins are given as examples of successful applications of this heuristic informational approach.
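The abstract does not give the formulas used, so the following Python sketch is only one plausible reading of the approach, not the authors' implementation. It assumes that the information content of a k-tuple is the negative base-2 logarithm of its smoothed relative frequency in a reference table, that the local profile is a sliding mean of those values, and that the two-population index is the difference between the two profiles. All function names, the pseudocount, and the window parameter are hypothetical.

```python
from collections import Counter
from math import log2


def build_ktuple_table(sequences, k):
    """Count overlapping k-tuples across a reference set of sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def make_scorer(counts, k, alphabet_size, pseudocount=1.0):
    """Return a function mapping a k-tuple to -log2 of its smoothed frequency.

    The pseudocount (an assumption, not from the paper) keeps unseen
    k-tuples from producing an infinite information value.
    """
    total = sum(counts.values()) + pseudocount * alphabet_size ** k

    def score(word):
        return -log2((counts.get(word, 0) + pseudocount) / total)

    return score


def information_profile(seq, score, k, window=1):
    """Sliding mean of per-k-tuple information along one sequence."""
    per_word = [score(seq[i:i + k]) for i in range(len(seq) - k + 1)]
    return [
        sum(per_word[i:i + window]) / window
        for i in range(len(per_word) - window + 1)
    ]


def discriminant_profile(seq, score_a, score_b, k, window=1):
    """Assumed combined index: information under B minus information under A.

    Positive values mark regions that look more typical of population A.
    """
    prof_a = information_profile(seq, score_a, k, window)
    prof_b = information_profile(seq, score_b, k, window)
    return [b - a for a, b in zip(prof_a, prof_b)]
```

Under these assumptions, a reference table would be built once per population (for example one from exons and one from introns), and peaks or depressions in the resulting profiles would flag candidate regions for closer inspection.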

Year:  1986        PMID: 3753763      PMCID: PMC339368          DOI: 10.1093/nar/14.1.179

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  19 in total

1.  Rapid and sensitive protein similarity searches.

Authors:  D J Lipman; W R Pearson
Journal:  Science       Date:  1985-03-22       Impact factor: 47.728

2.  Computer generation and statistical analysis of a data bank of protein sequences translated from GenBank.

Authors:  J M Claverie; I Sauvaget; L Bougueleret
Journal:  Biochimie       Date:  1985-05       Impact factor: 4.079

3.  Efficient algorithms for folding and comparing nucleic acid sequences.

Authors:  J P Dumas; J Ninio
Journal:  Nucleic Acids Res       Date:  1982-01-11       Impact factor: 16.971

Review 4.  Analysis of biological sequences on small computers.

Authors:  L J Korn; C Queen
Journal:  DNA       Date:  1984-12

5.  Fast computer search for similar DNA sequences.

Authors:  M Bishop; E Thompson
Journal:  Nucleic Acids Res       Date:  1984-07-11       Impact factor: 16.971

6.  Rapid similarity searches of nucleic acid and protein data banks.

Authors:  W J Wilbur; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1983-02       Impact factor: 11.205

7.  DNA methylation and the frequency of CpG in animal DNA.

Authors:  A P Bird
Journal:  Nucleic Acids Res       Date:  1980-04-11       Impact factor: 16.971

8.  Soybean leghemoglobin gene family: normal, pseudo, and truncated genes.

Authors:  N Brisson; D P Verma
Journal:  Proc Natl Acad Sci U S A       Date:  1982-07       Impact factor: 11.205

9.  Complete nucleotide sequence of the murine H-2Kk gene. Comparison of three H-2K locus alleles.

Authors:  B Arnold; H G Burgert; A L Archibald; S Kvist
Journal:  Nucleic Acids Res       Date:  1984-12-21       Impact factor: 16.971

10.  Assessing the biological significance of primary structure consensus patterns using sequence databanks. I. Heat-shock and glucocorticoid control elements in eukaryotic promoters.

Authors:  J M Claverie; I Sauvaget
Journal:  Comput Appl Biosci       Date:  1985
  19 in total

1.  A computational analysis of sequence features involved in recognition of short introns.

Authors:  L P Lim; C B Burge
Journal:  Proc Natl Acad Sci U S A       Date:  2001-09-25       Impact factor: 11.205

2.  Gene prediction by spectral rotation measure: a new method for identifying protein-coding regions.

Authors:  Daniel Kotlar; Yizhar Lavner
Journal:  Genome Res       Date:  2003-07-17       Impact factor: 9.043

Review 3.  Assessment of protein coding measures.

Authors:  J W Fickett; C S Tung
Journal:  Nucleic Acids Res       Date:  1992-12-25       Impact factor: 16.971

4.  Statistical analysis of nucleotide sequences.

Authors:  E E Stückle; C Emmrich; U Grob; P J Nielsen
Journal:  Nucleic Acids Res       Date:  1990-11-25       Impact factor: 16.971

5.  Rearrangement of a common cellular DNA domain on chromosome 4 in human primary liver tumors.

Authors:  C Pasquinelli; F Garreau; L Bougueleret; E Cariani; K H Grzeschik; V Thiers; O Croissant; M Hadchouel; P Tiollais; C Bréchot
Journal:  J Virol       Date:  1988-02       Impact factor: 5.103

6.  K-tuple frequency in the human genome and polymerase chain reaction.

Authors:  R Griffais; P M André; M Thibon
Journal:  Nucleic Acids Res       Date:  1991-07-25       Impact factor: 16.971

7.  A set of viral DNA decamers enriched in transcription control signals.

Authors:  S Volinia; C Scapoli; R Gambari; R Barale; I Barrai
Journal:  Nucleic Acids Res       Date:  1991-07-11       Impact factor: 16.971

Review 8.  Computational methods for exon detection.

Authors:  J M Claverie
Journal:  Mol Biotechnol       Date:  1998-08       Impact factor: 2.695

9.  Self-identification of protein-coding regions in microbial genomes.

Authors:  S Audic; J M Claverie
Journal:  Proc Natl Acad Sci U S A       Date:  1998-08-18       Impact factor: 11.205

10.  Objective comparison of exon and intron sequences by means of 2-dimensional data analysis methods.

Authors:  L Bougueleret; F Tekaia; I Sauvaget; J M Claverie
Journal:  Nucleic Acids Res       Date:  1988-03-11       Impact factor: 16.971
