Literature DB >> 15130826

DNA sequence analysis linguistic tools: contrast vocabularies, compositional spectra and linguistic complexity.

Alexander Bolshoy1.   

Abstract

This is a review of the methods based on counting oligomers in nucleotide and amino acid sequences. Such methods are analogous to the formal linguistic analysis of human texts. This review includes methods based on the calculation of observed occurrences (frequencies) of oligomers and their distribution, as well as those based on deviations between the observed and the expected occurrences (contrast words, genome signatures) in biological sequences. Both types of methods have a wide range of sensitivity and can identify homologous as well as functionally and taxonomically related sequences.

Entities:  

Mesh:

Year:  2003        PMID: 15130826

Source DB:  PubMed          Journal:  Appl Bioinformatics        ISSN: 1175-5636


  3 in total

1.  Next generation sequencing and RNA-seq characterization of adipose tissue in the Nile crocodile (Crocodylus niloticus) in South Africa: Possible mechanism(s) of pathogenesis and pathophysiology of pansteatitis.

Authors:  Odunayo I Azeez; Jan G Myburgh; Ana-Mari Bosman; Jonathan Featherston; Kgomotso P Sibeko-Matjilla; Marinda C Oosthuizen; Joseph P Chamunorwa
Journal:  PLoS One       Date:  2019-11-18       Impact factor: 3.240

2.  Peptide vocabulary analysis reveals ultra-conservation and homonymity in protein sequences.

Authors:  Derek Gatherer
Journal:  Bioinform Biol Insights       Date:  2009-11-24

3.  A novel bioinformatics method for efficient knowledge discovery by BLSOM from big genomic sequence data.

Authors:  Yu Bai; Yuki Iwasaki; Shigehiko Kanaya; Yue Zhao; Toshimichi Ikemura
Journal:  Biomed Res Int       Date:  2014-04-03       Impact factor: 3.411

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.