Literature DB >> 8391893

UNIREP: a microcomputer program to find unique and repetitive nucleotide sequences in genomes.

J Mrázek1, J Kypr.   

Abstract

We present a program UNIREP, written in PowerBASIC for IBM-PCs, that identifies repetitive and unique nucleotide sequences in genomes or parts of genomes. A key feature of the algorithm is an oligonucleotide representation in a numerical code to make possible a comparison of all pairs of oligonucleotides (including overlaps) occurring in the analyzed sequence. This comparison assigns a score to each oligonucleotide, reflecting its similarity/dissimilarity to other oligonucleotides of the same length in the analyzed sequence. The score is plotted along the sequence so that peaks in the plot indicate repetitive regions and very low values reflect unique sequences. The scores are filtered to suppress or enhance the unique or repetitive sequences according to the user's wish. UNIREP is extended by auxiliary programs HIGHER and LOWER to list nucleotide sequences that have scores higher or lower than given limits. The potential of UNIREP is demonstrated using several long nucleotide sequences including the complete genomic sequence of EBV.

Entities:  

Mesh:

Substances:

Year:  1993        PMID: 8391893     DOI: 10.1093/bioinformatics/9.3.355

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  1 in total

1.  Phage_UniR_LGBM: Phage Virion Proteins Classification with UniRep Features and LightGBM Model.

Authors:  Wenzheng Bao; Qingyu Cui; Baitong Chen; Bin Yang
Journal:  Comput Math Methods Med       Date:  2022-04-15       Impact factor: 2.809

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.