Literature DB >> 18660848

Consensus higher order repeats and frequency of string distributions in human genome.

Vladimir Paar1, Ivan Basar, Marija Rosandić, Matko Gluncić.   

Abstract

Key string algorithm (KSA) could be viewed as robust computational generalization of restriction enzyme method. KSA enables robust and effective identification and structural analyzes of any given genomic sequences, like in the case of NCBI assembly for human genome. We have developed a method, using total frequency distribution of all r-bp key strings in dependence on the fragment length l, to determine the exact size of all repeats within the given genomic sequence, both of monomeric and HOR type. Subsequently, for particular fragment lengths equal to each of these repeat sizes we compute the partial frequency distribution of r-bp key strings; the key string with highest frequency is a dominant key string, optimal for segmentation of a given genomic sequence into repeat units. We illustrate how a wide class of 3-bp key strings leads to a key-string-dependent periodic cell which enables a simple identification and consensus length determinations of HORs, or any other highly convergent repeat of monomeric or HOR type, both tandem or dispersed. We illustrated KSA application for HORs in human genome and determined consensus HORs in the Build 35.1 assembly. In the next step we compute suprachromosomal family classification and CENP-B box / pJalpha distributions for HORs. In the case of less convergent repeats, like for example monomeric alpha satellite (20-40% divergence), we searched for optimal compact key string using frequency method and developed a concept of composite key string (GAAAC--CTTTG) or flexible relaxation (28 bp key string) which provides both monomeric alpha satellites as well as alpha monomer segmentation of internal HOR structure. This method is convenient also for study of R-strand (direct) / S-strand (reverse complement) alpha monomer alternations. Using KSA we identified 16 alternating regions of R-strand and S-strand monomers in one contig in choromosome 7. Use of CENP-B box and/or pJalpha motif as key string is suitable both for identification of HORs and monomeric pattern as well as for studies of CENP-B box / pJalpha distribution. As an example of application of KSA to sequences outside of HOR regions we present our finding of a tandem with highly convergent 3434-bp Long monomer in chromosome 5 (divergence less then 0.3%).

Entities:  

Keywords:  CENP-B box; Human genome; alpha satellite; alphoid; consensus higher order repeat; frequency distribution of strings; higher order repeat (HOR); key string algorithm - KSA; pJα motif; suprachromosomal families

Year:  2007        PMID: 18660848      PMCID: PMC2435359          DOI: 10.2174/138920207780368169

Source DB:  PubMed          Journal:  Curr Genomics        ISSN: 1389-2029            Impact factor:   2.236


  117 in total

1.  Analysis of the monomeric alphoid sequences in the pericentromeric region of human chromosome 7.

Authors:  A de la Puente; E Velasco; L A Pérez Jurado; C Hernández-Chico; F M van de Rijke; S W Scherer; A K Raap; J Cruces
Journal:  Cytogenet Cell Genet       Date:  1998

2.  Alpha-satellite DNA of primates: old and new families.

Authors:  I Alexandrov; A Kazakov; I Tumeneva; V Shepelev; Y Yurov
Journal:  Chromosoma       Date:  2001-08       Impact factor: 4.316

3.  Key-string segmentation algorithm and higher-order repeat 16mer (54 copies) in human alpha satellite DNA in chromosome 7.

Authors:  M Rosandić; V Paar; I Basar
Journal:  J Theor Biol       Date:  2003-03-07       Impact factor: 2.691

4.  Localization and polymorphism of a chromosome 12-specific alpha satellite DNA sequence.

Authors:  L H Looijenga; V T Smit; J W Wessels; P Mollevanger; J W Oosterhuis; C J Cornelisse; P Devilee
Journal:  Cytogenet Cell Genet       Date:  1990

5.  Structural organization and polymorphism of the alpha satellite DNA sequences of chromosomes 13 and 21 as revealed by pulse field gel electrophoresis.

Authors:  B Marçais; M Bellis; A Gérard; M Pagès; Y Boublik; G Roizès
Journal:  Hum Genet       Date:  1991-01       Impact factor: 4.132

6.  A human alphoid DNA clone from the EcoRI dimeric family: genomic and internal organization and chromosomal assignment.

Authors:  A Baldini; D I Smith; M Rocchi; O J Miller; D A Miller
Journal:  Genomics       Date:  1989-11       Impact factor: 5.736

7.  Chromosome-specific alpha satellite DNA from human chromosome 1: hierarchical structure and genomic organization of a polymorphic domain spanning several hundred kilobase pairs of centromeric DNA.

Authors:  J S Waye; S J Durfy; D Pinkel; S Kenwrick; M Patterson; K E Davies; H F Willard
Journal:  Genomics       Date:  1987-09       Impact factor: 5.736

8.  Repeating restriction fragments of human DNA.

Authors:  L Manuelidis
Journal:  Nucleic Acids Res       Date:  1976-11       Impact factor: 16.971

9.  DNA strand reassociation and polyribonucleotide binding in the African green monkey, Cercopithecus aethiops.

Authors:  J J Maio
Journal:  J Mol Biol       Date:  1971-03-28       Impact factor: 5.469

10.  Extensive sequence polymorphisms associated with chromosome 10 alpha satellite DNA and its close linkage to markers from the pericentromeric region.

Authors:  J S Wu; K K Kidd
Journal:  Hum Genet       Date:  1990-02       Impact factor: 4.132

View more
  6 in total

1.  Large tandem, higher order repeats and regularly dispersed repeat units contribute substantially to divergence between human and chimpanzee Y chromosomes.

Authors:  Vladimir Paar; Matko Glunčić; Ivan Basar; Marija Rosandić; Petar Paar; Mislav Cvitković
Journal:  J Mol Evol       Date:  2010-11-20       Impact factor: 2.395

2.  Global Repeat Map (GRM): Advantageous Method for Discovery of Largest Higher-Order Repeats (HORs) in Neuroblastoma Breakpoint Family (NBPF) Genes, in Hornerin Exon and in Chromosome 21 Centromere.

Authors:  Vladimir Paar; Ines Vlahović; Marija Rosandić; Matko Glunčić
Journal:  Prog Mol Subcell Biol       Date:  2021

3.  Complete genomic and epigenetic maps of human centromeres.

Authors:  Glennis A Logsdon; Andrey V Bzikadze; Pragya Sidhwani; Sasha A Langley; Gina V Caldas; Nicolas Altemose; Savannah J Hoyt; Lev Uralsky; Fedor D Ryabov; Colin J Shew; Michael E G Sauria; Matthew Borchers; Ariel Gershman; Alla Mikheenko; Valery A Shepelev; Tatiana Dvorkina; Olga Kunyavskaya; Mitchell R Vollger; Arang Rhie; Ann M McCartney; Mobin Asri; Ryan Lorig-Roach; Kishwar Shafin; Julian K Lucas; Sergey Aganezov; Daniel Olson; Leonardo Gomes de Lima; Tamara Potapova; Gabrielle A Hartley; Marina Haukness; Peter Kerpedjiev; Fedor Gusev; Kristof Tigyi; Shelise Brooks; Alice Young; Sergey Nurk; Sergey Koren; Sofie R Salama; Benedict Paten; Evgeny I Rogaev; Aaron Streets; Gary H Karpen; Abby F Dernburg; Beth A Sullivan; Aaron F Straight; Travis J Wheeler; Jennifer L Gerton; Evan E Eichler; Adam M Phillippy; Winston Timp; Megan Y Dennis; Rachel J O'Neill; Justin M Zook; Michael C Schatz; Pavel A Pevzner; Mark Diekhans; Charles H Langley; Ivan A Alexandrov; Karen H Miga
Journal:  Science       Date:  2022-04-01       Impact factor: 63.714

Review 4.  Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics.

Authors:  Syed Farhan Ahmad; Worapong Singchat; Maryam Jehangir; Aorarat Suntronpong; Thitipong Panthum; Suchinda Malaivijitnond; Kornsorn Srikulnath
Journal:  Cells       Date:  2020-12-18       Impact factor: 6.600

5.  Hierarchical structure of cascade of primary and secondary periodicities in Fourier power spectrum of alphoid higher order repeats.

Authors:  Vladimir Paar; Nenad Pavin; Ivan Basar; Marija Rosandić; Matko Gluncić; Nils Paar
Journal:  BMC Bioinformatics       Date:  2008-11-03       Impact factor: 3.169

6.  Organization and evolution of Gorilla centromeric DNA from old strategies to new approaches.

Authors:  C R Catacchio; R Ragone; G Chiatante; M Ventura
Journal:  Sci Rep       Date:  2015-09-21       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.