Literature DB >> 15632091

Identification of polymorphic motifs using probabilistic search algorithms.

Analabha Basu1, Probal Chaudhuri, Partha P Majumder.   

Abstract

The problem of identifying motifs comprising nucleotides at a set of polymorphic DNA sites, not necessarily contiguous, arises in many human genetic problems. However, when the sites are not contiguous, no efficient algorithm exists for polymorphic motif identification. A search based on complete enumeration is computationally inefficient. We have developed probabilistic search algorithms to discover motifs of known or unknown lengths. We have developed statistical tests of significance for assessing a motif discovery, and a statistical criterion for simultaneously estimating motif length and discovering it. We have tested these algorithms on various synthetic data sets and have shown that they are very efficient, in the sense that the "true" motifs can be detected in the vast majority of replications and in a small number of iterations. Additionally, we have applied them to some real data sets and have shown that they are able to identify known motifs. In certain applications, it is pertinent to find motifs that contain contrasting nucleotides at the sites included in the motif (e.g., motifs identified in case-control association studies). For this, we have suggested appropriate modifications. Using simulations, we have discovered that the success rate of identification of the correct motif is high in case-control studies except when relative risks are small. Our analyses of evolutionary data sets resulted in the identification of some motifs that appear to have important implications on human evolutionary inference. These algorithms can easily be implemented to discover motifs from multilocus genotype data by simple numerical recoding of genotypes.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15632091      PMCID: PMC540278          DOI: 10.1101/gr.2358005

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  19 in total

Review 1.  Association study designs for complex diseases.

Authors:  L R Cardon; J I Bell
Journal:  Nat Rev Genet       Date:  2001-02       Impact factor: 53.242

2.  High-resolution haplotype structure in the human genome.

Authors:  M J Daly; J D Rioux; S F Schaffner; T J Hudson; E S Lander
Journal:  Nat Genet       Date:  2001-10       Impact factor: 38.330

3.  Genetic evidence on the origins of Indian caste populations.

Authors:  M Bamshad; T Kivisild; W S Watkins; M E Dixon; C E Ricker; B B Rao; J M Naidu; B V Prasad; P G Reddy; A Rasanayagam; S S Papiha; R Villems; A J Redd; M F Hammer; S V Nguyen; M L Carroll; M A Batzer; L B Jorde
Journal:  Genome Res       Date:  2001-06       Impact factor: 9.043

4.  Human genome. HapMap launched with pledges of $100 million.

Authors:  Jennifer Couzin
Journal:  Science       Date:  2002-11-01       Impact factor: 47.728

5.  SNPs on human chromosomes 21 and 22 -- analysis in terms of protein features and pseudogenes.

Authors:  Suganthi Balasubramanian; Paul Harrison; Hedi Hegyi; Paul Bertone; Nicholas Luscombe; Nathaniel Echols; Patrick McGarvey; ZhaoLei Zhang; Mark Gerstein
Journal:  Pharmacogenomics       Date:  2002-05       Impact factor: 2.533

6.  Detecting recent positive selection in the human genome from haplotype structure.

Authors:  Pardis C Sabeti; David E Reich; John M Higgins; Haninah Z P Levine; Daniel J Richter; Stephen F Schaffner; Stacey B Gabriel; Jill V Platko; Nick J Patterson; Gavin J McDonald; Hans C Ackerman; Sarah J Campbell; David Altshuler; Richard Cooper; Dominic Kwiatkowski; Ryk Ward; Eric S Lander
Journal:  Nature       Date:  2002-10-09       Impact factor: 49.962

7.  Deep common ancestry of indian and western-Eurasian mitochondrial DNA lineages.

Authors:  T Kivisild; M J Bamshad; K Kaldma; M Metspalu; E Metspalu; M Reidla; S Laos; J Parik; W S Watkins; M E Dixon; S S Papiha; S S Mastana; M R Mir; V Ferak; R Villems
Journal:  Curr Biol       Date:  1999-11-18       Impact factor: 10.834

8.  Conserved promoter motif is required for cell cycle timing of dnaX transcription in Caulobacter.

Authors:  K C Keiler; L Shapiro
Journal:  J Bacteriol       Date:  2001-08       Impact factor: 3.490

9.  Genetic evidence of an early exit of Homo sapiens sapiens from Africa through eastern Africa.

Authors:  L Quintana-Murci; O Semino; H J Bandelt; G Passarino; K McElreavey; A S Santachiara-Benerecetti
Journal:  Nat Genet       Date:  1999-12       Impact factor: 38.330

10.  Expression of QK/QR/RRRAA or DERAA motifs at the third hypervariable region of HLA-DRB1 and disease severity in rheumatoid arthritis.

Authors:  Abbas Khani-Hanjani; Diane Lacaille; Cathy Horne; Andrew Chalmers; David I Hoar; Robert Balshaw; Paul A Keown
Journal:  J Rheumatol       Date:  2002-07       Impact factor: 4.666

View more
  1 in total

Review 1.  The Indian Genome Variation database (IGVdb): a project overview.

Authors: 
Journal:  Hum Genet       Date:  2005-08-25       Impact factor: 4.132

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.