Literature DB >> 19361328

Statistical comparison framework and visualization scheme for ranking-based algorithms in high-throughput genome-wide studies.

Waibhav D Tembe1, John V Pearson, Nils Homer, James Lowey, Edward Suh, David W Craig.   

Abstract

As a first step in analyzing high-throughput data in genome-wide studies, several algorithms are available to identify and prioritize candidates lists for downstream fine-mapping. The prioritized candidates could be differentially expressed genes, aberrations in comparative genomics hybridization studies, or single nucleotide polymorphisms (SNPs) in association studies. Different analysis algorithms are subject to various experimental artifacts and analytical features that lead to different candidate lists. However, little research has been carried out to theoretically quantify the consensus between different candidate lists and to compare the study specific accuracy of the analytical methods based on a known reference candidate list. Within the context of genome-wide studies, we propose a generic mathematical framework to statistically compare ranked lists of candidates from different algorithms with each other or, if available, with a reference candidate list. To cope with the growing need for intuitive visualization of high-throughput data in genome-wide studies, we describe a complementary customizable visualization tool. As a case study, we demonstrate application of our framework to the comparison and visualization of candidate lists generated in a DNA-pooling based genome-wide association study of CEPH data in the HapMap project, where prior knowledge from individual genotyping can be used to generate a true reference candidate list. The results provide a theoretical basis to compare the accuracy of various methods and to identify redundant methods, thus providing guidance for selecting the most suitable analysis method in genome-wide studies.

Mesh:

Substances:

Year:  2009        PMID: 19361328      PMCID: PMC3148127          DOI: 10.1089/cmb.2008.0151

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  9 in total

1.  Cheap, accurate and rapid allele frequency estimation of single nucleotide polymorphisms by primer extension and DHPLC in DNA pools.

Authors:  B Hoogendoorn; N Norton; G Kirov; N Williams; M L Hamshere; G Spurlock; J Austin; M K Stephens; P R Buckland; M J Owen; M C O'Donovan
Journal:  Hum Genet       Date:  2000-11       Impact factor: 4.132

2.  A cluster validity framework for genome expression data.

Authors:  F Azuaje
Journal:  Bioinformatics       Date:  2002-02       Impact factor: 6.937

Review 3.  DNA Pooling: a tool for large-scale association studies.

Authors:  Pak Sham; Joel S Bader; Ian Craig; Michael O'Donovan; Michael Owen
Journal:  Nat Rev Genet       Date:  2002-11       Impact factor: 53.242

4.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

5.  Identification of the genetic basis for complex disorders by use of pooling-based genomewide single-nucleotide-polymorphism association studies.

Authors:  John V Pearson; Matthew J Huentelman; Rebecca F Halperin; Waibhav D Tembe; Stacey Melquist; Nils Homer; Marcel Brun; Szabolcs Szelinger; Keith D Coon; Victoria L Zismann; Jennifer A Webster; Thomas Beach; Sigrid B Sando; Jan O Aasly; Reinhard Heun; Frank Jessen; Heike Kolsch; Magdalini Tsolaki; Makrina Daniilidou; Eric M Reiman; Andreas Papassotiropoulos; Michael L Hutton; Dietrich A Stephan; David W Craig
Journal:  Am J Hum Genet       Date:  2006-12-06       Impact factor: 11.025

6.  Association testing by DNA pooling: an effective initial screen.

Authors:  Aruna Bansal; Dirk van den Boom; Stefan Kammerer; Christiane Honisch; Gail Adam; Charles R Cantor; Patrick Kleyn; Andi Braun
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-10       Impact factor: 11.205

7.  Identification of disease causing loci using an array-based genotyping approach on pooled DNA.

Authors:  David W Craig; Matthew J Huentelman; Diane Hu-Lince; Victoria L Zismann; Michael C Kruer; Anne M Lee; Erik G Puffenberger; John M Pearson; Dietrich A Stephan
Journal:  BMC Genomics       Date:  2005-09-30       Impact factor: 3.969

8.  Silhouette scores for assessment of SNP genotype clusters.

Authors:  Lovisa Lovmar; Annika Ahlford; Mats Jonsson; Ann-Christine Syvänen
Journal:  BMC Genomics       Date:  2005-03-10       Impact factor: 3.969

9.  MPDA: microarray pooled DNA analyzer.

Authors:  Hsin-Chou Yang; Mei-Chu Huang; Ling-Hui Li; Chien-Hsing Lin; Alice L T Yu; Mitchell B Diccianni; Jer-Yuarn Wu; Yuan-Tsong Chen; Cathy S J Fann
Journal:  BMC Bioinformatics       Date:  2008-04-15       Impact factor: 3.169

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.