Literature DB >> 25273843

SNP characteristics predict replication success in association studies.

Ivan P Gorlov1, Jason H Moore, Bo Peng, Jennifer L Jin, Olga Y Gorlova, Christopher I Amos.   

Abstract

Successful independent replication is the most direct approach for distinguishing real genotype-disease associations from false discoveries in genome-wide association studies (GWAS). Selecting SNPs for replication has been primarily based on P values from the discovery stage, although additional characteristics of SNPs may be used to improve replication success. We used disease-associated SNPs from more than 2,000 published GWASs to identify predictors of SNP reproducibility. SNP reproducibility was defined as a proportion of successful replications among all replication attempts. The study reporting association for the first time was considered to be discovery and all consequent studies targeting the same phenotype replications. We found that -Log(P), where P is a P value from the discovery study, is the strongest predictor of the SNP reproducibility. Other significant predictors include type of the SNP (e.g., missense vs intronic SNPs) and minor allele frequency. Features of the genes linked to the disease-associated SNP also predict SNP reproducibility. Based on empirically defined rules, we developed a reproducibility score (RS) to predict SNP reproducibility independently of -Log(P). We used data from two lung cancer GWAS studies as well as recently reported disease-associated SNPs to validate RS. Minus Log(P) outperforms RS when the very top SNPs are selected, while RS works better with relaxed selection criteria. In conclusion, we propose an empirical model to predict SNP reproducibility, which can be used to select SNPs for validation and prioritization.

Entities:  

Mesh:

Year:  2014        PMID: 25273843      PMCID: PMC4384517          DOI: 10.1007/s00439-014-1493-6

Source DB:  PubMed          Journal:  Hum Genet        ISSN: 0340-6717            Impact factor:   4.132


  26 in total

1.  Optimal selection of markers for validation or replication from genome-wide association studies.

Authors:  Celia M T Greenwood; Jagadish Rangrej; Lei Sun
Journal:  Genet Epidemiol       Date:  2007-07       Impact factor: 2.135

2.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

3.  Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1.

Authors:  Christopher I Amos; Xifeng Wu; Peter Broderick; Ivan P Gorlov; Jian Gu; Timothy Eisen; Qiong Dong; Qing Zhang; Xiangjun Gu; Jayaram Vijayakrishnan; Kate Sullivan; Athena Matakidou; Yufei Wang; Gordon Mills; Kimberly Doheny; Ya-Yu Tsai; Wei Vivien Chen; Sanjay Shete; Margaret R Spitz; Richard S Houlston
Journal:  Nat Genet       Date:  2008-04-02       Impact factor: 38.330

Review 4.  Planning a genome-wide association study: points to consider.

Authors:  Hakon Hakonarson; Struan F A Grant
Journal:  Ann Med       Date:  2011-05-19       Impact factor: 4.709

5.  Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS.

Authors:  Dan L Nicolae; Eric Gamazon; Wei Zhang; Shiwei Duan; M Eileen Dolan; Nancy J Cox
Journal:  PLoS Genet       Date:  2010-04-01       Impact factor: 5.917

Review 6.  Study designs for genome-wide association studies.

Authors:  Peter Kraft; David G Cox
Journal:  Adv Genet       Date:  2008       Impact factor: 1.944

Review 7.  Comparative genomics as a tool to understand evolution and disease.

Authors:  Jessica Alföldi; Kerstin Lindblad-Toh
Journal:  Genome Res       Date:  2013-07       Impact factor: 9.043

8.  A guide to best practices for Gene Ontology (GO) manual annotation.

Authors:  Rama Balakrishnan; Midori A Harris; Rachael Huntley; Kimberly Van Auken; J Michael Cherry
Journal:  Database (Oxford)       Date:  2013-07-09       Impact factor: 3.451

9.  TiGER: a database for tissue-specific gene expression and regulation.

Authors:  Xiong Liu; Xueping Yu; Donald J Zack; Heng Zhu; Jiang Qian
Journal:  BMC Bioinformatics       Date:  2008-06-09       Impact factor: 3.169

10.  Genes with a large intronic burden show greater evolutionary conservation on the protein level.

Authors:  Olga Gorlova; Alexey Fedorov; Christopher Logothetis; Christopher Amos; Ivan Gorlov
Journal:  BMC Evol Biol       Date:  2014-03-16       Impact factor: 3.260

View more
  12 in total

1.  Untouchable genes in the human genome: Identifying ideal targets for cancer treatment.

Authors:  Ivan P Gorlov; Olga Y Gorlova; Christopher I Amos
Journal:  Cancer Genet       Date:  2019-01-24

2.  SNP characteristics and validation success in genome wide association studies.

Authors:  Olga Y Gorlova; Xiangjun Xiao; Spiridon Tsavachidis; Christopher I Amos; Ivan P Gorlov
Journal:  Hum Genet       Date:  2022-01-04       Impact factor: 5.881

3.  Machine learning and data mining in complex genomic data--a review on the lessons learned in Genetic Analysis Workshop 19.

Authors:  Inke R König; Jonathan Auerbach; Damian Gola; Elizabeth Held; Emily R Holzinger; Marc-André Legault; Rui Sun; Nathan Tintle; Hsin-Chou Yang
Journal:  BMC Genet       Date:  2016-02-03       Impact factor: 2.797

4.  Reconciling Differences in Pool-GWAS Between Populations: A Case Study of Female Abdominal Pigmentation in Drosophila melanogaster.

Authors:  Lukas Endler; Andrea J Betancourt; Viola Nolte; Christian Schlötterer
Journal:  Genetics       Date:  2015-12-29       Impact factor: 4.562

5.  Validation study of genetic biomarkers of response to TNF inhibitors in rheumatoid arthritis.

Authors:  Rosario Lopez-Rodriguez; Eva Perez-Pampin; Ana Marquez; Francisco J Blanco; Beatriz Joven; Patricia Carreira; Miguel Angel Ferrer; Rafael Caliz; Lara Valor; Javier Narvaez; Juan D Cañete; Maria Del Carmen Ordoñez; Sara Manrique-Arija; Yiannis Vasilopoulos; Alejandro Balsa; Dora Pascual-Salcedo; Manuel J Moreno-Ramos; Juan Jose Alegre-Sancho; Federico Navarro-Sarabia; Virginia Moreira; Rosa Garcia-Portales; Enrique Raya; Cesar Magro-Checa; Javier Martin; Juan J Gomez-Reino; Antonio Gonzalez
Journal:  PLoS One       Date:  2018-05-07       Impact factor: 3.240

6.  Gene characteristics predicting missense, nonsense and frameshift mutations in tumor samples.

Authors:  Ivan P Gorlov; Claudio W Pikielny; Hildreth R Frost; Stephanie C Her; Michael D Cole; Samuel D Strohbehn; David Wallace-Bradley; Marek Kimmel; Olga Y Gorlova; Christopher I Amos
Journal:  BMC Bioinformatics       Date:  2018-11-19       Impact factor: 3.169

Review 7.  Find the Needle in the Haystack, Then Find It Again: Replication and Validation in the 'Omics Era.

Authors:  Wei Perng; Stella Aslibekyan
Journal:  Metabolites       Date:  2020-07-12

8.  The Use of Multiplicity Corrections, Order Statistics and Generalized Family-Wise Statistics with Application to Genome-Wide Studies.

Authors:  Steven J Schrodi
Journal:  PLoS One       Date:  2016-04-29       Impact factor: 3.240

9.  Prediction of the gene expression in normal lung tissue by the gene expression in blood.

Authors:  Justin W Halloran; Dakai Zhu; David C Qian; Jinyoung Byun; Olga Y Gorlova; Christopher I Amos; Ivan P Gorlov
Journal:  BMC Med Genomics       Date:  2015-11-17       Impact factor: 3.063

10.  SNP eQTL status and eQTL density in the adjacent region of the SNP are associated with its statistical significance in GWA studies.

Authors:  Ivan Gorlov; Xiangjun Xiao; Maureen Mayes; Olga Gorlova; Christopher Amos
Journal:  BMC Genet       Date:  2019-11-12       Impact factor: 2.797

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.