| Literature DB >> 27666373 |
Nilah M Ioannidis1, Joseph H Rothstein2, Vikas Pejaver3, Sumit Middha4, Shannon K McDonnell5, Saurabh Baheti5, Anthony Musolf6, Qing Li6, Emily Holzinger6, Danielle Karyadi7, Lisa A Cannon-Albright8, Craig C Teerlink8, Janet L Stanford9, William B Isaacs10, Jianfeng Xu11, Kathleen A Cooney12, Ethan M Lange13, Johanna Schleutker14, John D Carpten15, Isaac J Powell16, Olivier Cussenot17, Geraldine Cancel-Tassin17, Graham G Giles18, Robert J MacInnis18, Christiane Maier19, Chih-Lin Hsieh20, Fredrik Wiklund21, William J Catalona22, William D Foulkes23, Diptasri Mandal24, Rosalind A Eeles25, Zsofia Kote-Jarai25, Carlos D Bustamante26, Daniel J Schaid5, Trevor Hastie27, Elaine A Ostrander7, Joan E Bailey-Wilson6, Predrag Radivojac3, Stephen N Thibodeau28, Alice S Whittemore29, Weiva Sieh30.
Abstract
The vast majority of coding variants are rare, and assessment of the contribution of rare variants to complex traits is hampered by low statistical power and limited functional data. Improved methods for predicting the pathogenicity of rare coding variants are needed to facilitate the discovery of disease variants from exome sequencing studies. We developed REVEL (rare exome variant ensemble learner), an ensemble method for predicting the pathogenicity of missense variants on the basis of individual tools: MutPred, FATHMM, VEST, PolyPhen, SIFT, PROVEAN, MutationAssessor, MutationTaster, LRT, GERP, SiPhy, phyloP, and phastCons. REVEL was trained with recently discovered pathogenic and rare neutral missense variants, excluding those previously used to train its constituent tools. When applied to two independent test sets, REVEL had the best overall performance (p < 10-12) as compared to any individual tool and seven ensemble methods: MetaSVM, MetaLR, KGGSeq, Condel, CADD, DANN, and Eigen. Importantly, REVEL also had the best performance for distinguishing pathogenic from rare neutral variants with allele frequencies <0.5%. The area under the receiver operating characteristic curve (AUC) for REVEL was 0.046-0.182 higher in an independent test set of 935 recent SwissVar disease variants and 123,935 putatively neutral exome sequencing variants and 0.027-0.143 higher in an independent test set of 1,953 pathogenic and 2,406 benign variants recently reported in ClinVar than the AUCs for other ensemble methods. We provide pre-computed REVEL scores for all possible human missense variants to facilitate the identification of pathogenic variants in the sea of rare variants discovered as sequencing studies expand in scale.Entities:
Mesh:
Year: 2016 PMID: 27666373 PMCID: PMC5065685 DOI: 10.1016/j.ajhg.2016.08.016
Source DB: PubMed Journal: Am J Hum Genet ISSN: 0002-9297 Impact factor: 11.025