| Literature DB >> 26483451 |
Yuval Itan1, Lei Shang2, Bertrand Boisson2, Etienne Patin3, Alexandre Bolze2, Marcela Moncada-Vélez4, Eric Scott5, Michael J Ciancanelli2, Fabien G Lafaille2, Janet G Markle2, Ruben Martinez-Barricarte2, Sarah Jill de Jong2, Xiao-Fei Kong2, Patrick Nitschke6, Aziz Belkadi7, Jacinta Bustamante8, Anne Puel7, Stéphanie Boisson-Dupuis9, Peter D Stenson10, Joseph G Gleeson11, David N Cooper10, Lluis Quintana-Murci3, Jean-Michel Claverie12, Shen-Ying Zhang9, Laurent Abel9, Jean-Laurent Casanova13.
Abstract
The protein-coding exome of a patient with a monogenic disease contains about 20,000 variants, only one or two of which are disease causing. We found that 58% of rare variants in the protein-coding exome of the general population are located in only 2% of the genes. Prompted by this observation, we aimed to develop a gene-level approach for predicting whether a given human protein-coding gene is likely to harbor disease-causing mutations. To this end, we derived the gene damage index (GDI): a genome-wide, gene-level metric of the mutational damage that has accumulated in the general population. We found that the GDI was correlated with selective evolutionary pressure, protein complexity, coding sequence length, and the number of paralogs. We compared GDI with the leading gene-level approaches, genic intolerance, and de novo excess, and demonstrated that GDI performed best for the detection of false positives (i.e., removing exome variants in genes irrelevant to disease), whereas genic intolerance and de novo excess performed better for the detection of true positives (i.e., assessing de novo mutations in genes likely to be disease causing). The GDI server, data, and software are freely available to noncommercial users from lab.rockefeller.edu/casanova/GDI.Entities:
Keywords: gene prioritization; gene-level; mutational damage; next generation sequencing; variant prioritization
Mesh:
Year: 2015 PMID: 26483451 PMCID: PMC4640721 DOI: 10.1073/pnas.1518646112
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205