| Literature DB >> 29197136 |
Eman Alhuzimi1, Luis G Leal1, Michael J E Sternberg1, Alessia David1.
Abstract
We analyzed 563,099 common (minor allele frequency, MAF≥0.01) and rare (MAF < 0.01) genetic variants annotated in ExAC and UniProt and 26,884 disease-causing variants from ClinVar and UniProt occurring in the coding region of 17,975 human protein-coding genes. Three novel sets of genes were identified: those enriched in rare variants (n = 32 genes), in common variants (n = 282 genes), and in disease-causing variants (n = 800 genes). Genes enriched in rare variants have far greater similarities in terms of biological and network properties to genes enriched in disease-causing variants, than to genes enriched in common variants. However, in half of the genes enriched in rare variants (AOC2, MAMDC4, ANKHD1, CDC42BPB, SPAG5, TRRAP, TANC2, IQCH, USP54, SRRM2, DOPEY2, and PITPNM1), no disease-causing variants have been identified in major, publicly available databases. Thus, genetic variants in these genes are strong candidates for disease and their identification, as part of sequencing studies, should prompt further in vitro analyses.Entities:
Keywords: genetic variants; human disease; protein coding genes; protein network
Mesh:
Year: 2017 PMID: 29197136 PMCID: PMC5838408 DOI: 10.1002/humu.23377
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Figure 1Essential genes. Number (percentage) of essential genes in: (A) the set of all human protein‐coding genes reported in UniProt, (B) genes enriched in common variants (common‐EVset), (C) genes enriched in rare variants (rare‐EVset), and (D) genes enriched in disease‐causing variants (disease‐EVset). Genes are classified as essential if the mouse ortholog of the human gene produces a lethal phenotype (essential in MGI database) or if the gene is reported in the OGEE database (essential in OGEE)
Figure 2First‐degree neighbors for TRRAP, AOC2, SRRM2, and SPAG5. Only first‐degree neighbors with known disease‐causing variants are displayed. TRRAP, AOC2, SRRM2, and SPAG5 are presented as black circles. Diseases are classified according to the 10th revision of the International Statistical Classification of Diseases and Related Health Problems (ICD‐10)