| Literature DB >> 27799474 |
Yi Jiang1, Zhongshan Li1, Zhenwei Liu1, Denghui Chen2, Wanying Wu3, Yaoqiang Du2, Liying Ji1, Zi-Bing Jin4, Wei Li5, Jinyu Wu6.
Abstract
De novo germline mutations (DNMs) are the rarest genetic variants proven to cause a considerable number of sporadic genetic diseases, such as autism spectrum disorders, epileptic encephalopathy, schizophrenia, congenital heart disease, type 1 diabetes, and hearing loss. However, it is difficult to accurately assess the cause of DNMs and identify disease-causing genes from the considerable number of DNMs in probands. A common method to this problem is to identify genes that harbor significantly more DNMs than expected by chance, with accurate background DNM rate (DNMR) required. Therefore, in this study, we developed a novel database named mirDNMR for the collection of gene-centered background DNMRs obtained from different methods and population variation data. The database has the following functions: (i) browse and search the background DNMRs of each gene predicted by four different methods, including GC content (DNMR-GC), sequence context (DNMR-SC), multiple factors (DNMR-MF) and local DNA methylation level (DNMR-DM); (ii) search variant frequencies in publicly available databases, including ExAC, ESP6500, UK10K, 1000G and dbSNP and (iii) investigate the DNM burden to prioritize candidate genes based on the four background DNMRs using three statistical methods (TADA, Binomial and Poisson test). As a case study, we successfully employed our database in candidate gene prioritization for a sporadic complex disease: intellectual disability. In conclusion, mirDNMR (https://www.wzgenomics.cn/mirdnmr/) can be widely used to identify the genetic basis of sporadic genetic diseases.Entities:
Mesh:
Year: 2016 PMID: 27799474 PMCID: PMC5210538 DOI: 10.1093/nar/gkw1044
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The flowchart of mirDNMR. mirDNMR is a gene-centered database incorporating four different background DNMRs and variant frequencies in five human genetic variation databases. Four functions are in this database, which allow users to retrieve background DNMRs and variant frequencies in normal populations and to prioritize candidate genes. GO, KEGG pathway and PPI analysis are also provided for annotation of candidate genes.
Figure 2.An example of mirDNMR use. For the ‘Browse’ function, all four background DNMRs and DNMR-average were divided into 200 bins based on magnitude ranging from 0 to 1.426e-03. Users can search background DNMRs by gene or DNMR range. Users can also search variant frequencies in human genetic variation databases for a gene, exon, genomic region, or locus. With an input DNM list, users can prioritize candidate genes based on TADA, Poisson test, or Binomial test using one of the four background DNMRs. Using a gene list, users can prioritize candidate genes based on background DNMRs, RVIS score or the distribution of different variant types in human genetic variation databases. For a given gene list generated by these functions, GO, KEGG pathway and PPI annotations can also be performed in mirDNMR.
Figure 3.Prioritization of candidate genes for intellectual disability from trio-based WES/WGS. Based on the TADA method, genes were prioritized using the four background DNMRs. (A) Forty six genes with q values <0.1 were shared by the four background DNMRs in intellectual disability trios. (B) A scatter diagram for the 46 intellectual disability candidate genes. The size of each point indicates the total number of LoF and damaging missense DNMs for each gene. (C) Relative enriched GO terms (q value <0.05) of the 46 candidate genes for intellectual disability. Detailed information for each GO term is shown in Supplementary Table S4.