| Literature DB >> 32271766 |
Nikolas Pontikos1,2,3, Cian Murphy1,4, Ismail Moghul5, Gavin Arno2,3,6, Kaoru Fujinami2,3,6,7, Yu Fujinami8,9, Dayyanah Sumodhee10, Susan Downes11,12, Andrew Webster2,3, Jing Yu12.
Abstract
As high-throughput sequencing is increasingly applied to the molecular diagnosis of rare Mendelian disorders, a large number of patients with diverse phenotypes have their genetic and phenotypic data pooled together to uncover new gene-phenotype relations. We introduce Phenogenon, a statistical tool that combines, Human Phenotype Ontology (HPO) annotated patient phenotypes, gnomAD allele population frequency, and Combined Annotation Dependent Depletion (CADD) score for variant pathogenicity, in order to jointly predict the mode of inheritance and gene-phenotype associations. We ran Phenogenon on our cohort of 3,290 patients who had undergone whole exome sequencing. Among the top associations, we recapitulated previously known, such as "SRD5A3-Abnormal full-field electroretinogram-recessive" and "GRHL2 -Nail dystrophy-recessive", and discovered one potentially novel, "RRAGA-Abnormality of the skin-dominant". We also developed an interactive web interface available at https://phenogenon.phenopolis.org to visualise and explore the results.Entities:
Year: 2020 PMID: 32271766 PMCID: PMC7144978 DOI: 10.1371/journal.pone.0230587
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Total number of 3290 exomes by predominant phenotypes.
| Predominant phenotype(s) | Number of samples |
|---|---|
| Dementia (with relation to prion disease) | 1039 |
| Inflammatory bowel disease | 653 |
| Retinal disorders | 504 |
| Healthy | 272 |
| Epilepsy | 241 |
| Bone Marrow Failure | 190 |
| primary immunodeficiency | 109 |
| Sudden Cardiac Death | 92 |
| Mitochondrial diseases | 89 |
| Dermotological disorders | 47 |
| Arrhythmogenic right ventricular cardiomyopathy | 27 |
| Nervous system disorders | 14 |
| Cataract | 5 |
| Mitochondrial diseases | 4 |
| Keratoconus | 4 |
Fig 1Phenogenon profiling workflow.
A) The distribution of frequency vs CADD Phred score for variants of a single gene were binned according to empirically chosen cut-offs. B) Variants within each binned area are further analysed. Individuals carrying these variants are identified and then filtered on the basis of whether they have a selected HPO term. C) Fisher’s Exact test is then used to determine the significance of the gene-phenotype relationship. D) A Phenogenon heatmap is produced using the Fisher Exact P-Values for each binned area. E) Fisher Exact Scores for each of the binned area in the first column are collapsed into a single HPO goodness of fit score (HGF) using a Scaled Stouffer transformation.
Known HPO-gene-MOI relationships used to benchmark Phenogenon.
| Rank | Gene | HPO | MOI | NP | M score | HGF score |
|---|---|---|---|---|---|---|
| 1 | Seizures | Dom | 100 | Dom | 64.43 | |
| 2 | Visual impairment | Rec | 259 | Rec | 26.20 | |
| 3 | Macular dystrophy | Rec | 76 | Rec | 16.78 | |
| 4 | Constriction of the peripheral visual field | Rec | 41 | Rec | 9.43 | |
| 5 | Nyctalopia | Rec | 15 | Rec | 8.25 | |
| 6 | Macular dystrophy | Dom | 60 | Dom | 7.02 | |
| 7 | Visual loss | Rec | 8 | Rec | 6.82 | |
| 8 | Retinal dystrophy | Rec | 25 | Rec | 6.75 | |
| 9 | Bone marrow hypocellularity | Dom | 48 | Dom | 6.28 | |
| 10 | Constriction of the peripheral visual field | Rec | 10 | Rec | 5.61 | |
| 11 | Constriction of the peripheral visual field | X-linked | 28 | Rec | 4.77 | |
| 12 | Visual loss | Rec | 4 | Rec | 2.51 |
MOI = Mode of Inheritance; NP = the number of patients who carry rare variants for the corresponding MOI
Fig 2Using phenogenon to predict gene-HPO-mode of inheritance (MOI) relationships for the 12 known genes.
A. Examples of using Phenogenon to profile known relationships: ABCA4—Macular dystrophy (HP:0007754) -recessive, and SCN1A—Seizures (HP:0001250)—dominant. The color scales represent the HGF score. The majority of high-scoring bins are for rare variants (HGF < 0.00025). B. Error rate in predicting HPO when number of patients selected per gene is higher than ‘HPO NP cut-off’. The lines give the trend of error rates for each prediction model. C. Error rate for MOI when HPO selected per gene is higher than HGF cut-off. The lines give the trend of error rates for each prediction model. Orange line: model using gnomAD allele frequency instead of estimated homozygote frequency for recessive MOI; Red line: model using HGF for both HPO association and MOI prediction; Blue line: model using Fisher method to combine p values; Green line: our current model for Phenogenon.
Top-ranked gene-phenotype-MOI relations reported by phenogenon.
| Gene | Gene Description | Predicted HPO | Predicted MOI | Known MOI | HGF score | Known |
|---|---|---|---|---|---|---|
| Encodes Ras-related GTP-binding protein A that activates Mtorc [ | Abnormality of the skin | dominant | / | 11.43 | No | |
| Steroid 5α-reductase type 3 is known to cause congenital disorders of glycosylation, which may involve retinal disorders [ | Abnormal full-field electroretinogram | recessive | recessive | 11.13 | Yes | |
| Known to cause pituitary adenoma [ | Dementia | recessive | / | 11.03 | No | |
| Abnormal electroretinogram | recessive | / | 10.98 | No | ||
| Transcription factor involved in multiple cancers and keratin development [ | Nail dystrophy | recessive | recessive | 10.54 | Yes | |
| Gain of function variants in this transcription factor exhibit diverse immune dysfunction [ | Severe combined immunodeficiency | dominant | dominant/recessive | 10.38 | Yes | |
| Involved in cardiomyopathy [ | Abnormality of the anterior segment of the globe | dominant | / | 9.74 | No | |
| Retinal dystrophy | recessive | recessive | 9.40 | Yes |