| Literature DB >> 24884844 |
Yannis J Trakadis1, Caroline Buote, Jean-François Therriault, Pierre-Étienne Jacques, Hugo Larochelle, Sébastien Lévesque.
Abstract
BACKGROUND: We propose a phenotype-driven analysis of encrypted exome data to facilitate the widespread implementation of exome sequencing as a clinical genetic screening test.Twenty test-patients with varied syndromes were selected from the literature. For each patient, the mutation, phenotypic data, and genetic diagnosis were available. Next, control exome-files, each modified to include one of these twenty mutations, were assigned to the corresponding test-patients. These data were used by a geneticist blinded to the diagnoses to test the efficiency of our software, PhenoVar. The score assigned by PhenoVar to any genetic diagnosis listed in OMIM (Online Mendelian Inheritance in Man) took into consideration both the patient's phenotype and all variations present in the corresponding exome. The physician did not have access to the individual mutations. PhenoVar filtered the search using a cut-off phenotypic match threshold to prevent undesired discovery of incidental findings and ranked the OMIM entries according to diagnostic score.Entities:
Mesh:
Year: 2014 PMID: 24884844 PMCID: PMC4030287 DOI: 10.1186/1755-8794-7-22
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Figure 1Workflow of PhenoVar. PhenoVar automatically prioritizes diagnoses for validation based on both the phenotypic and genomic information of a proband. It calculates a patient-specific diagnostic score for each OMIM entry with known molecular basis. The diagnostic score assigned to a given syndrome is the sum of its phenotypic and genotypic weight. For each syndrome listed in the HPO database the phenotypic weight is determined by calculating the similarity between the proband and the different patients available in a local database (Phenobase). Phenobase includes simulated patients using HPO and real patients (here denoted as “local patients”). The genotypic weight for each syndrome corresponds to the (predicted) pathogenicity of any variants present in the proband’s exome specifically in the gene(s) causing the respective syndrome. When no variation is found in these genes, the genotypic weight for that syndrome is automatically set to null value. Otherwise, the variants are sorted into known disease-causing variants (DC var) versus possibly pathogenic variants (other var) and assigned a different score. The genotypic weight and phenotypic weight described above are summed to obtain the diagnostic score for each syndrome. The different syndromes are then ranked according to their diagnostic score.
Characteristics of the test patients selected from the literature
| 1a | Holoprosencephaly | Holoprosencephaly-2 (157170) | ||
| Microphthalmos | ||||
| Iris coloboma | ||||
| (Wallis et al.) [ | ||||
| 2a | Preaxial polydactyly | c.379C > T p.Arg127Ter | Short rib-polydactyly syndrome, type II (263520) | |
| Median cleft lip and palate | ||||
| Short ribs | ||||
| (Thiel et al.) [ | ||||
| 3a | Cutaneous finger syndactyly | c.1984 T > A p.Cys653Ser | Epiphyseal dysplasia, multiple, 4 (226900) | |
| Patellar dislocation | ||||
| Scoliosis | ||||
| (Makitie et al.) [ | ||||
| 4a | Polymicrogyria | c.1036 T > A p. Cys346Ser | Polymicrogyria, bilateral frontoparietal (606854) | |
| Seizures | ||||
| Microcephaly | ||||
| (Piao et al.) [ | ||||
| 5a | Synophrys | c.1127C > G p.Pro376Arg | Cornelia de Lange syndrome 4 (614701) | |
| Microcephaly | ||||
| Tetralogy of Fallot | ||||
| (Deardorff et al.) [ | ||||
| 6a | Micromelia | c.4172A > G p.Tyr1391Cys | Platyspondylic lethal skeletal dysplasia, Torrance type (151210) | |
| Radial bowing | ||||
| Pulmonary hypoplasia | ||||
| (Nishimura et al.) [ | ||||
| 7a | Generalized myoclonic seizures | c.3409C > T p. Arg1137Ter | Kleefstra syndrome/Chromosome 9q34.3 deletion syndrome (610253) | |
| Global developmental delay | ||||
| Short stature | ||||
| (Kleefstra et al.) [ | ||||
| 8a | Anophthalmia | c.878C > T p.Pro293Leu | Microphthalmia, syndromic 9 (601186) | |
| Pulmonic stenosis | ||||
| Blepharophimosis | ||||
| (Pasutto et al.) [ | ||||
| 9a | Oligohydramnios | c.1235G > A p. Trp412Ter | TARP syndrome (311900) | |
| Cleft palate | ||||
| Defect in the atrial septum | ||||
| (Johnston et al.) [ | ||||
| 10a | Hyperventilation | c.1727G > A p.Arg576Gln | Pitt-Hopkins syndrome (610954) | |
| Postnatal microcephaly | ||||
| Seizures | ||||
| (Amiel et al.) [ | ||||
| 1b | Limb shortening | |||
| Aplasia/hypoplasia of the fibula | ||||
| Aplasia/hypoplasia of the ulna | ||||
| (Woods et al.) [ | ||||
| 2b | Synostosis of carpals/tarsals | c.104C > G p.Pro35Arg | Tarsal-carpal coalition syndrome (186570) | |
| Proximal symphalangism | ||||
| Radial head subluxation | ||||
| (Dixon et al.) [ | ||||
| 3b | Adrenal hypoplasia | c.341C > T p.Ala114Val | Serkal syndrome or sex reversal, female, with dysgenesis of kidneys, adrenals, and lungs (611812) | |
| Intrauterine growth retardation | ||||
| Renal agenesis | ||||
| (Mandel et al.) [ | ||||
| 4b | Anal atresia | c.2188_2207del | Pallister-Hall syndrome (146510) | |
| Central polydactyly (hands) | ||||
| Short thumb | ||||
| (Killoran et al.) [ | ||||
| 5b | Global developmental delay | c.86G > A p.Arg29Gln | Lathosterolosis | |
| Postaxial polydactyly of foot | ||||
| Toe syndactyly | ||||
| (Brunetti-Pierri et al.) [ | ||||
| 6b | Central polydactyly (feet) | c.434 T > A p.Leu145Ter | Carpenter syndrome (201000) | |
| Craniosynostosis | ||||
| Finger syndactyly | ||||
| (Jenkins et al.) [ | ||||
| 7b | Cleft palate | c.571G > A p.Glu191Lys | Desmosterolosis (602398) | |
| Short stature | ||||
| Aplasia cutis congenita | ||||
| (Waterham et al.) [ | ||||
| 8b | Generalized hypotonia | c.1310C > G p.Ser437Ter | Sotos syndrome (117550) | |
| Macrocephaly | ||||
| Overgrowth | ||||
| (Kurotaki et al.) [ | ||||
| 9b | Holoprosencephaly | c.832-1G > C | Smith-Lemli-Opitz syndrome (270400) | |
| Median cleft lip and palate | ||||
| Microcephaly | ||||
| (Wright et al.) [ | ||||
| 10b | Short stature | c.137C > T p.Pro46Leu | Acrocapitofemoral dysplasia (607778) | |
| Limb shortening | ||||
| Cone-shaped epiphysis | ||||
| (Hellemans et al.) [ |
Table 1 summarizes the characteristics of the test-patients selected from the literature. The first column lists the identification number assigned to each patient. The phenotypic traits selected by the medical geneticist “blinded” to the diagnoses and the reference articles are listed in the second column. The affected gene, exact mutation, and corresponding diagnosis for each test-patient are also included in this table.
Four real patients analyzed by PhenoVar
| A | Cleft palate | c.1165C > T (p.Arg389Cys) | Cleft palate, isolated; cleft palate and mental retardation (119540) | 1 | |
| Congenital myopia | |||||
| Global developmental delay | |||||
| Micrognathia | |||||
| B | Cutis laxa | c.5741G > A(p.Arg1914His)/c.682insT (p.Cys228Fs) | Short stature, optic nerve atrophy, and Pelger-Huet anomaly (614800) | 2 | |
| Hydrocephalus | |||||
| Intellectual disability | |||||
| Optic atrophy | |||||
| C | Abnormality of dental enamel | c.902A > G (p.Glu301Gly)/c.902A > G (p.Glu301Gly) | Naxos disease (601214) | 3 | |
| Generalized ichthyosis | |||||
| Palmar hyperkeratosis | |||||
| Plantar hyperkeratosis | |||||
| Woolly hair | |||||
| D | Congenital cataract | c.3149G > A (p.Gly1050Glu) | Porencephaly, Familial (175780) | 7 | |
| Intellectual disability | |||||
| Microcephaly | |||||
| Seizures |
Table 2 summarizes four examples illustrating that Phenovar can be used with real patients data. The first column lists the identification letter assigned to each patient. The phenotypic traits used when running PhenoVar are listed in the second column. The next three columns denote the affected gene, exact mutation, and corresponding diagnosis (as determined after standard analysis of all the data, i.e. without using PhenoVar) for each patient. Finally, the last column indicates the ranking assigned by PhenoVar to the correct diagnosis.
Diagnosis prediction for test-patients using PhenoVar
| 1a | 3631 | 11 | 1 | 1 | 2 |
| 2a | 3848 | 1 | 1 | 1 | 3 |
| 3a | 3842 | >200 | 37 | 7 | 1 |
| 4a | 3841 | 84 | 11 | 3 | 2 |
| 5a* | 4353 | 26 | 2 | 1 | 2 |
| 6a* | 3913 | 30 | 3 | 2 | 2 |
| 7a | 3850 | >200 | 131 | 22 | 1 |
| 8a* | 3819 | 1 | 1 | 1 | 3 |
| 9a | 4519 | 2 | 1 | 1 | 2 |
| 10a | 3799 | 2 | 1 | 1 | 3 |
| 1b | 3 | 1 | 1 | 2 | |
| 2b | 3848 | 6 | 4 | 4 | 2 |
| 3b | 3842 | 100 | 3 | 1 | 1 |
| 4b | 3841 | 4 | 1 | 1 | 2 |
| 5b | 4353 | 3 | 1 | 1 | 3 |
| 6b | 3913 | 136 | 8 | 2 | 1 |
| 7b | 3850 | 11 | 1 | 1 | 2 |
| 8b | 3819 | 156 | 17 | 1 | 1 |
| 9b | 4519 | 22 | 2 | 1 | 2 |
| 10b | 3799 | 1 | 1 | 1 | 3 |
The first column in this table lists the identification number assigned to each test-patient. The number of variants with global minor allele frequency (GMAF) of less than 5% present in the modified exome assigned to each patient is highlighted in the second column. The next three columns denote the position of the correct diagnosis for each patient, as ranked by Phenovar using some of its different options: first solely based on the selected phenotypic traits of the respective patient (third column); next, by integrating the phenotypic traits and variants present in the exome of the patient: while assigning the same weight to all variants (fourth column); and finally, by assigning a higher weight to mutations known or predicted to cause disease (fifth column). The last column indicates how many of the traits selected by the medical geneticist “blinded” to the correct diagnoses matched any traits in Phenobase.
*Mutation annotated incorrectly (please refer to discussion).
Figure 2Future directions about Phenovar or similar software using the I-MPOS approach and data from real patients. A database containing phenotypic and encrypted genomic information of real patients with known or not-yet identified diagnoses can be made available (1). A patient with an unknown diagnosis presents in clinic. His encrypted ES data are obtained and his phenotype is assessed (2 α, β). The software automatically searches the “Encrypted Patients Database” using target patient’s assessed phenotype (3) thereby providing a first ranking of possible genetic conditions based on “phenotype weight” (4), (5). For all patients in the database meeting a specific phenotype-similarity threshold in relation to the proband, the software will compare the changes present in their genomes against the ones present in the genome of the patient seen in clinic (6). Matching the proband with the phenotypically similar subjects in the database based on similarity of their genetic changes (“mutation weight”) forms the basis of adjusting the first ranking to calculate the second ranking (6), (7). Subjects sharing adequate phenotypic characteristics who also share a genetic variant cluster together. As a result, a given match is indicative of the possibility that the target subject shares the same genetic condition with the matched other subject(s). After the second ranking, the information about the shared phenotype and genotype of the patients clustering together is accessible and can aid in reaching the diagnosis. It should be noted that “phenotype” (steps 2–5) is not limited to clinical traits but also refers to other levels of phenotype, such as a metabolomic profile. Also, the word “mutation” (steps 6, 7) can refer to variants in more than one genetic loci which are simultaneously present in all matched patients allowing one to explore the possibility of gene-gene interaction.