| Literature DB >> 29946186 |
Anibal Bueno1, Rocío Rodríguez-López1,2, Armando Reyes-Palomares3, Elena Rojano1, Manuel Corpas4, Julián Nevado2,5, Pablo Lapunzina2,5, Francisca Sánchez-Jiménez1,2, Juan A G Ranea6,7.
Abstract
Copy number variations (CNVs) are genomic structural variations (deletions, duplications, or translocations) that represent the 4.8-9.5% of human genome variation in healthy individuals. In some cases, CNVs can also lead to disease, being the etiology of many known rare genetic/genomic disorders. Despite the last advances in genomic sequencing and diagnosis, the pathological effects of many rare genetic variations remain unresolved, largely due to the low number of patients available for these cases, making it difficult to identify consistent patterns of genotype-phenotype relationships. We aimed to improve the identification of statistically consistent genotype-phenotype relationships by integrating all the genetic and clinical data of thousands of patients with rare genomic disorders (obtained from the DECIPHER database) into a phenotype-patient-genotype tripartite network. Then we assessed how our network approach could help in the characterization and diagnosis of novel cases in clinical genetics. The systematic approach implemented in this work is able to better define the relationships between phenotypes and specific loci, by exploiting large-scale association networks of phenotypes and genotypes in thousands of rare disease patients. The application of the described methodology facilitated the diagnosis of novel clinical cases, ranking phenotypes by locus specificity and reporting putative new clinical features that may suggest additional clinical follow-ups. In this work, the proof of concept developed over a set of novel clinical cases demonstrates that this network-based methodology might help improve the precision of patient clinical records and the characterization of rare syndromes.Entities:
Mesh:
Year: 2018 PMID: 29946186 PMCID: PMC6138686 DOI: 10.1038/s41431-018-0139-x
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Fig. 1Generation of a tripartite network using DECIPHER patient data. Circles represent phenotypes and rectangles loci. (1) Patients are phenotypically annotated using HPO terms; (2) a locus is defined as the chromosomal region where a set of patient CNVs overlap; (3) the HPOs-patients layers; (4) the patients-loci layers; (5) the final tripartite network
Fig. 2Hypergeometric Index (HyI) equation. 'A' represents a phenotype node and ‘B' a locus node within the tripartite network
Fig. 3Calculation of the Hypergeometric Index (HyI) in two scenarios within a tripartite network. Scenarios 1 and 2 show the HyI values for high and low prevalent phenotypes, respectively, connected to a locus via three patients
Examples of HPO-phenotypes vs. locus associations identified in the system
| HPO code | Phenotype | Max HyI (del.) | Max HyI (dup.) | ||
|---|---|---|---|---|---|
| HP:0002813 | Abnormality of limb bone morphology | 4.93 | 3.10 | Ch 7: 41613503–42807486 | Ch 9: 11818351–12709928 |
| HP:0000284 | Abnormality of ocular region | 3.39 | 1.65 | Ch 7: 41518389–41613502 | Ch Y: 945080–2654860 |
| HP:0000153 | Abnormality of the mouth | 3.50 | 1.94 | Ch 2: 200208169–200246437 | Ch 22: 40849826–41082043 |
| HP:0001315 | Reduced tendon reflexes | 3.16 | 3.54 | Ch 3: 6036656–6045520 | Ch 20: 29462074–29833608 |
| HP:0010477 | Aplasia of the bladder | – | 3.36 | – | Ch 17: 34817222–34817420 |
| HP:0001933 | Subcutaneous hemorrhage | 3.38 | – | Ch 21: 15398168–15412670 | – |
| HP:0200008 | Intestinal polyposis | 3.56 | – | Ch 10: 89717525–93614902 | – |
| HP:0001789 | Hydrops fetalis | 3.56 | – | Ch 13: 80378611–80386671 | – |
| HP:0003186 | Inverted nipples | 2.90 | 3.54 | Ch X: 455566–544731 | Ch 16: 75683739–78186860 |
| HP:0000699 | Diastema | 3.08 | 3.36 | Ch 5: 13750113–14064732 | Ch 7: 2290686–2996437 |
| HP:0010761 | Broad columella | 3.56 | – | Ch 19: 48066340–48270667 | – |
| HP:0008110 | Equinovarus deformity | 3.26 | 3.36 | Ch 16: 2038810–2124458 | Ch 16: 90148342–90148393 |
| HP:0002323 | Anencephaly | – | 3.36 | – | Ch 17: 34817222–34817420 |
Columns: (1) HPO code; (2) Phenotype description; (3) Maximun HyI obtained for the phenotype associated to a locus in the de novo deletions subnetwork; (4) Maximun HyI obtained for the phenotype associated to a locus in the de novo duplications subnetwork; (5) Chromosome id: the start and end coordinates (in bps in the hg19 genome reference) of the locus associated with the phenotype with the Max HyI value in the de novo deletions subnetwork; and (6) in the de novo duplications subnetwork
Fig. 4Identification of phenotype-locus associations for new clinical cases. A CNV from a new patient (Query) is assigned to a locus (CNV 1) in the tripartite network by genomic overlap comparison (left side of the figure). All the phenotypes associated to patients are ranked based on their HyI association value to the query locus (right side)
Statistics of the comparison between the clinical records of the 293 patients with rare CNV genomic disorders and the HPO phenotypes-loci associations identified by the system
| # HPO phenotypes diagnosed by the INGEMM clinicians for all the patients | 1694 |
| # Diagnosed HPOs for all the patients presenting a deletion | 1489 |
| # Diagnosed HPOs for all the patients presenting a duplication | 609 |
| # Diagnosed HPOs also identified by the system using the de novo deletions network | 381 |
| # Diagnosed HPOs also identified by the system using the de novo duplication network | 252 |
| # HPOs identified by the system and not diagnosed (with HyI > 2, penetrance 100%, and loci overlap 100%) using the de novo deletions network | 521 |
| # HPOs identified by the system and not diagnosed (with HyI > 2, penetrance 100%, and loci overlap 100%) using the de novo duplications network | 376 |
Example of the CNV analysis for three INGEMM´s patients
| Patient 1 | Deletion | chr 12 | Mutation start hg19: 30 946 782 | Mutation end hg19: 132 246 215 | |
|---|---|---|---|---|---|
|
|
|
|
|
|
|
| Abnormal facial shape (HP:0001999) | + Low-set ears (HP:0000369) | 5.41 | 100% | 100% | 100% |
| -> Abnormal location of ears (HP:0000357) | 5.16 | 100% | 100% | 100% | |
| -> Abnormality of the outer ear (HP:0000356) | 4.46 | 100% | 100% | 100% | |
| -> Abnormality of the ear (HP:0000598) | 4.01 | 100% | 100% | 100% | |
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
| |
| Global developmental delay (HP:0001263) | + Abnormality of body weight (HP:0004323) | 3.28 | 100% | 100% | 100% |
| -> Growth abnormality (HP:0001507) | 2.68 | 100% | 67% | 100% | |
| + Short stature (HP:0004322) | 3.03 | 100% | 70% | 100% | |
| -> Abnormality of body height (HP:0000002) | 2.70 | 100% | 72% | 100% | |
| -> Growth delay (HP:0001510) | 2.63 | 100% | 50% | 100% | |
Header: Chromosome coordinates of the CNV (mapped to the hg19 genome reference). Columns from left to right: (1) Observed phenotypes in the patient reported by the physician; (2) Associated phenotypes: The list of phenotypes identified in the network with a significant level of association (HyI value) with the overlapping locus to the patient CNV. “+” indicates that the HPO term is the most specific one among a group of them (children). “>” is used for detected parental terms that are related to the child term (less specific terms) in the HPO ontology; (3) HyI (Hypergeometric Index) Rank; (4) Penetrance; (5) % max; (6) Node overlap.
Patients with the syndrome associated with CNVs in the 19p13.3 region
|
|
The table shows the patient ids; Type of CNV: “−” for deletions and “+” for duplications. First column: phenotype descriptions and HPO codes. Boxes with X indicate that the phenotype has been previously diagnosed in the corresponding patient by the clinician examination. Phenotypes found by the systemic approach with significant HyI values (HyI value ≥ 2.0; p-value ≤ 0.01) are represented by dark gray boxes, and those detected with lower HyI values by light gray boxes. Phenotypes are grouped in four general categories: Growth and development, neurology, others, and an extra category for those found by the system but not previously diagnosed