| Literature DB >> 35047860 |
Michael A Levy1, Haley McConkey1, Jennifer Kerkhof1, Mouna Barat-Houari2, Sara Bargiacchi3, Elisa Biamino4, María Palomares Bralo5, Gerarda Cappuccio6,7, Andrea Ciolfi8, Angus Clarke9, Barbara R DuPont10, Mariet W Elting11, Laurence Faivre12,13, Timothy Fee10, Robin S Fletcher10, Florian Cherik14,15, Aidin Foroutan16, Michael J Friez10, Cristina Gervasini17, Sadegheh Haghshenas16, Benjamin A Hilton10, Zandra Jenkins18, Simranpreet Kaur19, Suzanne Lewis20, Raymond J Louie10, Silvia Maitz21, Donatella Milani22, Angela T Morgan23, Renske Oegema24, Elsebet Østergaard25,26, Nathalie Ruiz Pallares2, Maria Piccione27, Simone Pizzi8, Astrid S Plomp28, Cathryn Poulton29, Jack Reilly16, Raissa Relator1, Rocio Rius30,31, Stephen Robertson18, Kathleen Rooney1,16, Justine Rousseau32, Gijs W E Santen33, Fernando Santos-Simarro5, Josephine Schijns34, Gabriella Maria Squeo35, Miya St John23, Christel Thauvin-Robinet12,13,36,37, Giovanna Traficante3, Pleuntje J van der Sluijs33, Samantha A Vergano38,39, Niels Vos40, Kellie K Walden10, Dimitar Azmanov41, Tugce Balci42,43, Siddharth Banka44,45, Jozef Gecz46,47, Peter Henneman28, Jennifer A Lee10, Marcel M A M Mannens28, Tony Roscioli48,49,50,51, Victoria Siu42,43, David J Amor23, Gareth Baynam29,29,52, Eric G Bend53, Kym Boycott54,55, Nicola Brunetti-Pierri6,7, Philippe M Campeau32, John Christodoulou19, David Dyment56, Natacha Esber57, Jill A Fahrner58, Mark D Fleming59, David Genevieve15, Kristin D Kerrnohan54,60, Alisdair McNeill61, Leonie A Menke34, Giuseppe Merla35,62, Paolo Prontera63, Cheryl Rockman-Greenberg64, Charles Schwartz10, Steven A Skinner10, Roger E Stevenson10, Antonio Vitobello12,36, Marco Tartaglia8, Marielle Alders28, Matthew L Tedder10, Bekim Sadikovic1,16.
Abstract
Overlapping clinical phenotypes and an expanding breadth and complexity of genomic associations are a growing challenge in the diagnosis and clinical management of Mendelian disorders. The functional consequences and clinical impacts of genomic variation may involve unique, disorder-specific, genomic DNA methylation episignatures. In this study, we describe 19 novel episignature disorders and compare the findings alongside 38 previously established episignatures for a total of 57 episignatures associated with 65 genetic syndromes. We demonstrate increasing resolution and specificity ranging from protein complex, gene, sub-gene, protein domain, and even single nucleotide-level Mendelian episignatures. We show the power of multiclass modeling to develop highly accurate and disease-specific diagnostic classifiers. This study significantly expands the number and spectrum of disorders with detectable DNA methylation episignatures, improves the clinical diagnostic capabilities through the resolution of unsolved cases and the reclassification of variants of unknown clinical significance, and provides further insight into the molecular etiology of Mendelian conditions.Entities:
Keywords: Clinical diagnostics; DNA methylation; Epigenetics; Episignatures; Neurodevelopmental disorders
Year: 2021 PMID: 35047860 PMCID: PMC8756545 DOI: 10.1016/j.xhgg.2021.100075
Source DB: PubMed Journal: HGG Adv ISSN: 2666-2477
List of episignatures and their corresponding syndromes and genes or genomic regions
| Syndrome | Signature abbreviation | Underlying gene or region | OMIM | Samples | In EpiSign V2 classifier |
|---|---|---|---|---|---|
| X-linked alpha-thalassemia/mental retardation syndrome (ATRX) | ATRX | 22 | yes | ||
| Arboleda-Tham syndrome (ARTHS) | ARTHS | 18 | no | ||
| Autism, susceptibility to, 18 (AUTS18) | AUTS18 | 28 | yes | ||
| Beck-Fahrner syndrome (BEFAHRS) | BEFAHRS | 16 | no | ||
| Blepharophimosis Intellectual disability SMARCA2 syndrome | BISS | 5 | yes | ||
| Börjeson-Forssman-Lehmann syndrome (BFLS) | BFLS | 16 | yes | ||
| Cerebellar ataxia, deafness, and narcolepsy, autosomal dominant (ADCADN) | ADCADN | 5 | yes | ||
| CHARGE syndrome | CHARGE | 65 | yes | ||
| Chr16p11.2 deletion syndrome, 593-KB | Chr16p11.2del | Chr16p11.2 deletion | 18 | no | |
| Coffin-Siris syndrome-1,2 (CSS1,2) | CSS_c.6200 | 4 | no | ||
| Coffin-Siris syndrome-1,2,3,4 (CSS1,2,3,4); Nicolaides-Baraitser syndrome (NCBRS) | BAFopathy | 97 | yes | ||
| Coffin-Siris syndrome-4 (CSS4) | CSS4_c.2656 | 3 | no | ||
| Coffin-Siris syndrome-9 (CSS9) | CSS9 | 10 | no | ||
| Cohen-Gibson syndrome (COGIS); Weaver syndrome (WVS) | PRC2 | 7 | yes | ||
| Cornelia de Lange syndromes 1,2,3,4 (CDLS1,2,3,4) | CdLS | 57 | yes | ||
| Down syndrome | Down | Chr21 trisomy | 40 | yes | |
| Dystonia 28, childhood-onset (DYT28) | DYT28 | 11 | no | ||
| Epileptic encephalopathy, childhood-onset (EEOC) | EEOC | 8 | yes | ||
| Floating Harbor syndrome (FLHS) | FLHS | 20 | yes | ||
| Gabriele-de Vries syndrome (GADEVS) | GADEVS | 10 | no | ||
| Genitopatellar syndrome (see also Ohdo syndrome, SBBYSS variant) (KAT6B) | GTPTS | 4 | yes | ||
| Helsmoortel-van der Aa syndrome (HVDAS) | HVDAS_C | 13 | yes | ||
| Helsmoortel-van der Aa syndrome (HVDAS) | HVDAS_T | 23 | yes | ||
| Hunter McAlpine craniosynostosis syndrome | HMA | Chr5q35-qter duplication | 4 | yes | |
| Immunodeficiency-centromeric instability-facial anomalies syndrome 1 (ICF1) | ICF_1 | 8 | yes | ||
| Immunodeficiency-centromeric instability-facial anomalies syndromes 2,3,4 (ICF2,3,4) | ICF_2_3_4 | 7 | yes | ||
| Intellectual developmental disorder with seizures and language delay (IDDSELD) | IDDSELD | 10 | yes | ||
| Kabuki syndromes 1,2 (KABUK1,2) | Kabuki | 149 | yes | ||
| KDM2B-related syndrome | KDM2B | unofficial | 9 | no | |
| Autosomal dominant intellectual developmental disorder-65 (MRD65) | KDM4B | 6 | no | ||
| Kleefstra syndrome 1 (KLEFS1) | Kleefstra | 32 | yes | ||
| Koolen de Vreis syndrome (KDVS) | KDVS | 11 | yes | ||
| Luscan-Lumish syndrome (LLS) | LLS | 4 | no | ||
| Menke-Hennekam syndromes 1,2 (MKHK1,2) | MKHK_ID4 | 13 | no | ||
| Intellectual developmental disorder, X-linked, syndromic, Armfield type (MRXSA) | MRXSA | 6 | no | ||
| Mental retardation, autosomal dominant 23 (MRD23) | MRD23 | 25 | yes | ||
| Mental retardation, autosomal dominant 51 (MRD51) | MRD51 | 7 | yes | ||
| Intellectual developmental disorder, X-linked 93 (MRX93) | MRX93 | 11 | yes | ||
| Intellectual developmental disorder, X-linked 97 (MRX97) | MRX97 | 15 | yes | ||
| Intellectual developmental disorder, X-linked syndromic, Nascimento-type (MRXSN) | MRXSN | 4 | yes | ||
| Intellectual developmental disorder, X-linked, Snyder-Robinson type (MRXSSR) | MRXSSR | 17 | yes | ||
| Intellectual developmental disorder, X-linked, syndromic, Claes-Jensen type (MRXSCJ) | MRXSCJ | 49 | yes | ||
| Myopathy, lactic acidosis, and sideroblastic anemia 2 (MLASA2) | MLASA2 | 11 | no | ||
| Ohdo syndrome, SBBYSS variant (SBBYSS) | SBBYSS | 10 | yes | ||
| Phelan-McDermid syndrome (PHMDS) | PHMDS | Chr22q13.3 deletion | 11 | no | |
| Rahman syndrome (RMNS) | RMNS | 8 | yes | ||
| Renpenning syndrome (RENS1) | RENS1 | 8 | no | ||
| Rubinstein-Taybi syndrome 1 (RSTS1) | RSTS1 | 37 | no | ||
| Rubinstein-Taybi syndromes 1,2 (RSTS1,2) | RSTS | 39 | yes | ||
| Rubinstein-Taybi syndrome 2 (RSTS2) | RSTS2 | 29 | no | ||
| Sotos syndrome 1 (SOTOS1) | Sotos | 69 | yes | ||
| Tatton-Brown-Rahman syndrome (TBRS) | TBRS | 27 | yes | ||
| Velocardiofacial syndrome (VCFS) | VCFS | Chr22q11.2 deletion | 11 | no | |
| Wiedemann-Steiner syndrome (WDSTS) | WDSTS | 42 | yes | ||
| Williams-Beuren deletion syndrome (WBS) | Williams | Chr7q11.23 deletion | 22 | yes | |
| Williams-Beuren duplication syndrome (Chr7q11.23 duplication syndrome) | Dup7 | Chr7q11.23 duplication | 13 | yes | |
| Wolf-Hirschhorn syndrome (WHS) | WHS | Chr4p16.13 deletion | 12 | yes |
Episignatures that encompass a specific region or variant within a gene.
Figure 1Methylation differences of probes used for episignatures
Methylation differences between cases and controls for the microarray probes that make up each episignature for the newly identified and previously reported episignatures. Red lines indicate mean methylation for each episignature. Asterisk indicates new episignatures and/or those that have not previously been included in the multiclass classifier.
Figure 2Gene region- or variant-specific sub-signatures
(A) The last exon of ARID1A and ARID1B shown with the location of seven variants in the c.6200 region colored by whether they match the c.6200 episignature or not. (B and C) MDS (B) and hierarchical clustering (C) plots of the seven samples showing that the four central samples have a matching episignature, while the outer three cluster with controls. For hierarchical clustering plots, each row represents one microarray probe, and each column represents one sample. (D) Gene diagram of SMARCA4 (NM_001128849.1) showing the location of the three c.2656A>G variants in exon 19 (red arrowhead). The five horizontal gray bars indicate the locations of protein domains: QLQ, HSA, helicase ATP-binding, helicase C-terminal, and bromodomain. (E and F) MDS (E) and hierarchical clustering (F) showing that the three CSS4 samples with the above variant cluster separately from controls and from other BAFopathy samples. (G) Protein diagram of CREBBP/EP300 showing the location of protein domains (gray boxes) and intrinsically disordered (ID) domains (numbered). (H and I) MDS (H) and hierarchical clustering (I) showing the MKHK_ID4 samples clustering separately from controls and from other MKHK samples.
Figure 3Identifying episignatures to distinguish between closely related syndromes
Hierarchical clustering and MDS plots are shown for each episignature. For hierarchical clustering plots, each row represents one microarray probe, and each column represents one sample. (A and B) ARTHS probe selection using only ARTHS and control samples. (C and D) ARTHS probe selection when GTPTS and SBBYSS samples are included as controls. (E and F) The previously reported RSTS (RSTS1/RSTS2 combined) episignature. (G and H) The RSTS1 episignature generated by including RSTS2 samples as control. (I and J) the RSTS2 episignature generated by including RSTS1 samples as control.
Figure 4Support vector machine-based classifiers for concurrent episignature detection
(A) Each of the 19 new episignatures were evaluated using 4-fold cross-validation. For each fold, a different 25% of samples were used for testing (blue), and the remaining 75% of samples were used for training (gray). The case samples for each episignature are shown in red. The eight testing samples referenced in the text are labeled. (B) The final classifiers for all 57 episignatures. The case samples for each episignature are shown in red, and all other samples are in black. Non-case RSTS samples (for example, RSTS1 samples in the RSTS2 column) are in blue. Arrowheads indicate the two GADEVS samples with high BAFopathy scores.
Figure 5Screening unresolved cases
Samples with MVP scores greater than 0.5 were further assessed by unsupervised clustering plots. Hierarchical clustering and MDS plots are shown for each case. For hierarchical clustering plots, each row represents one microarray probe, and each column represents one sample. (A) Sample Unresolved_1, a previously unresolved case that matches the MKHK_ID4 episignature. (B) Sample Unresolved_2, a previously unresolved case that matches the LLS episignature. (C) Sample Unresolved_3, a previously unresolved case that matches the VCFS episignature.