Claire C Homan1, Sarah L King-Smith1, David M Lawrence2, Peer Arts1, Jinghua Feng3, James Andrews1, Mark Armstrong1, Thuong Ha1, Julia Dobbins1, Michael W Drazer4, Kai Yu5, Csaba Bödör6, Alan Cantor7, Mario Cazzola8, Erin Degelman9, Courtney D DiNardo10, Nicolas Duployez11, Remi Favier12, Stefan Fröhling13, Jude Fitzgibbon14, Jeffery M Klco15, Alwin Krämer16, Mineo Kurokawa17, Joanne Lee18, Luca Malcovati8, Neil V Morgan19, Georges Natsoulis20, Carolyn Owen9, Keyur P Patel10, Claude Preudhomme11, Hana Raslova21, Hugh Rienhoff20, Tim Ripperger22, Rachael Schulte23, Kiran Tawana24, Elvira Velloso25, Benedict Yan18, Paul Liu5, Lucy A Godley4, Andreas W Schreiber26, Christopher N Hahn27, Hamish S Scott28, Anna L Brown29. 1. Department of Genetics and Molecular Pathology, SA Pathology, Adelaide, SA, Australia; Centre for Cancer Biology, SA Pathology and University of South Australia, Adelaide, SA. 2. Department of Genetics and Molecular Pathology, SA Pathology, Adelaide, SA, Australia; Centre for Cancer Biology, SA Pathology and University of South Australia, Adelaide, SA, Australia; Australian Cancer Research Foundation (ACRF) Cancer Genomics Facility, Centre for Cancer Biology, SA Pathology, Adelaide, SA. 3. Centre for Cancer Biology, SA Pathology and University of South Australia, Adelaide, SA, Australia; Australian Cancer Research Foundation (ACRF) Cancer Genomics Facility, Centre for Cancer Biology, SA Pathology, Adelaide, SA. 4. Section of Hematology/Oncology, Departments of Medicine and Human Genetics, Center for Clinical Cancer Genetics, and The University of Chicago Comprehensive Cancer Center, The University of Chicago, Chicago, IL. 5. National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892. 6. HCEMM-SE Molecular Oncohematology Research Group, 1st Department of P athology and E xperimental Cancer R esearch, Semmelweis U niversity, B udapest, H ungary. 7. Division of Hematology/Oncology, Boston Children's Hospital and Dana Farber Cancer Institute, Harvard Medical School, Boston, MA 02115. 8. Department of Molecular Medicine, University of Pavia, Pavia, Italy; Department of Hematology Oncology, Fondazione IRCCS Policlinico San Matteo, Pavia. 9. Division of Hematology and Hematological Malignancies, Foothills Medical Centre, Calgary, AB. 10. Department of Leukemia, University of Texas MD Anderson Cancer Center, Houston, TX. 11. Laboratory of Hematology, Biology and Pathology Center, Centre Hospitalier Regional Universitaire de Lille, Lille, France; Jean-Pierre Aubert Research Center, INSERM, Universitaire de Lille, Lille. 12. Assistance Publique- Hôpitaux de Paris, Armand Trousseau children's Hospital, Paris. 13. Department of Translational Medical Oncology, National Center for Tumor Diseases (NCT) and German Cancer Research Center (DKFZ), Heidelberg, Germany; German Cancer Consortium (DKTK), Heidelberg, Germany. 14. Centre for Haemato-Oncology, Barts Cancer Institute, Queen Mary University of London, London. 15. St Jude Children's Research Hospital, Memphis, Tennessee, United States. 16. Clinical Cooperation Unit Molecular Hematology/Oncology, German Cancer Research Center (DKFZ) and Dept. of Internal Medicine V, University of Heidelberg, Heidelberg, Germany. 17. Department of Hematology and Oncology, Graduate School of Medicine, The University of Tokyo, Japan. 18. Department of Haematology-Oncology, National University Cancer Institute, National University Health System. 19. Institute of Cardiovascular Sciences, College of Medical and Dental Sciences, University of Birmingham. 20. Imago Biosciences, Inc., San Francisco, CA. 21. Institut Gustave Roussy, Université Paris Sud, Equipe Labellisée par la Ligue Nationale Contre le Cancer, Villejuif. 22. Department of Human Genetics, Hannover Medical School, Hannover, Germany. 23. Department of Pediatrics, Division of Pediatric Hematology and Oncology, Monroe Carell Jr. Children's Hospital, Vanderbilt University Medical Center, Nashville, TN. 24. Department of Haematology, Addenbrooke's Hospital. Cambridge, CB2 0QQ. 25. Service of Hematology, Transfusion and Cell Therapy and Laboratory of Medical Investigation in Pathogenesis and Directed Therapy in Onco-Immuno-Hematology (LIM-31) HCFMUSP, University of Sao Paulo Medical School, Sao Paulo, Brazil; Genetics Laboratory, Hospital Israelita Albert Einstein, Sao Paulo, Brazil. 26. Centre for Cancer Biology, SA Pathology and University of South Australia, Adelaide, SA, Australia; Australian Cancer Research Foundation (ACRF) Cancer Genomics Facility, Centre for Cancer Biology, SA Pathology, Adelaide, SA, Australia; School of Biological Sciences, University of Adelaide, Adelaide, SA. 27. Department of Genetics and Molecular Pathology, SA Pathology, Adelaide, SA, Australia; Centre for Cancer Biology, SA Pathology and University of South Australia, Adelaide, SA, Australia; School of Medicine, University of Adelaide, Adelaide, SA. 28. Department of Genetics and Molecular Pathology, SA Pathology, Adelaide, SA, Australia; Centre for Cancer Biology, SA Pathology and University of South Australia, Adelaide, SA, Australia; School of Biological Sciences, University of Adelaide, Adelaide, SA, Australia; School of Medicine, University of Adelaide, Adelaide, SA. 29. Department of Genetics and Molecular Pathology, SA Pathology, Adelaide, SA, Australia; Centre for Cancer Biology, SA Pathology and University of South Australia, Adelaide, SA, Australia; School of Medicine, University of Adelaide, Adelaide, SA. anna.brown@sa.gov.au.
Familial platelet disorder with associated myeloid malignancy (FPD-MM, OMIM:601399)[1,2] is a rare cancer predisposition syndrome caused by pathogenic germline variants in RUNX1.[3] Despite research dating back over two decades, many challenges remain in improving outcomes for individuals with FPD-MM.[4] Firstly, the syndrome may go unrecognized due to poor recognition of family history and/or access to appropriate genetic testing. Secondly, intentional screening or incidental detection (e.g., tumour-sequencing) of RUNX1 variants requires access to expert interpretation. Thirdly, after diagnosis, the relative rarity of the disorder inhibits the collation of sizeable local cohorts, making identification of commonalities in disease course and/or outcome highly challenging. To help overcome these significant challenges, we have developed an interactive public webbased international collaborative database for RUNX1: RUNX1db (https://runx1db.runx1-fpd.org/). RUNX1db is a centralized repository for germline RUNX1 variant information, associated next-generation sequencing (NGS) data, and expert-curated variant information (both germline and somatic).We recently identified, from publications, 140 different families with germline RUNX1 variants.[4] While being a rich resource, historically reported variants are largely not classified according to the American College of Medical Genetics and Genomics/Association for Molecular Pathology (ACMG/AMP) guidelines, only established in 2015.[5] Additionally, the Clinical Genome Resource myeloid malignancy variant curation expert panel (ClinGen MM-VCEP) recently created guidelines specific for classification of germline RUNX1 variants.[6] Gene-specific guidelines, while important, add additional complexity to the curation of identified variants. Making available expert knowledge to accurately classify these germline variants prevents both missing pathogenic variants or the misattribution of benign variants as causative in families.[7,8] Additionally, variants identified through clinical services and research studies don’t always make it into the public domain due to constraints associated with the reporting of variants through publication or variant repositories. To address some of these challenges, we updated curated variants from publications and undertook an international survey of colleagues, identifying unpublished variants. This study identified an additional 119 families (259 in total), with 164 unique variants. These included ten new variants not previously described (Figure 1 and Table S1). Using this data, we created the first comprehensive RUNX1 germline registry and performed expert curation of all variants according to the RUNX1-specific ACMG classification rules (ALB, CNH, LAG, LM, CDD MM-VCEP members). The registry represents the largest collection of curated and clinically classified RUNX1 germline variants to date, providing a unique clinical resource for researchers, clinical genomics laboratories, and haematologists (Figure 1, Table S1). Utilizing this resource, we have identified 97 pathogenic/ likely pathogenic RUNX1 variants, with 54 located within the RUNT domain (RHD)(75% of RHD variants), of which 24 are missense mutations. Only one pathogenic missense variant is observed outside of the RHD, suggesting the RHD is highly intolerant to genetic-variation. Most commonly observed pathogenic germline RUNX1 variations are whole-gene deletions (21 probands), deletion of exons 1-2 (9 probands), and mutation of amino acid p.Arg201 within the RHD (8 probands)(Table S1). Accessibility and update-ability of this information is available through a live-webportal which hosts the registry (https://runx1db.runx1-fpd.org/classification/classifications). Each curated variant has links to patient-phenotypic information and the current clinical classification, including the evidence for each ACMG code assessed and links to external clinical databases, including ClinVar and associated publications. Importantly, expert crowdsourcing allows the real-time updating of the database through user profile accounts. Newly-identified variants can be easily added to the database and are automatically annotated with over 137 parameters required for accurate classification (e.g., population frequency, pathogenicity predictions). These parameters populate a classificationtool that guides users stepwise through the ACMG classification of new variants (or updating current classifications with new information). Once curated and classified, collated information can be exported as an automated classification report summary, flagged for expert-review, shared with other users, and uploaded to ClinVar.
Figure 1.
Registry of germline Germline RUNX1 variants currently included in the RUNX1db registry are visualised using the ProteinPaint web application (https://pecan.stjude.cloud/home).12 Variants (displayed as protein changes where possible) are colour-coded according to pathogenicity classification as determined by the MM-VCEP RUNX1-specific recommendations. The number of probands for each variant is indicated within the circle where the number is greater than one. All variants are annotated to RUNX1c; NM_001754.4; LRG_ 482.
RUNX1database genomics cohort demographics.Registry of germline Germline RUNX1 variants currently included in the RUNX1db registry are visualised using the ProteinPaint web application (https://pecan.stjude.cloud/home).12 Variants (displayed as protein changes where possible) are colour-coded according to pathogenicity classification as determined by the MM-VCEP RUNX1-specific recommendations. The number of probands for each variant is indicated within the circle where the number is greater than one. All variants are annotated to RUNX1c; NM_001754.4; LRG_ 482.RUNX1 database genomics cohort demographic. (A) Breakdown of the number and types of NGS samples currently stored in the RUNX1db. Pre- Leukemic: thrombocytopenia, asymptomatic Other: includes post-transplant/post-treatment and saliva samples. Both WES and panel data is analysed and stored in the database. (B) Scatter plot displaying the age of the individual when each sample was collected. Major RUNX1db cohorts (malignancy and preleukemic samples) are displayed. The median age for each cohort is represented by the vertical line. Clinical demographics of the malignancy cohort is shown with the number of individuals with different types of FPD-MM malignancy presentation and the (C) gender and (D) age distribution; Adult ≥40years, AYA=15-39 years, children ≤14years. AML: Acute myeloid leukemia; MDS: myelodysplastic syndromes; MDS/MPN: myelodysplastic syndrome/myeloproliferative Neoplasm overlap; MPN: Myeloproliferative Neoplasm; ALL: acute lymphoblastic leukemia; AL: acute undifferentiated leukemia.In addition to a germline RUNX1 variant registry, RUNX1db has the capacity to house NGS datasets, creating the first international genomics cohort of this rare disease. This initiative intends to enable researchers to answer questions about FPD-MM beyond germline variant detection. For example, family members, heterozygous for RUNX1 mutations, can have varying clinical presentations indicating variable penetrance and expressivity. In almost all cases, germline RUNX1 carriers present with thrombocytopenia and qualitative platelet defects, and progression to hematologic malignancies (HM) is incompletely penetrant with variable age of onset ranging from early childhood to late adulthood.[2] Patients develop myeloid malignancies most frequently, and Tcell and, more rarely, B-cell acute lymphoblastic leukaemia (ALL).[4] Currently, there is no way to predict which individuals will progress to myelodysplastic syndrome (MDS), acute myeloid leukaemia (AML), or other HM. Accumulation of somatic mutations and additional germline modifier variants are mechanisms proposed to contribute to this heterogeneity.[4] NGS technology is widely used for surveillance and diagnosis of HM,[4] accumulating large amounts of data often not utilized beyond RUNX1 variant detection. Individual laboratories often only have small numbers of patients with deleterious RUNX1 germline variants, which makes asking larger questions about commonalities of genotype-phenotype, disease progression, monitoring, treatment and outcome, difficult.[9] To accumulate the data required to make evidence- based clinical decisions in FPD-MM, a dedicated resource utilizing the collective wealth of NGS data generated from research and diagnostic laboratories internationally is ideal in standardizing and collating diseasespecific clinical and genomics data. The database has also been designed for the accumulation, sharing and curation of genomics data acquired from individuals with germline RUNX1 mutations both pre- and post-malignancy progression. We have collated 179 NGS datasets, both whole-exome sequencing (WES) and HM gene panel data, from 19 distinct research centres worldwide. This includes NGS from 60 FPD-MM families and 120 individuals, making it the largest FPD-MM NGS dataset (Figure 2). The dataset includes individuals ranging in age from 1-76 years, malignancy phenotypes of AML, MDS, myelodysplastic syndrome/myeloproliferative neoplasm (MDS/MPN), ALL, and pre-leukemic phenotypes including thrombocytopenia and asymptomatic carriers (Table 1). Detailed clinical information for each patient and associated samples are stored on the database and can be updated, enabling specific phenotypic-genotypic cohort studies to be performed on the clinical spectrum of FPDMM. Additionally, the database can be updated easily with new NGS data as available, including longitudinal datasets from serial testing of individual patients. The database allows for a comprehensive, unbiased and customizable review of all RUNX1 germline datasets with all raw sequencing data being analyzed through a standardized bioinformatics pipeline. This is designed to identify both somatic and germline variants and is available on the database as variant level data (VCF, Figure S1). Using the integrated VariantGrid (https://github.com/SACGF/variantgrid) genomics analysis software, we have curated a panel of somatic variants for each dataset (including all malignancy and pre-leukemic samples), prioritizing the identification of potentially pathogenic variants in HM (2,643 variants, 167 samples). Standard filtering criteria were adapted for identifying somatic variants (Online Supplementary Figure S1). Variants that passed all filtering criteria were subsequently manually curated. Variants classified as having no clinical significance (benign/likely benign) according to ACMG/AMP guidelines, were excluded. Remaining variants were either classified as 1) Clinically relevant, 2) Possibly relevant, or 3) Unknown relevance (Online Supplementary Figure S1).[10,11] Curated somatic variant data is available through the interactiveoncoplot on the database homepage or variant page. Shared in real-time with the scientific community, this curated dataset has already allowed the selection of secondary mutations to model FPD-MM disease and therapy in vitro and in animals. Importantly, investigators can interrogate the data to answer additional research questions as the software provides a fully automated annotation of variants and allows non-bioinformaticians to filter, sort, analyze, and curate genetic variants stored in the database via a graphical interface (Online Supplementary Figure S2).
Figure 2.
RUNX1 database genomics cohort demographic. (A) Breakdown of the number and types of NGS samples currently stored in the RUNX1db. Pre- Leukemic: thrombocytopenia, asymptomatic Other: includes post-transplant/post-treatment and saliva samples. Both WES and panel data is analysed and stored in the database. (B) Scatter plot displaying the age of the individual when each sample was collected. Major RUNX1db cohorts (malignancy and preleukemic samples) are displayed. The median age for each cohort is represented by the vertical line. Clinical demographics of the malignancy cohort is shown with the number of individuals with different types of FPD-MM malignancy presentation and the (C) gender and (D) age distribution; Adult ≥40years, AYA=15-39 years, children ≤14years. AML: Acute myeloid leukemia; MDS: myelodysplastic syndromes; MDS/MPN: myelodysplastic syndrome/myeloproliferative Neoplasm overlap; MPN: Myeloproliferative Neoplasm; ALL: acute lymphoblastic leukemia; AL: acute undifferentiated leukemia.
Table 1.
RUNX1database genomics cohort demographics.
This project serves as a model for data accumulation for rare cancer predisposition syndromes. The adoption of a single database that serves as a repository for patient demographic and clinical data, a mutational germline registry, and patient genomics data, which can be interrogated as a large cohort are essential components for the diagnosis and treatment of patients with a rare-disorder such as FPD-MM. This resource is especially useful in FPD-MM, where the genetic cause is well established but variability in clinical presentation and disease development render diagnosis challenging. The aggregation of multiple families, individuals, and disease stages into a centralized database where all data undergo rigorous quality control using a single bioinformatics analysis strategy will aid in the exploration and discovery of the molecular progression of the disorder. The harmonized interpretation of genomic variants is imperative to understanding the mutational profile of a malignancy, which is achieved through a curated list of variants displayed for each sample. Institutional, national, and international ethics and data sharing guidelines may initially limit contributions to initiatives like this that are supported by patient advocates but need to be overcome, given the importance of the work. We envision that information from this database will guide precision-based approaches to patient care plans with reasonable surveillance and adequate counselling and, eventually, the application of new targeted therapies and interventions prior to malignancy development for germline RUNX1 carriers. With the continued accumulation of data and clinical information, this type of gene-specific database can provide the basis to developing evidence-based clinical decisions such as when to watch and wait and when to apply more aggressive therapies such as stem cell transplantation. Finally, we hope that this database will serve as a model from which similar efforts will emerge for other HMs, benefiting all our patients and families.
Authors: Xin Zhou; Michael N Edmonson; Mark R Wilkinson; Aman Patel; Gang Wu; Yu Liu; Yongjin Li; Zhaojie Zhang; Michael C Rusch; Matthew Parker; Jared Becksfort; James R Downing; Jinghui Zhang Journal: Nat Genet Date: 2016-01 Impact factor: 38.330
Authors: Marilyn M Li; Michael Datto; Eric J Duncavage; Shashikant Kulkarni; Neal I Lindeman; Somak Roy; Apostolia M Tsimberidou; Cindy L Vnencak-Jones; Daynna J Wolff; Anas Younes; Marina N Nikiforova Journal: J Mol Diagn Date: 2017-01 Impact factor: 5.568
Authors: Susan Branford; Paul Wang; David T Yeung; Daniel Thomson; Adrian Purins; Carol Wadham; Nur Hezrin Shahrin; Justine E Marum; Nathalie Nataren; Wendy T Parker; Joel Geoghegan; Jinghua Feng; Naranie Shanmuganathan; Martin C Mueller; Christian Dietz; Doris Stangl; Zoe Donaldson; Haley Altamura; Jasmina Georgievski; Jodi Braley; Anna Brown; Christopher Hahn; Ieuan Walker; Soo-Hyun Kim; Soo-Young Choi; Sa-Hee Park; Dong-Wook Kim; Deborah L White; Agnes S M Yong; David M Ross; Hamish S Scott; Andreas W Schreiber; Timothy P Hughes Journal: Blood Date: 2018-07-02 Impact factor: 22.113
Authors: Xi Luo; Simone Feurstein; Shruthi Mohan; Christopher C Porter; Sarah A Jackson; Sioban Keel; Michael Chicka; Anna L Brown; Chimene Kesserwan; Anupriya Agarwal; Minjie Luo; Zejuan Li; Justyne E Ross; Panagiotis Baliakas; Daniel Pineda-Alvarez; Courtney D DiNardo; Alison A Bertuch; Nikita Mehta; Tom Vulliamy; Ying Wang; Kim E Nichols; Luca Malcovati; Michael F Walsh; Lesley H Rawlings; Shannon K McWeeney; Jean Soulier; Anna Raimbault; Mark J Routbort; Liying Zhang; Gabriella Ryan; Nancy A Speck; Sharon E Plon; David Wu; Lucy A Godley Journal: Blood Adv Date: 2019-10-22
Authors: Sue Richards; Nazneen Aziz; Sherri Bale; David Bick; Soma Das; Julie Gastier-Foster; Wayne W Grody; Madhuri Hegde; Elaine Lyon; Elaine Spector; Karl Voelkerding; Heidi L Rehm Journal: Genet Med Date: 2015-03-05 Impact factor: 8.822
Authors: Anna L Brown; Peer Arts; Catherine L Carmichael; Milena Babic; Julia Dobbins; Chan-Eng Chong; Andreas W Schreiber; Jinghua Feng; Kerry Phillips; Paul P S Wang; Thuong Ha; Claire C Homan; Sarah L King-Smith; Lesley Rawlings; Cassandra Vakulin; Andrew Dubowsky; Jessica Burdett; Sarah Moore; Grace McKavanagh; Denae Henry; Amanda Wells; Belinda Mercorella; Mario Nicola; Jeffrey Suttle; Ella Wilkins; Xiao-Chun Li; Joelle Michaud; Peter Brautigan; Ping Cannon; Meryl Altree; Louise Jaensch; Miriam Fine; Carolyn Butcher; Richard J D'Andrea; Ian D Lewis; Devendra K Hiwase; Elli Papaemmanuil; Marshall S Horwitz; Georges Natsoulis; Hugh Y Rienhoff; Nigel Patton; Sally Mapp; Rachel Susman; Susan Morgan; Julian Cooney; Mark Currie; Uday Popat; Tilmann Bochtler; Shai Izraeli; Kenneth Bradstock; Lucy A Godley; Alwin Krämer; Stefan Fröhling; Andrew H Wei; Cecily Forsyth; Helen Mar Fan; Nicola K Poplawski; Christopher N Hahn; Hamish S Scott Journal: Blood Adv Date: 2020-03-24
Authors: Melanie Decker; Anupriya Agarwal; Andreas Benneche; Jane Churpek; Nicolas Duployez; Adam Duvall; Martijn P T Ernst; Alisa Förster; Hildegunn Høberg-Vetti; Inga Hofmann; Michelle Nash; Marc H G P Raaijmakers; Tor H A Tvedt; Adrianna Vlachos; Brigitte Schlegelberger; Thomas Illig; Tim Ripperger Journal: Blood Adv Date: 2022-06-14