| Literature DB >> 35754516 |
Courtney Thaxton1,2, Jennifer Goldstein1, Marina DiStefano3, Kathleen Wallace1, P Dane Witmer4, Melissa A Haendel5, Ada Hamosh6, Heidi L Rehm7,8, Jonathan S Berg1.
Abstract
The dilemma of how to categorize and classify diseases has been debated for centuries. The field of medical genetics has historically approached nosology based on clinical phenotypes observed in patients and families. Advances in genomic sequencing and understanding of genetic contributions to disease often provoke a need to reassess these classifications. The Clinical Genome Resource (ClinGen) has developed frameworks to classify the strength of evidence underlying monogenic gene-disease relationships, variant pathogenicity, and clinical actionability. It is therefore necessary to define the disease entity being evaluated, which can be challenging for genes associated with multiple conditions and/or a broad phenotypic spectrum. We therefore developed criteria to guide "lumping and splitting" decisions and improve consistency in defining monogenic gene-disease relationships. Here, we outline the precuration process, the lumping and splitting guidelines with examples, and describe the implications for clinical diagnosis, informatics, and care management.Entities:
Year: 2022 PMID: 35754516 PMCID: PMC9221396 DOI: 10.1016/j.xgen.2022.100131
Source DB: PubMed Journal: Cell Genom ISSN: 2666-979X
Representative genes evaluated to determine lumping and splitting criteria
| Gene *Date of OMIM record | Disease entity(s) per OMIM | Inheritance pattern | MIM# | Lump or split | Criteria met | Reference | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| {breast cancer, susceptibility to} | AD, Smu | 114480 | split | Assertion Phenotypic variability Inheritance pattern | Simonin et al.[ | |||||
| ataxia-telangiectasia | AR | 208900 | split | Assertion Phenotypic variability Inheritance pattern | ||||||
| lymphoma, B-cell non-Hodgkin, somatic | Som | n/a | not included as they are not germline | n/a | n/a | |||||
| lymphoma, mantle cell, somatic | Som | n/a | ||||||||
| T cell prolymphocytic leukemia, somatic | Som | n/a | ||||||||
| Emery-Dreifuss muscular dystrophy 6, X-linked | XL |
| lump | Assertion Molecular mechanism | Gueneau et al.[ | |||||
| myopathy, X-linked, with postural muscle atrophy | XL |
| Phenotypic variability | |||||||
| reducing body myopathy, X-linked 1a, severe, infantile, or early childhood onset | XL | 300717 | ||||||||
| reducing body myopathy, X-linked | XL | 300718 | ||||||||
| scapuloperoneal myopathy, X-linked dominant | XL | 300695 | ||||||||
| cognitive impairment with or without cerebellar ataxia | AD | 614306 | split | Molecular mechanism, Phenotypic variability | Gertler et al.[ | |||||
| epileptic encephalopathy, early infantile, 13 | AD | 614558 | lump | Molecular mechanism Phenotypic variability | ||||||
| seizures, benign familial infantile, 5 | AD | 617080 | Inheritance pattern | |||||||
| medullary thyroid carcinoma | AD | 155240 | lump into MEN2A and MEN2B | Molecular mechanism Phenotypic variability Inheritance pattern | Plaza-Menacho et al.[ | |||||
| pheochromocytoma | AD | 171300 | ||||||||
| multiple endocrine neoplasia IIA (MEN2A) | AD | 171400 | split | Molecular mechanism | ||||||
| multiple endocrine neoplasia IIB (MEN2B) | AD | 162300 | ||||||||
| central hypoventilation syndrome, congenital | AD | 209880 | split | to dispute | ||||||
| {Hirschsprung disease, susceptibility to, 1} | AD | 142623 | n/a | not evaluated, likely not Mendelian | ||||||
| hypercholesterolemia, familial, 1 | AD, AR | 143890 | lump | Molecular mechanism Phenotypic variability Inheritance pattern | Nussbaum et al.[ | |||||
| LDL cholesterol level QTL2 | AR, AR | 143890 | ||||||||
| cardiomyopathy, familial hypertrophic | AD | 192600 | lump | Assertion Molecular mechanism Phenotypic variability Inheritance pattern | Fee et al.[ | |||||
| creatine phosphokinase, elevated serum | AD | 123320 | ||||||||
| muscular dystrophy, limb-girdle, type 1C | AD | 607801 | ||||||||
| myopathy, distal, Tateyama type | AD | 614321 | ||||||||
| rippling muscle disease 2 | AD | 606072 | ||||||||
| long QT syndrome 9 | AD | 611818 | split | Assertion Molecular mechanism | Chiu et al.[ | |||||
| cardiomyopathy, dilated, 1AA, with or without LVNC | AD |
| lump | Assertion Molecular mechanism Phenotypic variability Inheritance pattern | Bagnall et al[ | |||||
| cardiomyopathy, hypertrophic, 23, with or without LVNC | AD |
| ||||||||
|
| cardiomyopathy, dilated, 1II myopathy, myofibrillar, 2 myopathy, myofibrillar, fatal infantile hypertonic, alpha-B crystallin-related | AD | 615184 | lump | Assertion Molecular mechanism Phenotypic variability Inheritance pattern | Jensen et al[ | ||||
| AD | 608810 | |||||||||
| AR | 613869 | |||||||||
| cataract 16, multiple types | AD, AR | 613763 | needs evaluation | needs assessment by ocular experts as cataracts are observed in the myofibrillar myopathy | ||||||
|
| deafness, autosomal recessive 4, with enlarged vestibular aqueduct | AR | 600791 | lump | Assertion Molecular mechanism Phenotypic variability Inheritance pattern | Coman et al[ | ||||
| Pendred syndrome | AR | 274600 | ||||||||
|
| fumarase deficiency | AR | 606812 | split | Assertion Phenotypic variability Inheritance pattern | Su et al[ | ||||
| leiomyomatosis and renal cell cancer | AD | 150800 | split | Assertion Phenotypic variability Inheritance pattern | Wimmer et al[ | |||||
| mismatch repair cancer syndrome 2 | AR | 619096 | split | Phenotypic variability Inheritance pattern | Lynch et al[ | |||||
| colorectal cancer, hereditary nonpolyposis, type 1 (aka Lynch syndrome) | AD | 120435 | lump | Assertion Molecular mechanism Phenotypic variability Inheritance pattern | Lynch et al[ | |||||
| Muir-Torre syndrome | AD | 158320 | ||||||||
|
| Danon disease | XL | 300257 | n/a | follows general principle | n/a | ||||
|
| cardiomyopathy, dilated, 1P | not noted | 609909 | lump | Assertion Molecular mechanism Phenotypic variability Inheritance pattern | van der Zwaag et al[ | ||||
| cardiomyopathy, hypertrophic, 18 | AD | 613874 | ||||||||
|
| adrenal adenoma, somatic | Som | n/a | not included as they are not germline | n/a | n/a | ||||
| angiofibroma, somatic | Som | n/a | ||||||||
| carcinoid tumor of lung | Som | n/a | ||||||||
| lipoma, somatic | Som | n/a | ||||||||
| parathyroid adenoma, somatic | Som | n/a | ||||||||
| multiple endocrine neoplasia 1 | AD | 131100 | n/a | follows general principle | ||||||
Boldface MIM phenotype numbers indicate where OMIM has asserted the phenotypes are seemingly part of a single disease spectrum.
AD, autosomal dominant; AR, autosomal recessive; n/a, not applicable; Som, somatic variation; these are not included in the current gene-disease clinical validity assessments, as the framework is restricted to germline gene-disease relationships; XL, X-linked inheritance.
Figure 1.Weighing the evidence
The four criteria for lumping and splitting should be assessed and weighed as a balance. If only a single assertion has been made in the literature and/or databases of monogenic diseases (e.g., OMIM, Monarch [Mondo ontology], Orphanet), it is possible that no further steps are needed. However, some groups find it useful to precurate genes with a single disease entity to discuss disease nomenclature and review any new evidence or assertions that have not yet been formally captured in nosological and ontological resources. If multiple distinct disease entities have been asserted, then the curator will evaluate the evidence for the molecular mechanism, phenotypic expressivity, and inheritance pattern to determine whether to lump certain entities for curation as a syndrome or an organ-specific complex phenotype or to keep them separate as split disease entities. If the evidence is equally balanced between lumping or splitting, then experts should be consulted to compare the relevant weight of each piece of evidence.
Reasons to lump or split gene-disease assertions
| Reasons to lump | Reasons to split |
|---|---|
| An assertion for only 1 disease entity has been made in the literature | An assertion for >1 distinct disease entity has been made in the literature |
| No difference in molecular mechanism is observed among the disease entities | A well-established difference in molecular mechanism(s) between ≥2 disease entities is observed |
| Interfamilial phenotypic variability is no more pronounced than intrafamilial variability | Interfamilial phenotypic variability is more pronounced than intrafamilial variability |
| The difference in the inheritance pattern for the disease entities is representative of a continuum of disease (i.e., mild phenotypic features are observed in heterozygotes for recessive disease or dosage impacts are observed for dominant disease [more severe phenotype in homozygotes]) | The representative disease entities between differing inheritance patterns are distinguishable, with notable varying phenotypes and/or clinical management distinctions |
| The disease entities in question are seemingly part of a variable phenotype observed within a single organ system and there is insufficient evidence for any single phenotype | To dispute or refute a disease entity assertion for the gene in question |
Figure 2.Lumping and splitting conundrum: defining a disease entity
When assessing the involvement of any given gene in disease, several possibilities for a disease entity may exist, including the following: (1) an isolated phenotype, in which 1 phenotype (or phenotypic feature) arises in a single organ system with no risk of other phenotypes arising in that organ system or elsewhere; (2) variable phenotypes in a single organ, in which multiple related phenotypes (or phenotypic features) arise in a single organ system; or (3) a syndromic phenotype, in which multiple, varying phenotypes occur in multiple organs. Assessing the appropriate disease entity or entities to curate can be challenging, thus requiring the use of defined criteria.