| Literature DB >> 36180924 |
Bo Yuan1,2, Katharina V Schulze1,3, Nurit Assia Batzir1, Jefferson Sinson1, Hongzheng Dai1,3, Wenmiao Zhu1,3, Francia Bocanegra4, Chin-To Fong5, Jimmy Holder6, Joanne Nguyen7, Christian P Schaaf1,8, Yaping Yang1, Weimin Bi1,3, Christine Eng1,3, Chad Shaw1,9, James R Lupski1,2,6,10, Pengfei Liu11,12.
Abstract
BACKGROUND: In medical genetics, discovery and characterization of disease trait contributory genes and alleles depends on genetic reasoning, study design, and patient ascertainment; we suggest a segmental haploid genetics approach to enhance gene discovery and molecular diagnostics.Entities:
Mesh:
Year: 2022 PMID: 36180924 PMCID: PMC9526336 DOI: 10.1186/s13073-022-01113-y
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 15.266
Fig. 1A flowchart of this study. Abbreviations: NAHR, nonallelic homologous recombination. *, a summary of the computational modeling is provided in Fig. 2
Fig. 2Computational modeling reveals unexpected contribution of NAHR deletions to autosomal recessive disease trait burden
Recurrent genomic deletions that are prevalent in the population
| Region | Coordinates (GRCh38) | Population allele frequency (× 10−6) | Allele frequency in diagnostic testing (× 10−6) | Known recessive genes | Number of coding genes |
|---|---|---|---|---|---|
| 2q13 | chr2:109930242-110228182 | 5811 | 2616 | 3 | |
| 15q11.2 | chr15:21311962-23261294 | 2764 | 2287 | 14 | |
| 16p12.1 | chr16:21754781-22502804 | 584.0 | 627.8 | 11 | |
| 16p11.2 proximal | chr16:29416551-30202090 | 507.6 | 1674 | 35 | |
| 17p12 HNPP | chr17:14170711-15567588 | 314.8 | 388.6 | 9 | |
| 16p13.11 | chr16:14772948-16330433 | 311.0 | 433.5 | 15 | |
| 1q21.1 BP3-BP4 | chr1:146380249-148811725 | 268.2 | 672.6 | 15 | |
| 13q12.12 | chr13:22911590-24323812 | 201.8 | 104.6 | 7 | |
| 1q21 TAR | chr1:144904297-146209950 | 178.0 | 269.0 | 22 | |
| 22q11.2 LCRA-D | chr22:18530098-21214537 | 141.7 | 4499 | 48 | |
| 10q11.21q11.23 | chr10:45765081-49954967 | 135.3 | 74.73 | 38 | |
| 16p11.2 distal | chr16:28706949-29049993 | 137.7 | 254.1 | 11 | |
| 2q13 | chr2:110494056-112385043 | 125.8 | 149.5 | 11 | |
| 7q11.23 | chr7:73089294-74862006 | 120.0 | 1375 | 28 | |
| 2q21.1 | chr2:130623447-131386379 | 97.33 | 119.6 | 9 | |
| 15q13.3 BP4-BP5 | chr15:30246847-32496522 | 99.70 | 896.8 | 13 | |
| 2q11.2 | chr2:95759114-97430329 | 73.60 | 74.73 | 24 | |
| 17q12 | chr17:36300613-38034442 | 68.86 | 463.4 | 21 | |
| 17p11.2 Smith Magenis Syndrome | chr17:16777950-20450859 | 53.33 | 687.6 | 48 | |
| 15q11q13 BP3-BP4 | chr15:28580349-30417865 | 37.98 | 134.5 | 10 | |
| 3q29 | chr3:195963652-197626678 | 21.36 | 194.3 | 23 | |
| 17q11.2 | chr17:30621877-32037969 | 21.36 | 149.5 | 14 | |
| 15q11 Prader-Willi/ Angelman syndromes BP1-BP3 | chr15:21976318-28537425 | 19.89 | 687.6 | 27 | |
| 15q11 Prader-Willi/ Angelman syndromes BP2-BP3 | chr15:23247414-28447477 | 19.89 | 657.7 | 17 | |
| 22q11.2 LCRD-H | chr22:21206521-24255497 | 11.87 | 59.79 | 45 | |
| 8p23.1 | chr8:7596999-12344083 | 9.495 | 134.5 | 50 | |
| 10q23 | chr10:79733715-87254783 | 7.121 | 59.79 | 31 | |
| 15q24 BPA-BPC | chr15:72628218-75278711 | 2.374 | 74.73 | 40 | |
| 15q11q13 BP3-BP5 | chr15:28569118-32447357 | 2.374 | 59.79 | 21 | |
| 7q11.23 distal | chr7:75456184-76629927 | 2.374 | 29.89 | 19 |
Regions are listed in descending order by population prevalence. Genes in the “Known recessive gene” column are ordered by coordinate map positions. Even though it is the third highest NAHR-mediated deletions, the Xp22.31-STS deletion is not included in this table because the current list focuses on autosomal recessive conditions. The gene OTUD7A in 15q13.3 BP4-BP5 is not reported to cause a recessive disease at the time of this study; however, patient analysis results from this study support this gene as a candidate recessive disease gene
Recessive genes with NAHR-mediated recurrent genomic deletions contributing to more than 20% of the overall disease burden
| Gene | AR disease trait | Genomic region cytogenetic interval | NAHR deletion prevalence in adults | Top 3 allele contribution to diseasea (NAHRdelCNV in bold) | Fraction of NAHRdelCNV allele frequency | Aggregate carrier allele frequency | |
|---|---|---|---|---|---|---|---|
| Nephronophthisis 1, juvenile, MIM# 256100 | 2q13 | 5.811x10-3 | 7.7 | 85 | 6.82x10-3 | ||
| Autosomal recessive mental retardation (from DECIPHER) | 2q11.2 | 7.36x10-5 | 4.1 | 90 | 8.18x10-5 | ||
| Glycogen storage disease XII, MIM# 611881 | 16p11.2 proximal | 5.076x10-4 | 4.1 | 90 | 5.64x10-4 | ||
| Morbid obesity and spermatogenic failure, MIM# 615703 | 3q29 | 2.136x10-5 | 4.1 | 90 | 2.37x10-5 | ||
| Immunodeficiency 8, MIM# 615401 | 16p11.2 proximal | 5.076x10-4 | 4.1 | 90 | 5.64x10-4 | ||
| Mitochondrial complex IV deficiency, nuclear type 20, MIM# 619064 | 15q24 BPA-BPC | 2.374x10-6 | 4.1 | 90 | 2.64x10-6 | ||
| Mental retardation, autosomal recessive 50, MIM# 616460 | 15q24 BPA-BPC | 2.374x10-6 | 4.1 | 90 | 2.64x10-6 | ||
| Immunodeficiency 52, MIM# 617514 | 16p11.2 distal | 1.377x10-4 | 4.1 | 90 | 1.53x10-4 | ||
| Combined oxidative phosphorylation deficiency 49, MIM# 619024 | Smith Magenis Syndrome | 5.333x10-5 | 4.1 | 90 | 5.93x10-5 | ||
| Myasthenic syndrome, congenital, 21, presynaptic, MIM# 617239 | 10q11.21q11.23 | 1.353x10-4 | 4.1 | 90 | 1.50x10-4 | ||
| Dejerine-Sottas disease, MIM# 145900 | 17p12 HNPP | 3.148x10-4 | 4.2 | 89 | 3.54x10-4 | ||
| Neurodevelopmental disorder (this study) | 15q13.3 BP4-BP5 | 9.97x10-5 | 5.4 | 74 | 1.35x10-4 | ||
| Deafness, autosomal recessive 114, MIM# 618456 | Smith Magenis Syndrome | 5.333x10-5 | 4.6 | 84 | 6.38x10-5 | ||
| Thrombocytopenia-absent radius syndrome, MIM# 274000 | 1q21.1 TAR | 1.78x10-4 | 9.8 | 90 | 1.98x10-4 | ||
| Hyperprolinemia, type I, MIM# 239500 | 22q11.2 LCRA-D | 1.417x10-4 | 5 | 75 | 1.90x10-4 | ||
| Mitochondrial complex III deficiency, nuclear type 5, MIM# 615160 | 16p12.1 | 5.84x10-4 | 5 | 75 | 7.83x10-4 | ||
| Lissencephaly 4 (with microcephaly), MIM# 614019 | 16p13.11 | 3.11x10-4 | 5.1 | 72 | 4.30x10-4 | ||
| Peroxisome biogenesis disorder 14B, MIM# 614920 | 1q21 TAR | 1.78x10-4 | 4.7 | 72 | 2.48x10-4 | ||
| Autosomal recessive mental retardation (from DECIPHER) | 16p11.2 proximal | 5.076x10-4 | 5.2 | 65 | 7.77x10-4 | ||
| Combined oxidative phosphorylation deficiency 4, MIM# 610678 | 16p11.2 distal | 1.377x10-4 | 3.9 | 65 | 2.13x10-4 | ||
| Mitochondrial complex IV deficiency, nuclear type 3, MIM# 619046 | 17p12 HNPP | 3.148x10-4 | 5.1 | 62 | 5.05x10-4 | ||
| Endosteal Hyperostosis and Oligodontia (from DECIPHER) | 1q21 TAR | 1.78Ex10-4 | 4.1 | 59 | 3.00x10-4 | ||
| Van den Ende-Gupta syndrome, MIM# 600920 | 22q11.2 LCRA-D | 1.417x10-4 | 3.8 | 58 | 2.45x10-4 | ||
| Joubert syndrome 27, MIM# 617120 | Smith Magenis Syndrome | 5.333x10-5 | 5.3 | 56 | 9.47x10-5 | ||
| Megacystis-microcolon-intestinal hypoperistalsis syndrome (from DECIPHER) | 16p13.11 | 3.11x10-4 | 5 | 57 | 5.48x10-4 | ||
| Microcephaly 23, primary, autosomal recessive, MIM# 617985 | 2q11.2 | 7.36x10-5 | 2.9 | 55 | 1.33x10-4 | ||
| Rothmund-Thomson syndrome, type 1, MIM# 618625 | 2q13 | 1.258x10-4 | 3.8 | 50 | 2.54x10-4 | ||
| Immunodeficiency, common variable, 3, MIM# 613493 | 16p11.2 distal | 1.377x10-4 | 4 | 48 | 2.86x10-4 | ||
| Hemochromatosis, type 2A, MIM# 602390 | 1q21 TAR | 1.78x10-4 | 5.2 | 45 | 3.97x104 | ||
| Glycosylphosphatidylinositol biosynthesis defect 11, MIM# 616025 | 17q12 | 6.886x10-4 | 5.2 | 44 | 1.56x10-4 | ||
| Agammaglobulinemia 2, MIM# 613500 | 22q11.2 LCRD-H | 1.187x10-5 | 5.2 | 43 | 2.79x10-5 | ||
| Mitochondrial complex V (ATP synthase) deficiency, nuclear type 1, MIM# 604273 | Smith Magenis Syndrome | 5.333x10-5 | 4.1 | 41 | 1.29x10-4 | ||
| Jalili syndrome, MIM# 217080 | 2q11.2 | 7.36x10-5 | 3.2 | 40 | 1.84x10-4 | ||
| Retinitis pigmentosa 66, MIM# 615233 | 10q11.21q11.23 | 1.353x10-4 | 4.1 | 40 | 3.42x10-4 | ||
| Myasthenic syndrome, congenital, 23, presynaptic, MIM# 618197 | 22q11.2 LCRA-D | 1.417x10-4 | 4.2 | 38 | 3.69x10-4 | ||
| Squalene synthase deficiency, MIM# 618156 | 8p23.1 | 9.495x10-6 | 5.1 | 38 | 2.52x10-5 | ||
| Giant platelet disorder, isolated, MIM# 231200 | 22q11.2 LCRA-D | 1.417x10-4 | 4.6 | 86, | 38 | 3.75x10-4 | |
| Muscular dystrophy, limb-girdle, autosomal recessive 5, MIM# 253700 | 13q12.12 | 2.018x10-4 | 4.7 | 36 | 5.62x10-4 | ||
| Myasthenic syndrome, congenital, 6, presynaptic, MIM# 254210 | 10q11.21q11.23 | 1.353x10-4 | 4.1 | 35 | 3.92x10-4 | ||
| Cerebral dysgenesis, neuropathy, ichthyosis, and palmoplantar keratoderma syndrome, MIM# 609528 | 22q11.2 LCRA-D | 1.417x10-4 | 4.3 | 33 | 4.23x10-4 | ||
| Meier-Gorlin syndrome 7, MIM# 617063 | 22q11.2_LCRA-D | 1.417x10-4 | 4.2 | 33 | 4.34x10-4 | ||
| Spondylocostal dysostosis 5, MIM# 122600 | 16p11.2 proximal | 5.076x10-4 | 5.6 | 47 | 1.08x10-3 | ||
| Lung disease, immunodeficiency, and chromosome breakage syndrome, MIM# 617241 | 15q11q13 BP3-BP4 | 3.798x10-5 | 4.8 | 91, | 29 | 1.29x10-4 | |
| Seizures, early-onset, with neurodegeneration and brain calcification, MIM# 618875 | 3q29 | 2.136x10-5 | 4.8 | 93, | 29 | 7.48x10-5 | |
| Immunodeficiency 46, MIM# 616740 | 3q29 | 2.136x10-5 | 1.6 | 24 | 8.94x10-5 | ||
| Spondylometaphyseal dysplasia with cone-rod dystrophy, MIM# 608940 | 3q29 | 2.136x10-5 | 1.8 | 22 | 9.49x10-5 | ||
| Spastic ataxia, Charlevoix-Saguenay type, MIM# 270550 | 13q12.12 | 2.018x10-4 | 4.4 | 22 | 9.09x10-4 | ||
| Deafness, autosomal recessive 22, MIM# 607039 | 16p12.1 | 5.84x10-4 | 4.4 | 80, | 21 | 2.78x10-3 | |
| Mental retardation, autosomal recessive, 52, MIM# 616887 | 2q11.2 | 7.36x10-5 | 3.4 | 19 | 3.89x10-4 | ||
| Combined oxidative phosphorylation deficiency 31, MIM# 617228 | 13q12.12 | 2.018x10-4 | 3.9 | 19 | 1.05x10-3 | ||
| Glucocorticoid deficiency 5, MIM# 617825 | 22q11.2 LCRA-D | 1.417x10-4 | 3.4 | 49, 37, | 18 | 7.86x10-4 | |
| Brody myopathy, MIM# 601003 | 16p11.2 distal | 1.377x10-4 | 4.1 | 73, | 18 | 7.79x10-4 | |
| Polymicrogyria, perisylvian, with cerebellar hypoplasia and arthrogryposis, MIM# 616531 | 22q11.2 LCRA-D | 1.417x10-4 | 4.3 | 16 | 8.68x10-4 | ||
| Short-rib thoracic dysplasia 17 with or without polydactyly, MIM# 617405 | 3q29 | 2.136x10-5 | 1.1 | 60, 44, | 16 | 1.36x10-4 | |
| Metabolic encephalomyopathic crises, recurrent, with rhabdomyolysis, cardiac arrhythmias, and neurodegeneration, MIM# 616878 | 22q11.2 LCRA-D | 1.417x10-4 | 2.6 | 80, | 13 | 1.11x10-3 | |
| PEHO syndrome, MIM# 260565 | 17q12 | 6.886x10-5 | 3.7 | 96, | 13 | 5.47x10-4 | |
| Cockayne syndrome, type B, MIM# 133540 | 10q11.21q11.23 | 1.353x10-4 | 3.5 | 12 | 1.17x10-3 | ||
| Interstitial nephritis, karyomegalic, MIM# 614817 | 15q13.3 BP4-BP5 | 9.97x10-5 | 3.0 | 23, | 11 | 8.94x10-4 | |
| Sjogren-Larsson syndrome, MIM# 270200 | Smith Magenis Syndrome | 5.333x10-5 | 2.7 | 41, 24, | 11 | 4.71x10-4 |
aThe rows of the table are arranged in a descending order based on the NAHR deletion allele’s contribution to the recessive disease, i.e., the percent of all affected individuals carrying at least one NAHR deletion allele, which are represented as the bold numbers in the column “Top 3 allele contribution to disease.” Note that the numbers from the column “Top 3 allele” are not disjunct events. For example, a patient can be compound heterozygous for the top 1 allele and the 2nd allele, and thus contribute to both numbers. Genes with an NAHR allele contribution to disease lower than 20% are not tabulated here, but can be found in Additional file 1: Table S5
Clinically significant sequence variants uncovered by the deletions. Subjects #1 and #2 were identified in a COX10-phenotype-driven cohort analysis. Subjects #1, #3, and #4 were identified in the molecular-deletion-driven clinical exome data reanalysis
| ID | Deletion/ allele frequency | Gene (RefSeq transcript) | Genic variant | Genomic coordinate (GRCh38) | MAF in gnomAD v3.1 | Classification | Category |
|---|---|---|---|---|---|---|---|
17p12 HNPP/ 3.148 × 10−4 | (NM_001303.3) | c.1277_1282dup (p.M426_L427dup) | chr17:14207158_ 14207163dup | 0 | VUS | NDAC | |
17p12 HNPP/ 3.148 × 10−4 | (NM_001303.3) | c.858G>T (p.W286C) | chr17:14192151G>T | 6.567 × 10−6 | VUS | NDAC | |
10q11.21q11.23 deletion/ 1.353 × 10−4 | (NM_000124.3) | c.1490T>C (p.F497S) | chr10:49505920A>G | 0 | VUS | NDAC | |
15q13.3 BP4-BP5 deletion/ 9.97 × 10−5 | (NM_130901.2) | c.2023_2066del (p.D675Hfs*188) | chr15:31484009_ 31484052del | 6.58 × 10−5 a | VUS | NDGMC | |
| 16p11.2 proximal / 5.076 × 10−4 | (NM_145239.2) | c.649dup (p.R217fs*8) | chr16: 29813703dup | 1.472 × 10−4 b | Pathogenic | NDGMC |
aThis variant is marked with “low complexity region” label in gnomAD, suggesting ambiguous variant call quality. It has a variant count of 2 in gnomAD v3.1. However, manual review of the alignment data from gnomAD suggests only 1 is of higher quality. The allele frequency is adjusted in half accordingly
bThis variant is marked with “low complexity region” label in gnomAD, suggesting ambiguous variant call quality. It is located in a homopolymer region that is susceptible to false positive variant calling. The variant allele frequency quoted here may be overestimated
Abbreviations: MAF, minor allele frequency; VUS, variant of unknown clinical significance; NDAC, new disease allele characterization; NDGMC, new disease gene/mechanism characterization
Fig. 3ΔNIRD scores for a population group relative to the general population. ΔNIRD is calculated by subtracting the general population NIRD from the specific population group NIRD. A positive ΔNIRD score suggests an even higher recessive disease contribution of the NAHR deletion from that population group. A negative ΔNIRD score indicates a relatively higher probability for an affected patient to carry biallelic small variants in that population group. Genes that have zero ΔNIRD scores for all four population groups are not depicted in this heatmap
Fig. 4A “segmental haploid genomics” approach for characterization of new disease genes or alleles for autosomal recessive conditions. The illustration above depicts dynamics of disease alleles in an autosomal recessive condition whose collective carrier burden is contributed by a recurrent genomic deletion (red area) and a few single-nucleotide variants (SNVs, gray area). Individuals affected with biallelic pathogenic changes frequently carry the deletion as one of the two alleles, because the deletion arises recurrently in multiple lineages (indicated by the red lightning bolt arrow). SNV disease alleles for recessive genes tend to be passed on from ancestral generations and may drift away without being noticed if they do not converge with another disease allele (SNV3). However, they may emerge to medical attention frequently in families with high degree of autozygosity, as illustrated in generations 3 and 4 for SNV2
| … | ||||||
| ⁝ | ⁝ | ⁝ | ||||
| ⁝ | ⁝ | ⁝ | ⁝ | |||