Literature DB >> 34107974

Sex-specific recombination patterns predict parent of origin for recurrent genomic disorders.

Trenell J Mosley1,2, H Richard Johnston2,3, David J Cutler2, Michael E Zwick2,4, Jennifer G Mulle5,6.   

Abstract

BACKGROUND: Structural rearrangements of the genome, which generally occur during meiosis and result in large-scale (> 1 kb) copy number variants (CNV; deletions or duplications ≥ 1 kb), underlie genomic disorders. Recurrent pathogenic CNVs harbor similar breakpoints in multiple unrelated individuals and are primarily formed via non-allelic homologous recombination (NAHR). Several pathogenic NAHR-mediated recurrent CNV loci demonstrate biases for parental origin of de novo CNVs. However, the mechanism underlying these biases is not well understood.
METHODS: We performed a systematic, comprehensive literature search to curate parent of origin data for multiple pathogenic CNV loci. Using a regression framework, we assessed the relationship between parental CNV origin and the male to female recombination rate ratio.
RESULTS: We demonstrate significant association between sex-specific differences in meiotic recombination and parental origin biases at these loci (p = 1.07 × 10-14).
CONCLUSIONS: Our results suggest that parental origin of CNVs is largely influenced by sex-specific recombination rates and highlight the need to consider these differences when investigating mechanisms that cause structural variation.

Entities:  

Keywords:  3q29 deletion; Copy number variants; Meiotic recombination; Parent of origin

Mesh:

Year:  2021        PMID: 34107974      PMCID: PMC8190997          DOI: 10.1186/s12920-021-00999-8

Source DB:  PubMed          Journal:  BMC Med Genomics        ISSN: 1755-8794            Impact factor:   3.063


Background

Genomic disorders are caused by pathological structural variation in the human genome usually arising de novo during parental meiosis [1-4]. The most common pathogenic variety of these rearrangements are copy number variants (CNVs), i.e. a deletion or duplication of > 1 kb of genetic material [3, 5, 6]. The clinical phenotypes of genomic disorders are varied. They include congenital dysmorphisms, neurodevelopmental, neurodegenerative, and neuropsychiatric manifestations, and even more common complex phenotypes such as obesity and hypertension [7-12]. CNVs have been observed in 10% of sporadic cases of autism [13, 14], 15% of schizophrenia cases [15, 16], and 16% of cases of intellectual disability [17]. These and other associations highlight the importance of structural variation to human health and the need to understand the factors influencing how they arise. There is an intense interest in understanding the mechanisms by which CNVs form [18, 19]. In several regions of the genome, de novo CNVs with approximately the same breakpoints recur in independent meioses (recurrent CNVs) [1, 20]. The presence of segmental duplications flanking these intervals is a hallmark feature of recurrent CNVs. It is hypothesized that misalignment and subsequent recombination between non-allelic low copy repeat (LCR) segments within the segmental duplication regions is the formative event giving rise to the CNV [21, 22], so-called non-allelic homologous recombination (NAHR). Risk factors that may favor NAHR have been investigated and include sequence composition and orientation of the LCRs themselves [21, 23] as well as the presence of inversions at the locus [24, 25]. Parental sex bias for the origin of recurrent de novo CNVs remains unexplained. De novo deletions at the 16p11.2 and 17q11.2 loci are more likely to arise on maternally inherited chromosomes [26-29]. Deletions at the 22q11.2 locus show a slight maternal bias as well [30]. In contrast, deletions at the 5q35.3 locus (Sotos syndrome [MIM: 117550]) display a paternal origin bias [31, 32]. Deletions at the 7q11.23 locus (Williams syndrome [MIM: 194050]) do not show a bias in parental origin [24]. While it has been suggested that sex-specific recombination rates might influence sex biases in NAHR [26], this hypothesis has not been formally tested. The majority of recurrent CNVs are thought to form during meiosis when homologous chromosomes align and synapse during prophase I [33]. It is well established that meiosis differs significantly between males and females. In males, spermatagonia continuously divide and complete meiosis throughout postpubescent life with all four products of meiosis resulting in gametes. In contrast, in human females, oogonia are established in fetal life and enter into an extended period of prolonged stasis in prophase I of meiosis until they complete meiosis upon ovulation and fertilization [34]. Additionally, in female meiosis, only one of four products of meiosis result in a gamete. Sexual dimorphism in meiosis extends to the patterns and processes of recombination during meiosis [33]. Here we seek to ask whether local sex-specific rates in meiotic recombination can predict the parental bias for the origin of recurrent de novo CNVs.

Methods

Parent of origin determination

Literature search and parental origin data curation

For this analysis, we considered the 55 known genomic disorder CNV loci described in Coe et al. [7]. A locus was eligible for inclusion in the current analysis if it is flanked by LCRs, i.e. mediated by NAHR, and not imprinted (n = 38 eligible loci). For each of these 38 loci, we performed a systematic PubMed search to identify published data on parental origin. Studies were admitted to this paper’s analysis when the following criteria were met: (1) the study detailed parent of origin data for one of the 38 eligible NAHR-mediated loci as designated by Coe et al. [7], (2) the authors of the study interrogated the entire canonical CNV interval to confirm the presence of a deletion or duplication in the patients, (3) the authors determined the investigated CNVs were de novo, and (4) the study clearly treated monozygotic twins as one meiotic event and not two (Additional file 1: Supplemental Methods, Additional file 2: Table S1, and Additional file 3: Table S2). The literature search led to a manual review of 1268 papers, out of which we identified 77 manuscripts across 24 loci with suitable data for analysis: 1q21.1 [35-39], 1q21.1 TAR [40], 2q13 [37], 3q29 [37-42], 5q35 [31, 32], 7q11.23 [24, 40, 43–54], 8p23.1 [55, 56], 11q13.2q13.4 [57], 15q13.3 [38, 40, 58], 15q24 (AC, AD, BD, and BE intervals) [59-64], 15q25.2 [65-67],16p11.2 [26, 37, 40, 68–70], distal 16p11.2 [37, 38, 70], 16p11.2p12.1 [71], 16p31.11 [37], 17p11.2 [72-76], 17q11.2 [28, 29, 77], 17q12 [37, 38, 78], 17q21.31 [19, 25, 67, 79–84], 17q23.1q23.2 [69, 85] and 22q11.2 [30, 43, 53, 86–102] (Table 1). For the remaining 14 loci, no published parent of origin data could be identified. At the 3q29 locus, we generated new data to determine the parent of origin for de novo events (http://genome.emory.edu/3q29/).
Table 1

Summary of CNV loci included in literature search and curated studies

LocusMIM number# Studies includedaStudy references
1q21.1612474/6124755[3539]
1q21.1 TAR2740001[40]
2q11.20
2q11.2q130
2q131[37]
3q29609425/6119366[3742]
5q351175502[31, 32]
7q11.23194050/60975714[24, 40, 4354]
7q11.23 distal6137290
7q11.23 proximal0
8p23.12[55, 56]
10q236122420
11q13.2q13.41[57]
15q11.26156560b
15q13.36120013[38, 40, 58]
15q24c6[5964]
15q25.26142943[6567]
15q25.2 (Cooper)0
16p11.2611913/6146716[26, 37, 40, 6870]
16p11.2 distal6134443[37, 38, 70]
16p11.2p12.20
16p11.2p12.11[71]
16p12.11365700
16p13.111[37]
17p11.2182290/6108835[7276]
17p11.2p12d118220/1625000
17q11.2613675/6188743[28, 29, 77]
17q12614526/6145273[37, 38, 78]
17q21.31610443/6135339[19, 25, 67, 7984]
17q230
17q23.1q23.2613355/6136182[69, 85]
22q11.2188400/19243020[30, 43, 53, 86102]
22q11.2 distal6118670

aIndependent studies from which the parent of origin data for the current analysis were obtained. Studies may be repeated between loci

bRecombination rates could not be calculated for 15q11.2 as the breakpoints were outside the range of recombination maps

c15q24 locus is represented as 6 different intervals in Coe et al. [7]

d17p11.2p12 excluded due to inconsistencies in the mechanism of formation. See Additional file 1: Supplemental Methods

Summary of CNV loci included in literature search and curated studies aIndependent studies from which the parent of origin data for the current analysis were obtained. Studies may be repeated between loci bRecombination rates could not be calculated for 15q11.2 as the breakpoints were outside the range of recombination maps c15q24 locus is represented as 6 different intervals in Coe et al. [7] d17p11.2p12 excluded due to inconsistencies in the mechanism of formation. See Additional file 1: Supplemental Methods

Determination of parental origin for 3q29 deletion

Study subject recruitment

This study was approved by Emory University’s Institutional Review Board (IRB00064133). Individuals with a clinically confirmed diagnosis of 3q29 deletion were ascertained through the internet-based 3q29 registry (https://3q29deletion.patientcrossroads.org/) as previously described [103]. Blood samples were obtained from 14 families. SNP genotyping was performed on 12 of the 14 families (10 full trios, 2 mother–child pairs) using the Illumina GSA-24 v 3.0 array. For 2 full trios (6 samples), parent of origin was determined from whole-genome sequence data on Illumina's NovaSeq 6000 platform. Quality control was performed with PLINK 1.9 [104] and our custom pipeline (Additional file 1: Supplemental Methods).

Parental origin analysis

Parental origin of the 3q29 deletion was determined for all 14 families using PLINK 1.9 [104]. SNPs located within the 3q29 deletion region (chr3:196029182–197617792; hg38) were isolated for analysis and the pattern of Mendelian errors (MEs) were analyzed. The parent with the most MEs was considered the parent of origin for the 3q29 deletion (Additional file 1: Supplemental Methods). The mean age of fathers in our 3q29 cohort was collected from self-reported data in conjunction with the Emory University 3q29 project (http://genome.emory.edu/3q29/) and compared to the U.S. average (NCHS; https://www.cdc.gov/nchs/index.htm) via a two-tailed two-sample t-test using R [105].

Calculation of recombination rates and ratios

Chromosome male and female recombination rates (cM/Mb) were obtained from the deCODE sex-specific maps, which are based on over 4.5 million crossover recombination events from 126,427 meioses, with an average resolution of 682 base pairs [106]. The recombination rate (cM/Mb) data from deCODE is publicly available as recombination rates calculated for a physical genomic interval bounded by two SNP markers (Additional file 1: Supplemental Methods). Therefore, for our calculation of the average male and female recombination rates, each bounded recombination rate was weighted by the total number of base pairs contained within the respective SNP marker interval. Weighted rates were then averaged across the CNV interval for males and females, separately. The ratio of the weighted average male and female recombination rates was then calculated for each CNV interval by dividing the weighted average male recombination rate by the weighted average female recombination rate (Additional file 1: Figure S1). To account for slight differences in the recombination rate ratios calculated for the different LCR22 intervals at the 22q11.2 locus we used an adjusted recombination rate ratio composed of the weighted recombination rate ratios calculated for each LCR22 interval. Weights were based on the estimated population prevalence of the different 22q11.2 deletion intervals (Additional file 1: Table S3) [107].

Logistic regression analysis

Parental origin data was curated for CNVs at the 24 CNV loci from 77 independent studies; only independent samples were included in the analysis (duplicate or overlapping samples were removed). For each CNV locus, the male to female recombination rate ratio was calculated as described above. A logistic regression model was fitted to the data with the loge-transformed male to female recombination rate ratio as the predictor and parental origin (paternal vs. maternal) as the response variable. We performed a secondary analysis stratified by deletions and duplications. See Table 2 and Additional file 4: Table S4 for the data calculated and used in the logistic regression analyses.
Table 2

Summary of genomic disorder loci CNVs recombination calculations

LocusCNV typeBED coordinates [7]# Samples (%)M:F origin countsaDel/dup countsAvg. male recombination rate [106]bAvg. female recombination rate [106]bLoge M:F recombination ratio [106]c
1q21.1Del/Dupchr1:147101794–1479212629 (0.46%)6:37/20.123316890.50839541− 1.416626
1q21.1 TARDelchr1:145686999–1460484951 (0.05%)1:01/00.157123880.77814863− 1.599883
2q13Dupchr2:110625954–1123359521 (0.05%)1:00/10.448545391.64377881− 1.298743
3q29Delchr3:195988732–19762873222 (1.11%)21:122/03.13052110.277759882.422197
5q35Delchr5:176290391–17763039341 (2.07%)36:541/01.299553550.979413550.282822
7q11.23Del/Dupchr7:73328061–74727726618 (31.26%)296:322598/200.493535541.92657023− 1.361890
8p23.1Del/Dupchr8:8235068–120350823 (0.15%)1:21/20.672017521.81857951− 0.995527
11q13.2q13.4Delchr11:67985953–715713061 (0.05%)0:11/00.84317652.23501635− 0.974828
15q13.3Del/Dupchr15:30840505–321905076 (0.30%)5:15/11.636407261.89901039− 0.148828
15q24 ACDelchr15:72670606–752406061 (0.05%)1:01/00.284799190.86129537− 1.106653
15q24 ADDelchr15:72670606–757206043 (0.15%)1:23/00.276135440.82152961− 1.090277
15q24 BDDelchr15:73720606–757206041 (0.05%)0:11/00.307399670.68432207− 0.800280
15q24 BEDelchr15:73720606–778406032 (0.10%)0:22/00.234851250.72623183− 1.128917
15q25.2Delchr15:82513967–840702445 (0.25%)0:55/00.212250810.32633295− 0.430177
16p11.2Del/Dupchr16:29641178–3019117898 (4.96%)11:8779/190.065709351.28740565− 2.974798
16p11.2 distalDel/Dupchr16:28761178–291011784 (0.20%)0:43/10.121509491.61662624− 2.600350
16p11.2p12.1Dupchr16:21341178–294311781 (0.05%)1:00/10.55346552.68382469− 1.578799
16p13.11Del/Dupchr16:15408642–161986422 (0.10%)1:11/11.670723782.46524529− 0.389038
17p11.2Del/Dupchr17:16805961–2057609571 (3.59%)44:2759/120.18880661.19115966− 1.841959
17q11.2Delchr17:30838856–3188886862 (3.14%)10:5262/00.260242851.85442774− 1.963716
17q12Delchr17:36460073–378462636 (0.30%4:26/00.647506543.64754421− 1.728720
17q21.31Del/Dupchr17:45626851–4610685139 (1.97%)19:2035/40.382341790.98304273− 0.944338
17q23.1q23.2Delchr17:59987857–622278572 (0.10%)0:22/00.564660541.30765625− 0.839748
22q11.2Delchr22:18924718–21111383d978 (49.47%)411:567978/01.459464943.69205976− 0.920692
AllDel/Dup1977 (100%)870:11071913/64

Summarized CNV data. Data are consolidated by locus. BED coordinates correspond to hg38 (LiftOver from hg18 coordinates in Coe et al. [7])

aMale to female CNV parent of origin counts

bAverage male and female recombination rates are the average of the recombination rates calculated for each sample observed for the locus, e.g., 0.123331689 is the average male recombination rate calculated from the male recombination rates of the nine 1q21.1 CNVs

cNatural log-transformed average male to female recombination rate ratio for the locus

dBreakpoints cited by ClinGen for ~ 3.0 Mb LCR22A-LCR22D interval

Summary of genomic disorder loci CNVs recombination calculations Summarized CNV data. Data are consolidated by locus. BED coordinates correspond to hg38 (LiftOver from hg18 coordinates in Coe et al. [7]) aMale to female CNV parent of origin counts bAverage male and female recombination rates are the average of the recombination rates calculated for each sample observed for the locus, e.g., 0.123331689 is the average male recombination rate calculated from the male recombination rates of the nine 1q21.1 CNVs cNatural log-transformed average male to female recombination rate ratio for the locus dBreakpoints cited by ClinGen for ~ 3.0 Mb LCR22A-LCR22D interval

Linear regression analysis

For linear regression, locus-specific estimates for parental origin were derived by combining the data from all published studies for a given locus. To alleviate the uncertainty introduced by small sample sizes, only those loci with more than 10 observations were included. The loge-transformed combined male to female parental origin count ratio for each locus was regressed on the calculated average loge-transformed average male to female recombination rate ratio for that locus’ CNV interval. Each locus was weighted based on its sample size.

Results

Recurrent genomic disorder loci literature search

We conducted a systematic literature search for the 38 non-imprinted and NAHR-mediated CNV loci in Coe et al. [7] (Table 1, Additional file 2: Table S1). We identified parent-of-origin studies that met inclusion criteria as stated in “Methods” section. 77 studies met inclusion criteria; from these 77 studies, data were curated for 24 loci, including copy number variants at 1q21.1 [35-39], 1q21.1 TAR [40], 2q13 [37], 3q29 [37-42], 5q35 [31, 32], 7q11.23 [24, 40, 43–54], 8p23.1 [55, 56], 11q13.2q13.4 [57], 15q13.3 [38, 40, 58], 15q24 (AC, AD, BD, and BE intervals) [59-64], 15q25.2 [65-67],16p11.2 [26, 37, 40, 68–70], distal 16p11.2 [37, 38, 70], 16p11.2p12.1 [71], 16p31.11 [37], 17p11.2 [72-76], 17q11.2 [28, 29, 77], 17q12 [37, 38, 78], 17q21.31 [19, 25, 67, 79–84], 17q23.1q23.2 [69, 85] and 22q11.2 [30, 43, 53, 86–102] (Table 2). Each locus has between one and twenty independent studies representing in total 1977 de novo deletion (N = 1913) and duplication (N = 64) events (Table 2).

Parent of origin of 3q29 deletion

We determined parent of origin in 12 full trios where a proband had a de novo 3q29 deletion; in 2 additional trios where only proband and maternal DNA samples were available, parent of origin was inferred. For the 12 trios evaluated by SNP arrays, in all cases, the number of Mendelian errors between the presumed inherited (intact) parental allele was zero, and the mean Mendelian errors for the presumed de novo parent of origin allele were 41, with a range of 27–66. For the two trios evaluated with sequence data, Mendelian errors were 20–33-fold elevated when comparing the inherited versus de novo parent. In these 14 trios, 13 deletions (92.9%) arose on the paternal genome indicating a significant departure from the null expectation of 50% (p = 0.002, binomial exact). When accounting for only full trios, 11 of 12 (91.7%) deletions arose on paternal haplotypes (p = 0.006, binomial exact), altogether indicating there is a paternal bias for origin of the 3q29 deletion (Additional file 1: Table S5). We examined the age distribution of male parents in our cohort; the mean age is 34 years (median = 34 years) and is not significantly different from the 2018 U.S national average, (31.8 years) (p = 0.08, Two-tailed two-sample t-test), These data indicate the bias in the 3q29 sample is unlikely to be due to oversampling of older fathers (Additional file 1: Table S5).

Meiotic recombination and parental origin

We tested the hypothesis that sex-dependent differences in meiotic recombination could explain the parental biases observed for recurrent genomic disorder loci mediated by NAHR. We determined the male and female origin counts of the CNVs curated from the literature search. Of the 1977 CNVs, 870 were paternal in origin and 1107 were of maternal origin. We calculated the average male and female recombination rates (cM/Mb) across the CNV intervals at all 24 loci using recombination rates published by the deCODE genetics group [106] (Additional file 1: Figure S2–S12). We fit a simple logistic model to the data, with the male-to-female recombination rate ratio as the predictor and parental origin as the response variable (Table 2; Additional file 4: Table S4). Our data reveal that the sex-dependent recombination rate ratio significantly predicts parental de novo origin of a given CNV (p = 1.07 × 10–14, β = 0.6606, CI95% = (0.4980, 0.8333), OR = 1.936) (Fig. 1). In other words: for a given region, the higher the male recombination rate is relative to the female rate, the more likely a CNV formed in that region will be paternal in origin. Stratified analyses on deletions and duplications separately lead to a nearly identical model (Deletions: p = 8.88 × 10–14, β = 0.6721, CI95% = (0.5009, 0.8546), OR = 1.9584; Duplications: p = 0.02, β = 0.8304, CI95% = (0.1508, 1.6017), OR = 2.2942) (Additional file 1: Figure S13–S14, Table S6–S7). Simple linear regression on the subset of CNV loci with more than 10 samples, shows the striking correlation between relative recombination rates and parental origin, where relative recombination rates explain 85% of the variance in parental bias (Additional file 1: Figure S15 and Table S8). Our logistic model can be used to predict paternal origin rates for any locus with estimable recombination in males and females, and we have done so (Additional file 1: Table S9). CNVs at the 15q13.3 and 17q23 both are predicted to have a paternal origin approximately 60% of the time, while at the 16p11.2 distal locus CNVs are predicted to have a maternal origin 76% of the time (Additional file 1: Table S9). If correct, our model would predict these loci exhibit a bias in parental origin.
Fig. 1

Recombination rates associate with parental origin. Predicted (curve) and observed paternal origin proportions for 1977 CNVs from 24 loci. Curated parent of origin data from 77 published studies are collapsed by loci into single data points; recombination rate ratios are the average of the metric for all CNVs within the data point. Data point size and color correspond to the number of CNV data collapsed into the data point. Recombination rate ratios predict parental origin for CNV mediated by NAHR (p = 1.07 × 10–14, β = 0.6606, CI95% = (0.4980, 0.8333), OR = 1.936)

Recombination rates associate with parental origin. Predicted (curve) and observed paternal origin proportions for 1977 CNVs from 24 loci. Curated parent of origin data from 77 published studies are collapsed by loci into single data points; recombination rate ratios are the average of the metric for all CNVs within the data point. Data point size and color correspond to the number of CNV data collapsed into the data point. Recombination rate ratios predict parental origin for CNV mediated by NAHR (p = 1.07 × 10–14, β = 0.6606, CI95% = (0.4980, 0.8333), OR = 1.936)

Discussion

Parent of origin bias for de novo events at recurrent CNV loci has been well-documented but has lacked a compelling explanation. Our analysis of data gathered on 1977 CNVs from 77 published reports demonstrate that sex-specific variation in local meiotic recombination rates predicts parent of origin at recurrent CNV loci. Human male and female meiotic recombination rates and patterns differ greatly across the broad scale of human chromosomes. Recombination events are nearly uniformly distributed across the chromosome arms in females but tend to be clustered closer to the telomeres in males [108]. We note that this pattern has been previously recognized [26]. Here we have formally tested the hypothesis that recombination variation drives parent of origin variation using a rigorous, statistical framework (Fig. 1) and provided an estimate for the variance in parent of origin bias that is due to sex-specific recombination rates (Additional file 1: Figure S15). Investigations into the mechanism by which recurrent CNVs arise have focused on LCRs and their makeup [1, 109]. These regions are composed of units of sequence repeats that vary in orientation, percent homology, length, and copy number. Consequently, LCRs are mosaics of varying units, imparting complexity to LCR architecture [23]. The frequency of NAHR events mediated by LCRs is a function of these characteristics and other features of the genomic architecture [21]. Specifically, the rate of NAHR is known to correlate positively with LCR length and percent homology and decrease as the distance between LCRs increases [19, 21]. However, because LCRs are challenging to study with short-read sequencing technology, the population-level variability of these regions is not well described [110]. Recent breakthroughs with long-read sequencing and optical mapping have revealed remarkable variation in LCRs [111-113], and haplotypes with higher risks for CNV formation have now been identified [114]. LCRs are substrates for NAHR [1], and thus are subject to the recombination process. Local recombination rates may influence how likely an NAHR event will happen between two LCRs. Therefore, when analyzing LCR haplotypes and their susceptibility to NAHR, one would need to take into account sex differences in recombination. For example, at loci with maternal biases, specific risk haplotypes may be required for males to form CNVs and vice versa. Greater enrichment of GC content, homologous core duplicons or the PRDM9 motifs, or other recombination-favoring factors may also be required [1, 19]. Variation in recombination rates between sexes is well established [108, 115–118]. Prediction of individual risk may also need to consider individual variation in meiotic recombination, particularly due to heritable variation and the presence or absence of inversion polymorphisms [117, 119]. Variants in several genes, including PRDM9, have been shown to affect recombination rates and the distribution of double-stranded breaks in mammals [120, 121]. Common alleles in PRDM9 are evidenced to affect the percentage of recombination events within individuals that take place at hotspots [120], and variants in RNF212 are associated with opposite effects on recombination rate between males and females [116, 121]. The unexplained variance in our study may be due to these additional factors, which are rich substrates for future study. Many human genetic studies have observed correlations between inversion polymorphisms and genomic disorder loci [25, 122]. Because these inversions are copy-number neutral and often located in complex repeat regions, [123] they can be difficult to assay with current high-throughput strategies and their true impact remains to be explored. One model proposes that during meiosis these regions may fail to synapse properly and increase the probability of NAHR [124, 125]. Another theory suggests formation of inversions increases directly oriented content in LCRs leading to an NAHR-favorable haplotype [126]. Supporting these theories, inversion polymorphisms have been identified at the majority of recurrent CNV loci [24, 25, 30, 122, 124, 126, 127]. At the 7q11.23, 17q21.31, and 5q35 loci [24, 25, 127], compelling data indicts inversions as a highly associated marker of CNV formation. However, heterozygous inversions are known to suppress recombination perturbing the local pattern of recombination and altering the fate of chiasmata [119]. The analysis presented here strongly suggests that recombination is the driving force for CNV formation giving rise to an alternate explanation for the association between inversions and CNVs; they are both the consequence (and neither one the cause) of recombination between non-allelic homologous LCRs. Inversions and CNVs appear to be associated because both are being initiated by aberrant recombination. Viewing the system in this manner also explains the frequency of individual inversions at CNV loci. Inversions are arising via rare aberrant recombination, like CNVs, but subsequently being driven to higher frequency by natural selection, because they act to suppress recombination and “save offspring” from deleterious genomic disorders. Of course, frequent mutations leading to inversions and the details of LCR structure such as relative orientation and homology within a genomic region may promote or impede CNV formation in a locus-specific manner [128-130]. Further exploration of this relationship with improved genomic mapping can test these alternative models [131]. One testable prediction of the model described here is that inversions should be at higher frequency at loci giving rise to highly deleterious CNVs, as opposed to loci harboring recurrent benign CNVs. To our knowledge, this study is the first comprehensive investigation of parental origin of recurrent, NAHR-mediated CNV loci. Investigations of predominantly nonrecurrent CNVs show paternal bias [132-134]. Unlike recurrent CNVs, nonrecurrent CNVs are mostly formed via non-homologous end joining (NHEJ) and replicative mechanisms [1, 135, 136]. The standing hypothesis is that replication-based mechanisms of nonrecurrent CNV formation, which are known to accumulate errors in male germlines, contribute to this bias [132]. Our study reinforces the idea that the factors influencing recurrent CNVs differ from those impacting nonrecurrent CNVs. Future genome-wide analyses with larger sample sizes can further help refine our understanding of the divergent forces at play affecting recurrent and nonrecurrent CNV formation. We conducted a comprehensive literature search at 38 loci and ultimately identified 1977 samples for analysis. We note that the majority of the data come from 7 well-studied loci (Table 1). While we thoroughly curated the data in a systematic way, it is possible that our data is subject to publication bias, where loci that exhibit parent of origin biases are more likely to have parental origin reported. Further exacerbating potential publication bias, genetic testing for the affected patient (and even more so for the parents) can be difficult to obtain due to concerns such as insurance coverage, potential future discrimination, and privacy concerns [137-140]. However, we note individuals with CNVs are generally not ascertained or recruited under the expectation that recombination affects parent of origin, and therefore, any potential publication or ascertainment bias is unlikely to confound the results of our analysis. Analysis of a larger cohort of CNV loci including benign CNVs will give greater insight into the role of recombination, and sex differences in recombination influencing parent of origin in CNVs. Our estimates of recombination rates summarize CNV-scale (broad-scale) patterns of recombination, rather than fine-scale patterns near the sites of relevant recombination events that form these CNVs—LCRs. For example, local sex-specific hotspots within LCRs could be the underlying drivers behind the correlation between recombination rates and parental origin. Given the nature of repetitive regions like LCRs and our inability to adequately interrogate them with current sequencing technologies, accurate recombination data across and within the LCR regions is not available. In other words, the data is currently insufficient to conclude whether or not these broad-scale patterns are tightly correlated with fine-scale recombination rates in and around the LCRs. The best available data in the field allows us to infer the following: broad-scale patterns of recombination tightly predict patterns of parental origin.

Conclusions

In this study, we determined male and female differences in meiotic recombination rates significantly predict parent of origin for recurrent CNV loci. Combining the sex-specific recombination landscape and the mechanistic factors underlying it with a more detailed understanding of existing structural factors at genomic disorder loci can be expected to help guide standards used to identify and perform genetic counseling for individuals at risk of genomic rearrangement. Additional file 1: Figure S1. Schematic of recombination rate calculation method; Figures S2–S12. Recombination rates of 24 loci analyzed; Figure S13. Logistic regression with deletions only; Figure S14. Logistic regression with duplications only; Figure S15. Linear regression with combined CNV parent of origin data; Table S3. LCR22 recombination rate data; Table S5. Demographic data for 3q29 cohort; Table S6. Summarized data for logistic regression with deletions only; Table S7. Summarized data for logistic regression with duplications only; Table S8. Sensitivity analysis results for linear regression analysis with deletions and duplications combined; Table S9. Predicted paternal origin probability for loci with small sample sizes or missing from analysis; Supplemental Materials and Methods; Supplemental References. Additional file 2: Table S1. Exclusion/Inclusion statuses and literature search results of genomic disorder loci conducted January 2021. Additional file 3: Table S2. List of 1268 search results curated from literature search. Additional file 4: Table S4. Logistic regression data for 1977 CNVs.
  137 in total

Review 1.  Structural variation in the human genome and its role in disease.

Authors:  Paweł Stankiewicz; James R Lupski
Journal:  Annu Rev Med       Date:  2010       Impact factor: 13.739

2.  An emerging phenotype of interstitial 15q25.2 microdeletions: clinical report and review.

Authors:  Orazio Palumbo; Pietro Palumbo; Teresa Palladino; Raffaella Stallone; Mattia Miroballo; Maria Rosaria Piemontese; Leopoldo Zelante; Massimo Carella
Journal:  Am J Med Genet A       Date:  2012-11-19       Impact factor: 2.802

Review 3.  Autistic and psychiatric findings associated with the 3q29 microdeletion syndrome: case report and review.

Authors:  Fabiola Quintero-Rivera; Pantea Sharifi-Hannauer; Julian A Martinez-Agosto
Journal:  Am J Med Genet A       Date:  2010-10       Impact factor: 2.802

4.  Insurance coverage for genomic tests.

Authors:  Kathryn A Phillips; Julia R Trosman; Patricia A Deverka; Bruce Quinn; Sean Tunis; Peter J Neumann; James D Chambers; Louis P Garrison; Michael P Douglas; Christine B Weldon
Journal:  Science       Date:  2018-04-19       Impact factor: 47.728

5.  Microdeletions of 3q29 confer high risk for schizophrenia.

Authors:  Jennifer Gladys Mulle; Anne F Dodd; John A McGrath; Paula S Wolyniec; Adele A Mitchell; Amol C Shetty; Nara L Sobreira; David Valle; M Katharine Rudd; Glen Satten; David J Cutler; Ann E Pulver; Stephen T Warren
Journal:  Am J Hum Genet       Date:  2010-08-13       Impact factor: 11.025

6.  15q24.1 BP4-BP1 microdeletion unmasking paternally inherited functional polymorphisms combined with distal 15q24.2q24.3 duplication in a patient with epilepsy, psychomotor delay, overweight, ventricular arrhythmia.

Authors:  Minh-Tuan Huynh; Anne-Sophie Lambert; Lucie Tosca; François Petit; Christophe Philippe; Frédéric Parisot; Virginie Benoît; Agnès Linglart; Sophie Brisset; Cong Toai Tran; Gérard Tachdjian; Aline Receveur
Journal:  Eur J Med Genet       Date:  2018-03-14       Impact factor: 2.708

7.  Gross deletions of the neurofibromatosis type 1 (NF1) gene are predominantly of maternal origin and commonly associated with a learning disability, dysmorphic features and developmental delay.

Authors:  M Upadhyaya; M Ruggieri; J Maynard; M Osborn; C Hartog; S Mudd; M Penttinen; I Cordeiro; M Ponder; B A Ponder; M Krawczak; D N Cooper
Journal:  Hum Genet       Date:  1998-05       Impact factor: 4.132

8.  Variability in clinical phenotype despite common chromosomal deletion in Smith-Magenis syndrome [del(17)(p11.2p11.2)].

Authors:  Lorraine Potocki; Christine J Shaw; Pawel Stankiewicz; James R Lupski
Journal:  Genet Med       Date:  2003 Nov-Dec       Impact factor: 8.822

9.  Copy-number variations measured by single-nucleotide-polymorphism oligonucleotide arrays in patients with mental retardation.

Authors:  Janine Wagenstaller; Stephanie Spranger; Bettina Lorenz-Depiereux; Bernd Kazmierczak; Michaela Nathrath; Dagmar Wahl; Babett Heye; Dieter Glaser; Volkmar Liebscher; Thomas Meitinger; Tim M Strom
Journal:  Am J Hum Genet       Date:  2007-08-28       Impact factor: 11.025

10.  Genomic inversions of human chromosome 15q11-q13 in mothers of Angelman syndrome patients with class II (BP2/3) deletions.

Authors:  Giorgio Gimelli; Miguel Angel Pujana; Maria Grazia Patricelli; Silvia Russo; Daniela Giardino; Lidia Larizza; Joseph Cheung; Lluís Armengol; Albert Schinzel; Xavier Estivill; Orsetta Zuffardi
Journal:  Hum Mol Genet       Date:  2003-04-15       Impact factor: 6.150

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.