Tibor Vaszkó, János Papp, Csilla Krausz, Elena Casamonti, Lajos Géczi, Edith Olah.
Abstract
Owing to its palindromic structure, the AZFc (Azoospermia Factor c) region of chromosome Y is one of the most unstable regions of the human genome. It contains eight gene families expressed mainly in the testes. Several types of rearrangement that change the cumulative copy number of these gene families have been reported to be associated with diseases such as male infertility and testicular germ cell tumors. The best-studied AZFc rearrangement is the gr/gr deletion, whose carriers show wide phenotypic variation, from azoospermia to normospermia. This variation was initially attributed to different gr/gr subtypes that would eliminate distinct members of the affected gene families. However, studies conducted to confirm this hypothesis have yielded conflicting results, perhaps in part because of shortcomings of the subtyping methodology used. This proof-of-concept paper introduces a novel method for subtyping AZFc rearrangements. It can differentiate the partial deletion and partial duplication subtypes of the Deleted in Azoospermia (DAZ) gene family. The keystone of the method is determining the copy number of the gene family member-specific variant(s) at a series of sequence family variant (SFV) positions. Most importantly, we present a novel approach for correctly interpreting the variant copy number data to determine the copy number of the individual DAZ family members in the context of frequent interlocus gene conversion. Besides DAZ1/DAZ2 and DAZ3/DAZ4 deletions, previously undescribed rearrangements such as DAZ2/DAZ4 deletion and three duplication subtypes were also found with the novel approach. A striking feature is the extremely high concordance among the individual data pointing to a given type of rearrangement. In addition to identifying DAZ deletion subtypes more reliably than previously used methods, this approach is the first that can also discriminate DAZ duplication subtypes.
Year: 2016 PMID: 27723784 PMCID: PMC5056753 DOI: 10.1371/journal.pone.0163936
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1. Structure of the AZFc region with the eight gene families.
The colored arrows show the direct and inverted repeats of AZFc. The colored arrowheads indicate the members of the eight gene families located in the region. The two rectangles enclose the gene family members eliminated by gr/gr deletion with two different breakpoints (g1/g2 versus r2/r4 deletion), respectively. The STS markers that have usually been analyzed in the search for deletions are also shown.
Fig 2. Schematic representation of the four DAZ family members along with the studied markers.
Panel A: DAZ1 and DAZ2. Panel B: DAZ3 and DAZ4. The figure was captured from the UCSC Genome Browser after uploading an appropriate BED file (S7 File). Below the scale, chromosome Y coordinates according to the human genome build NCBI36/hg18 are shown. Below the RefSeq Genes label, the structure of the DAZ family members is illustrated, with the exons indicated by perpendicular lines. The aligned arrowheads in the thick green line show the direction of the coding sequence. Variants shown in red are specific for the corresponding DAZ family member; variants shown in blue are non-specific. For non-specific variants, a digit in square brackets indicates the DAZ family member for which the relevant SFV position contains a specific variant. The numbers in the variants' names indicate their location in Fragment I or II.
Fig 3. Clustering of AUC ratios at two SFV positions in fifty-two samples.
The fifty-two sequenced samples form five distinct clusters according to the AUC ratio measured at SFV positions 1926 (A) and 1702 (C). Samples 1–5 are the duplication samples (referred to as Ydup_01–05 in the text), whereas samples 6–13 are the deletion samples (referred to as Ydel_06–13). Samples 14–52, each having one copy of all four DAZ family members, constitute the control panel. Representative electropherograms of SFV positions 1926 (B) and 1702 (D) obtained in samples belonging to the distinct clusters are shown. The average (AUC ratio_av) and standard deviation (StDev) of the AUC ratio values were calculated from all samples belonging to a cluster. By comparison with the AUC ratio–variant ratio relationship determined in control mixes, a variant ratio was assigned to each cluster at both positions. The calculation of the AUC ratio (presented here as a percentage) is described in the Methods section.
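The assignment of a variant ratio to a measured AUC ratio can be sketched as a nearest-value lookup. This is only an illustration under stated assumptions: the function name `assign_variant_ratio` is hypothetical, the AUC ratio is assumed to express the share of the specific variant as a percentage, and the calibration values below are ideal proportions used as placeholders, not the values measured in the control mixes (S5 Table).

```python
# Placeholder calibration: ideal share (in %) of the specific variant for
# each type of variant ratio. ASSUMPTION: real calibration values come from
# control mixes and will deviate from these ideal proportions.
PLACEHOLDER_CALIBRATION = {
    "0:2x": 0.0,
    "1:5": 100.0 / 6,
    "1:3": 25.0,
    "2:4": 100.0 / 3,
    "x:x": 50.0,
    "4:2": 200.0 / 3,
    "2x:0": 100.0,
}


def assign_variant_ratio(auc_ratio_percent, calibration=PLACEHOLDER_CALIBRATION):
    """Return the variant-ratio label whose calibration value is closest."""
    if not 0.0 <= auc_ratio_percent <= 100.0:
        raise ValueError("AUC ratio must be a percentage between 0 and 100")
    return min(calibration, key=lambda label: abs(calibration[label] - auc_ratio_percent))


if __name__ == "__main__":
    for measured in (1.5, 18.0, 24.0, 48.0):
        print(measured, "->", assign_variant_ratio(measured))
```

The labels 0:2x, x:x and 2x:0 are kept generic on purpose: an AUC ratio alone cannot tell 0:2 from 0:4 or 0:6, so these positions need the sample's rearrangement status before a concrete ratio can be written down.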
Relationship between a sample’s AZFc partial deletion/duplication status and its horizontal variant ratio distribution.
Columns give the type of SFV position (#specific variant:#non-specific variant).
| AZFc partial deletion/duplication status | 0:2x | 2x:0 | x:x | 1:3 | 1:5 | 2:4 | 4:2 |
|---|---|---|---|---|---|---|---|
| No partial rearrangement (x = 2) | + | - | + | + | - | - | - |
| Partial deletion affecting two DAZ family members (x = 1) | + | + | + | - | - | - | - |
| Partial deletion affecting two DAZ family members followed by duplication (x = 2) | + | + | + | - | - | - | - |
| Partial duplication affecting two DAZ family members (x = 3) | + | - | + | - | + | + | + |
A + sign means that the corresponding type of SFV position may be present in a sample with the relevant deletion/duplication status. A - sign means that the corresponding type of SFV position is not expected in a sample with the relevant deletion/duplication status. The value of x is a function of the deletion/duplication status.
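The table above can be read as a compatibility check: a sample's status is any row whose permitted position types cover all variant ratios observed in that sample. The sketch below encodes each row with x already substituted; the names `ALLOWED_RATIOS` and `compatible_statuses` are hypothetical, and three-part ratios such as 1:(0+3) are assumed to have been collapsed to two parts beforehand.

```python
# Variant ratios permitted for each status, taken from the '+' entries of
# the table with x substituted (x = 2, 1, 2 and 3, respectively).
ALLOWED_RATIOS = {
    "no partial rearrangement": {"0:4", "2:2", "1:3"},
    "partial deletion": {"0:2", "2:0", "1:1"},
    "partial deletion followed by duplication": {"0:4", "4:0", "2:2"},
    "partial duplication": {"0:6", "3:3", "1:5", "2:4", "4:2"},
}


def compatible_statuses(observed_ratios):
    """List every status whose allowed ratios include all observed ratios."""
    observed = set(observed_ratios)
    return [
        status
        for status, allowed in ALLOWED_RATIOS.items()
        if observed <= allowed
    ]


if __name__ == "__main__":
    print(compatible_statuses(["1:3", "0:4", "2:2"]))
    print(compatible_statuses(["0:2", "1:1", "2:0"]))
    print(compatible_statuses(["0:4", "2:2"]))  # ambiguous between two rows
```

A sample showing only 0:4 and 2:2 positions fits two rows, which is why the function returns a list rather than a single label.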
Fig 4. Flowchart showing the steps involved in the elaboration and application of variant ratio analysis.
The numbered arrows stand for the processes applied; the rectangles symbolize the result of the respective process. The green rectangle shows the desired end result of the analysis. Brown rectangles symbolize calibration data used to help derive variant ratios from AUC ratios. Red rectangles stand for input data required for stage 2 or, if there are marker associations, stage 3 analysis. Samples with different partial rearrangement types are indicated by three different shades of gray. The dashed arrow means that only deletion and duplication samples undergo stage 2 or stage 3 analysis. The large rectangle with a transparent gray background contains the steps that are required for a complete analysis. The steps outside the large rectangle were used for the elaboration of the method. All processes are listed below.
1/ Aligning the sequences of the four DAZ genes extracted from the human reference genome in order to select amplifiable fragments which i) consist of four amplicons belonging to the four DAZ genes, respectively, and ii) contain as many SFV positions as possible (Fragments I and II) (Fig 2, S1 Fig, S5–S7 Files, S1 and S2 Tables).
2/ Amplification of selected regions in all individual samples to get Fragments I and II.
3/ Sequencing Fragments I and II in all individual samples.
4/ Preparing control plasmid mixtures by cloning Fragment I amplified from samples selected to contain every DAZ1-, DAZ2-, DAZ3- and DAZ4-specific variant in order to mimic wild-type, AZFc partial deletion and AZFc partial duplication samples having known variant ratios at each SFV position (S4 Table).
5/ Amplification of selected regions to get Fragment I in control mixtures.
6/ Sequencing Fragment I in control mixtures.
7/ Measuring AUCs with ImageJ software and calculating the AUC ratio at each SFV position in every control mixture.
8/ Correlating the AUC ratios measured in control mixtures with known variant ratios (S5 Table).
9/ Measuring AUCs and calculating the AUC ratio at each SFV position in all individual samples.
10/ Plotting AUC ratios throughout all samples for each studied position to visualize AUC ratio clustering (Fig 3).
11/ Assigning a variant ratio to each SFV position in all individual samples.
12/ Determining the horizontal variant ratio distribution in all individual samples.
13/ Grouping samples according to AZFc partial deletion/duplication status based on their horizontal variant ratio distribution (Tables 1–4). The results of this step were validated by a generally accepted DAZ dosage test (not shown) and two multiplex PCRs amplifying six sY markers (S2 Fig).
14/ Specifying the variant ratios that remained ambiguous on the basis of the AUC ratio (electropherogram picture) in step 11 (0:2x, x:x and 2x:0 positions).
15/ Deducing the copy number of the specific variant at each SFV position in all individual samples from the relevant variant ratio.
16/ Cloning Fragments I and II in four selected control samples to separate the four amplicons derived from the four DAZ family members, respectively.
17/ Sequencing an appropriate number of colonies in order to study the co-segregation of DAZ family member-specific variants, i.e. the fulfillment of requirement (c) imposed on an ideal marker (S6 Table).
18/ Determining the vertical variant ratio distribution throughout all control samples at each studied SFV position.
19/ Determining p1 and p2 values on the basis of the vertical variant ratio distribution at each SFV position within the control panel in order to study the fulfillment of requirements (a) and (b) also imposed on an ideal marker (Table 4; the equations are given in the text).
20/ Classifying family member-specific variants on the basis of p1 and p2, which results in the distinction of class I, class II/a, class II/b and class III markers (Table 5).
21/ Determining the relationship between the copy number of a gene family member-specific variant and the copy number of the corresponding gene family member for class II/a and class II/b markers. This results in the “restricted” applicability of the markers that will be utilized in stage 2 and stage 3 analyses (S1 and S2 Files, Tables 6 and 7).
22/ Searching for perfect associations between variants specific to different DAZ family members in the control panel in order to re-evaluate (extend) the applicability of some markers if possible (Table 4).
23/ Determining the relationship between the copy number of the members of the associated marker pairs and the copy number of the corresponding gene family members for the relevant class II/a and class II/b markers, which results in their “extended” applicability that will be utilized in stage 3 analysis (S3 and S4 Files, Tables 8 and 9).
24/ Evaluating partial deletion and partial duplication samples identified in step 13 using the classified markers with restricted applicability (stage 2 and 3) or, if it exists, extended applicability (stage 3) to determine the samples’ deletion and duplication subtype (S8 and S9 Tables, Tables 2 and 3).
25/ Comparing the series of the variant ratios (variant ratio haplotype, VRH) of the deletion and duplication samples with the VRHs observed in the control panel in order to support the existence of the assigned rearrangement subtypes in the studied population (S7 and S11 Tables).
26/ Checking if the assigned rearrangement subtypes are concordant with or might even be indicated by the variant ratios determined at positions also examined but not utilized for stage 2 and stage 3 analyses for some reason (S10 Table).
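Steps 12 and 15 above operate on variant ratios written as text, such as 1:3, 1:(0+3) or 2:1(:1). A minimal sketch of how such strings could be turned into copy numbers follows; the function names are hypothetical, and the first number is simply taken as the count of the first-listed variant, which is the family member-specific one only at positions that have a specific variant.

```python
import re


def parse_variant_ratio(ratio):
    """Split a ratio string into (first-listed copies, all remaining copies)."""
    counts = [int(number) for number in re.findall(r"\d+", ratio)]
    if len(counts) < 2:
        raise ValueError(f"not a variant ratio: {ratio!r}")
    return counts[0], sum(counts[1:])


def x_value(ratios):
    """Return x, assuming every position carries 2x copies in total."""
    totals = {sum(parse_variant_ratio(ratio)) for ratio in ratios}
    if len(totals) != 1:
        raise ValueError(f"inconsistent totals across positions: {sorted(totals)}")
    total = totals.pop()
    if total % 2:
        raise ValueError(f"odd total copy number: {total}")
    return total // 2


if __name__ == "__main__":
    print(parse_variant_ratio("1:(0+3)"))      # specific variant copies, others
    print(x_value(["0:2", "1:1", "2:(0+0)"]))  # a deletion-type sample
```

Requiring the same total at every position is a strict reading of the horizontal distribution; positions reported as N/A would have to be dropped before calling `x_value`.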
The human reference genome-based characteristics of the studied SFV positions and their variant ratios determined in 39 control samples grouped according to their variant ratio haplotype.
| Location | 972 | 1209 | 1702 | 1820 | 1926 | 2481 | 111 | 978 | 1005 | 1053 | 1636 | 1646 | 1952 | 1961 | 1964 | 1964 | 2071 | VRH | No. |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Fragment | I | I | I | I | I | I | II | II | II | II | II | II | II | II | II | II | II | | |
| Marker specificity | DAZ1 | DAZ2 | DAZ2 | DAZ4 | DAZ1 | - | - | DAZ4 | DAZ3 | DAZ3 | DAZ2 | DAZ3 | DAZ3 | DAZ3 | DAZ3 | DAZ4 | DAZ4 | | |
| Variants | A:G | T:C | T:C | C:A | G:A | G:T | G:C(:T) | C:T | G:A | C:T | G:T | A:G | T:A | T:C | C:(A+G) | A:(C+G) | G:C | | |
| Variant ratios | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 2:2 | 2:2 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:(1+2) | 1:(1+2) | 1:3 | RefSeq | 0 |
| | 0:4 | 1:3 | 1:3 | 0:4 | 1:3 | 1:3 | 2:2 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:(0+3) | 0:(1+3) | 1:3 | 3b | 12 |
| | 0:4 | 1:3 | 1:3 | 0:4 | 1:3 | 1:3 | 2:2 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:(1+2) | 1:(1+2) | 1:3 | 3a/1 | 2 |
| | 0:4 | 1:3 | 1:3 | 0:4 | 1:3 | 1:3 | 2:1(:1) | 1:3 | 1:3 | 1:3 | 2:2 | 1:3 | 1:3 | 1:3 | 1:(1+2) | 1:(1+2) | 1:3 | 3a/2 | 3 |
| | 0:4 | 1:3 | 1:3 | 0:4 | 1:3 | 2:2 | 2:2 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:(1+2) | 1:(1+2) | 1:3 | 3a/3 | 1 |
| | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 2:2 | 2:2 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:(0+3) | 0:(1+3) | 1:3 | 2 | 6 |
| | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 1:3 | 2:2 | 0:4 | 2:2 | 2:2 | 1:3 | 2:2 | 2:2 | 2:2 | 2:(0+2) | 0:(2+2) | 0:4 | 1 | 13 |
| | 0:4 | 1:3 | 1:3 | 0:4 | 1:3 | 2:2 | N/A | 0:4 | 0:4 | 0:4 | 1:3 | 2:2 | 2:2 | 2:2 | 2:(0+2) | 0:(2+2) | 0:4 | 4 | 1 |
| | 0:4 | 1:3 | 1:3 | 0:4 | 1:3 | 2:2 | N/A | 1:3 | 0:4 | 0:4 | 1:3 | 1:3 | 1:3 | 1:3 | 1:(0+3) | 0:(1+3) | 1:3 | 3c | 1 |
| p1 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | - | - | 1.00 | 0.65 | 0.65 | 0.92 | 0.64 | 0.64 | 0.64 | 0.64 | 1.00 | 1.00 | | |
| p2 | 0.49 | 1.00 | 1.00 | 0.49 | 1.00 | - | - | 0.64 | 0.92 | 0.92 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.15 | 0.64 | | |
| Class of marker | II/b | I | I | II/b | I | - | - | II/b | III | III | II/a | II/a | II/a | II/a | II/a | II/b | II/b | | |
¹ Variants are arranged as family member-specific variant(s):non-specific variant(s) at each SFV position.
² At position 2481 in Fragment I, there is no specific variant (DAZ1/2: G, DAZ3/4: T according to the human reference assembly).
³ At position 111 in Fragment II, there is no specific variant (DAZ1/2: G, DAZ3/4: C according to the human reference assembly). In three samples (VRH 3a/2), T was found to replace one of the Cs. Based on cloning experiments, T is located in DAZ3 (S6C Table).
⁴ At position 1964, there is a DAZ3-specific C and a DAZ4-specific A according to the human reference assembly.
⁵ Variant ratio haplotypes were named arbitrarily based on similarities.
⁶ Number of samples belonging to a variant ratio haplotype among the 39 controls.
⁷ The RefSeq variant ratio haplotype was determined by the alignment of the corresponding DAZ regions derived from the human reference assembly hg18.
⁸ The meaning and calculation of p1 and p2 can be found in the text.
Classification of DAZ family member-specific markers.
The first four columns summarize the evaluation of control samples; the last two give the inferences for unknown samples.
| p1 | p2 | Requirements fulfilled | Class of marker | Inference that could be drawn from the presence of the marker | Inference that could be drawn from the absence of the marker |
|---|---|---|---|---|---|
| > cut-off1 | > cut-off2 | (a), (b) | I | DAZ family member present | DAZ family member absent |
| > cut-off1 | ≤ cut-off2 | (b) | II/b | DAZ family member present | - |
| ≤ cut-off1 | > cut-off2 | (a) | II/a | - | DAZ family member absent |
| ≤ cut-off1 | ≤ cut-off2 | - | III | - | - |
Variants have been classified according to how they fulfill requirements (a) and (b) imposed on an ideal family member-specific marker. Cut-off1 and cut-off2 can be set arbitrarily. p1 and p2 are the reliability values, which can be calculated from the vertical variant ratio distribution obtained in the control panel at the respective SFV position (the equations are given in the text).
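The classification rule in the table above reduces to two threshold comparisons. The sketch below is an illustration only: `classify_marker` is a hypothetical name, and the cut-off of 0.95 used in the example is an assumed value, chosen because the source states that the cut-offs can be set arbitrarily.

```python
def classify_marker(p1, p2, cutoff1, cutoff2):
    """Map the reliability values p1 and p2 to a marker class."""
    p1_passes = p1 > cutoff1
    p2_passes = p2 > cutoff2
    if p1_passes and p2_passes:
        return "I"
    if p1_passes:
        return "II/b"  # presence of the marker is informative
    if p2_passes:
        return "II/a"  # absence of the marker is informative
    return "III"


if __name__ == "__main__":
    ASSUMED_CUTOFF = 0.95  # assumption, not a value given in the source
    for position, p1, p2 in (("972", 1.00, 0.49), ("1209", 1.00, 1.00),
                             ("1005", 0.65, 0.92), ("1636", 0.92, 1.00)):
        print(position, classify_marker(p1, p2, ASSUMED_CUTOFF, ASSUMED_CUTOFF))
```

With both cut-offs at 0.95, the p1 and p2 values listed for the control panel yield the same classes as shown in that table; any cut-off above 0.92 and below 1.00 would give the same result for those values.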
Restricted applicability of class II/a markers.
| Copy number of a class II/a marker | Copy number of the DAZ family member in deletion samples | Copy number of the DAZ family member in duplication samples |
|---|---|---|
| 0 | 0 | - |
| 1 | no conclusion | 1 |
| 2 | 1 | no conclusion |
| 3 | - | no conclusion |
| 4 | - | 2 |
The conclusions that can be drawn from the copy number of a class II/a DAZ family member-specific variant about the copy number of the relevant DAZ family member in partial deletion and partial duplication samples, respectively, are shown. The conclusions were established by determining the whole spectrum of gene conversions assumed to be able to duplicate a DAZ family member-specific variant, then combining each potential pairwise DAZ deletion or duplication event with each gene conversion, the latter allowed to take place either before or after the large rearrangement (S1 File). These conclusions are exploited in the stage 2 and stage 3 analyses of unknown samples (S8 and S9 Tables b-c). The '-' sign denotes a scenario that is not expected.
Restricted applicability of class II/b markers.
| Copy number of a class II/b marker | Copy number of the DAZ family member in deletion samples | Copy number of the DAZ family member in duplication samples |
|---|---|---|
| 0 | no conclusion | no conclusion |
| 1 | 1 | no conclusion |
| 2 | - | 2 |
The conclusions that can be drawn from the copy number of a class II/b DAZ family member-specific variant about the copy number of the relevant DAZ family member in partial deletion and partial duplication samples, respectively, are shown. The conclusions were established by determining the whole spectrum of gene conversions assumed to be able to eliminate a DAZ family member-specific variant, then combining each potential pairwise DAZ deletion or duplication event with each gene conversion, the latter allowed to take place either before or after the large rearrangement (S2 File). These conclusions are exploited in the stage 2 and stage 3 analyses of unknown samples (S8 and S9 Tables b-c). The '-' sign denotes a scenario that is not expected.
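The two restricted-applicability tables (class II/a and class II/b) can be held as plain lookup tables. The following sketch transcribes their entries; the names `RESTRICTED`, `NO_CONCLUSION`, `NOT_EXPECTED` and `family_member_copies` are hypothetical, and marker copy numbers not listed in the tables are deliberately left out so that they raise an error instead of returning a guess.

```python
NO_CONCLUSION = "no conclusion"
NOT_EXPECTED = "not expected"  # the '-' entries of the tables

# RESTRICTED[marker class][sample type][marker copy number] -> family member copies
RESTRICTED = {
    "II/a": {
        "deletion": {0: 0, 1: NO_CONCLUSION, 2: 1, 3: NOT_EXPECTED, 4: NOT_EXPECTED},
        "duplication": {0: NOT_EXPECTED, 1: 1, 2: NO_CONCLUSION, 3: NO_CONCLUSION, 4: 2},
    },
    "II/b": {
        "deletion": {0: NO_CONCLUSION, 1: 1, 2: NOT_EXPECTED},
        "duplication": {0: NO_CONCLUSION, 1: NO_CONCLUSION, 2: 2},
    },
}


def family_member_copies(marker_class, sample_type, marker_copies):
    """Look up the conclusion for one marker; KeyError if the case is not tabulated."""
    return RESTRICTED[marker_class][sample_type][marker_copies]


if __name__ == "__main__":
    print(family_member_copies("II/a", "deletion", 2))     # one copy left
    print(family_member_copies("II/b", "duplication", 1))  # no conclusion
```

Keeping "no conclusion" and "not expected" as distinct values preserves the difference between an uninformative marker and an observation that contradicts the assumed sample type.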
Extended applicability of DAZ1-specific A972 and DAZ4-specific C1820.
| Copy number of DAZ1-specific A972 | Copy number of DAZ4-specific C1820 | DAZ1 copy number (deletion samples) | DAZ4 copy number (deletion samples) | DAZ1 copy number (duplication samples) | DAZ4 copy number (duplication samples) |
|---|---|---|---|---|---|
| 0 | 0 | no conclusion | no conclusion | no conclusion | no conclusion |
| 0 | 1 | 0 | 1 | 1 | 2 |
| 1 | 0 | 1 | 0 | 2 | 1 |
| 1 | 1 | 1 | 1 | no conclusion | no conclusion |
| 1 | 2 | - | - | 1 | 2 |
| 2 | 1 | - | - | 2 | 1 |
| 2 | 2 | - | - | 2 | 2 |
The conclusions that can be drawn about the copy number of DAZ1 and DAZ4 in partial deletion and partial duplication samples, based on the combined analysis of the class II/b DAZ1-specific A972 and DAZ4-specific C1820, are shown. To establish these conclusions, the whole spectrum of gene conversions assumed to be able to eliminate the class II/b DAZ1-specific A972 and the class II/b DAZ4-specific C1820 in Fragment I, jointly or separately but simultaneously, was determined. Based on the perfect association of these two markers, which had been observed in the control panel, that spectrum could be limited to three conversion pairs (DAZ1>DAZ4 plus DAZ4>DAZ1, DAZ3>DAZ4 plus DAZ2>DAZ1, and DAZ2>DAZ4 plus DAZ3>DAZ1). The conclusions were derived by combining each potential pairwise DAZ deletion or duplication subtype with each of the three gene conversion pairs, the latter allowed to take place either before or after the large rearrangement (S3 File). These results are exploited in the stage 3 analysis of unknown samples (S8C and S9C Tables). The '-' sign denotes a scenario that is not expected.
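For the associated marker pair, the lookup key is the pair of copy numbers rather than a single value. The sketch below transcribes the table above; `EXTENDED_A972_C1820` and `daz1_daz4_copies` are hypothetical names, `None` stands for "no conclusion", and combinations marked '-' are omitted so that they raise an error.

```python
# Key: (copies of A972, copies of C1820); value: (DAZ1 copies, DAZ4 copies),
# or None where the table states "no conclusion".
EXTENDED_A972_C1820 = {
    "deletion": {
        (0, 0): None,
        (0, 1): (0, 1),
        (1, 0): (1, 0),
        (1, 1): (1, 1),
    },
    "duplication": {
        (0, 0): None,
        (0, 1): (1, 2),
        (1, 0): (2, 1),
        (1, 1): None,
        (1, 2): (1, 2),
        (2, 1): (2, 1),
        (2, 2): (2, 2),
    },
}


def daz1_daz4_copies(sample_type, a972_copies, c1820_copies):
    """Return (DAZ1, DAZ4) copy numbers, None for no conclusion."""
    table = EXTENDED_A972_C1820[sample_type]
    key = (a972_copies, c1820_copies)
    if key not in table:
        raise ValueError(f"combination {key} is not expected in {sample_type} samples")
    return table[key]


if __name__ == "__main__":
    print(daz1_daz4_copies("deletion", 1, 0))
    print(daz1_daz4_copies("duplication", 1, 2))
```

As a usage example, deletion sample Ydel_07 shows 1:1 at position 972 and 0:2 at position 1820, so the lookup `daz1_daz4_copies("deletion", 1, 0)` gives one copy of DAZ1 and none of DAZ4, in agreement with the DAZ3/4 deletion assigned to that sample.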
Extended applicability of the class II/a DAZ3- and class II/b DAZ4-specific markers located in Fragment II.
| Copy number of DAZ3-specific variants | Copy number of DAZ4-specific variants | DAZ3 copy number (deletion samples) | DAZ4 copy number (deletion samples) | DAZ3 copy number (duplication samples) | DAZ4 copy number (duplication samples) |
|---|---|---|---|---|---|
| 0 | 0 | 0 | 0 | - | - |
| 0 | 1 | 0 | 1 | - | - |
| 1 | 0 | no conclusion | no conclusion | - | - |
| 1 | 1 | 1 | 1 | 1 | 1 |
| 2 | 0 | 1 | 1 | 1 | 1 |
| 2 | 1 | - | - | no conclusion | no conclusion |
| 3 | 0 | - | - | no conclusion | no conclusion |
| 1 | 2 | - | - | 1 | 2 |
| 2 | 2 | - | - | 2 | 2 |
| 4 | 0 | - | - | 2 | 2 |
| 3 | 1 | - | - | 2 | 2 |
The conclusions that can be drawn about the copy number of DAZ3 and DAZ4 in partial deletion and partial duplication samples, based on the combined analysis of the class II/a DAZ3- and class II/b DAZ4-specific markers located in Fragment II, are shown. To establish these conclusions, the whole spectrum of gene conversions assumed to be able to duplicate the class II/a DAZ3-specific variants and eliminate the class II/b DAZ4-specific variants in Fragment II, jointly or separately but simultaneously, was determined. Based on the perfect association between the duplication of these DAZ3-specific markers and the lack of these DAZ4-specific markers, which had been observed in the control panel, that spectrum could be limited to DAZ3>DAZ4 gene conversion. The conclusions were derived by combining each potential pairwise DAZ deletion or duplication with DAZ3>DAZ4 gene conversion, the latter allowed to take place either before or after the large rearrangement (S4 File). These results are exploited in the stage 3 analysis of unknown samples (S8C and S9C Tables). The '-' sign denotes a scenario that is not expected.
The human reference genome-based characteristics of the studied SFV positions and their variant ratios found in eight partial deletion samples.
| Location | 972 | 1209 | 1702 | 1820 | 1926 | 2481 | 111 | 978 | 1005 | 1053 | 1636 | 1646 | 1952 | 1961 | 1964⁴ | 1964 | 2071 | Deleted | Identifier |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Fragment | I | I | I | I | I | I | II | II | II | II | II | II | II | II | II | II | II | | |
| Marker specificity | DAZ1 | DAZ2 | DAZ2 | DAZ4 | DAZ1 | - | - | DAZ4 | DAZ3 | DAZ3 | DAZ2 | DAZ3 | DAZ3 | DAZ3 | DAZ3 | DAZ4 | DAZ4 | | |
| Variants | A:G | T:C | T:C | C:A | G:A | G:T | G:C(:T) | C:T | G:A | C:T | G:T | A:G | T:A | T:C | C:(A+G) | A:(C+G) | G:C | | |
| Variant ratios | 0:2 | 1:1 | 1:1 | 0:2 | 1:1 | 1:1 | 1:1 | 0:2 | 0:2 | 0:2 | 1:1 | 0:2 | 0:2 | 0:2 | 0:(0+2) | 0:(0+2) | 0:2 | DAZ3/4 | Ydel_06 |
| | 0:2 | 1:1 | 1:1 | 0:2 | 1:1 | 2:0 | 2:0 | 0:2 | 0:2 | 0:2 | 1:1 | 0:2 | 0:2 | 0:2 | 0:(0+2) | 0:(0+2) | 0:2 | DAZ3/4 | Ydel_08 |
| | 1:1 | 1:1 | 1:1 | 0:2 | 1:1 | 2:0 | 2:0 | 0:2 | 0:2 | 0:2 | 0:2⁵ | 0:2 | 0:2 | 0:2 | 0:(0+2) | 0:(0+2) | 0:2 | DAZ3/4 | Ydel_07 |
| | 1:1 | 0:2 | 0:2 | 0:2 | 1:1 | 1:1 | 1:1 | 0:2 | 1:1 | 1:1 | 0:2 | 1:1 | 1:1 | 1:1 | 1:(0+1) | 0:(1+1) | 0:2 | DAZ2/4 | Ydel_09 |
| | 1:1 | 0:2 | 0:2 | 0:2 | 1:1 | 1:1 | 1:1 | 0:2 | 1:1 | 1:1 | 0:2 | 1:1 | 1:1 | 1:1 | 1:(0+1) | 0:(1+1) | 0:2 | DAZ2/4 | Ydel_10 |
| | 0:2 | 0:2 | 0:2 | 1:1 | 0:2 | 0:2 | 0:2 | 0:2 | 2:0 | 2:0 | 0:2 | 2:0 | 2:0 | 2:0 | 2:(0+0) | 0:(2+0) | 0:2 | DAZ1/2 | Ydel_11 |
| | 0:2 | 0:2 | 0:2 | 1:1 | 0:2 | 0:2 | 0:2 | 0:2 | 2:0 | 2:0 | 0:2 | 2:0 | 2:0 | 2:0 | 2:(0+0) | 0:(2+0) | 0:2 | DAZ1/2 | Ydel_12 |
| | 0:2 | 0:2 | 0:2 | 1:1 | 0:2 | 0:2 | 0:2 | 0:2 | 2:0 | 2:0 | 0:2 | 2:0 | 2:0 | 2:0 | 2:(0+0) | 0:(2+0) | 0:2 | DAZ1/2 | Ydel_13 |
¹ Variants are arranged as family member-specific variant(s):non-specific variant(s) at each SFV position.
² At position 2481 in Fragment I, there is no specific variant (DAZ1/2: G, DAZ3/4: T according to the human reference assembly).
³ At position 111 in Fragment II, there is no specific variant (DAZ1/2: G, DAZ3/4: C according to the human reference assembly). In several of our samples, a T is substituted for one of the Cs.
⁴ At position 1964, there is a DAZ3-specific C and a DAZ4-specific A according to the human reference assembly.
⁵ Variant ratio not in accordance with the concluded deletion subtype.
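Class I markers allow a direct reading of the deletion samples above: presence of the specific variant means the family member is present, and absence means it is absent. The sketch below applies this to the three class I positions from the control panel (1209 and 1702 for DAZ2, 1926 for DAZ1); the function name `class_i_calls` is hypothetical, and only two-part ratios are handled.

```python
# Class I markers and the DAZ family member each one is specific for.
CLASS_I_MARKERS = {"1209": "DAZ2", "1702": "DAZ2", "1926": "DAZ1"}


def class_i_calls(ratios_by_position):
    """Call each family member present/absent from its class I markers."""
    calls = {}
    for position, member in CLASS_I_MARKERS.items():
        specific_copies = int(ratios_by_position[position].split(":")[0])
        status = "present" if specific_copies > 0 else "absent"
        # Markers for the same family member are expected to agree.
        if calls.setdefault(member, status) != status:
            raise ValueError(f"class I markers disagree for {member}")
    return calls


if __name__ == "__main__":
    ydel_06 = {"1209": "1:1", "1702": "1:1", "1926": "1:1"}
    ydel_09 = {"1209": "0:2", "1702": "0:2", "1926": "1:1"}
    ydel_11 = {"1209": "0:2", "1702": "0:2", "1926": "0:2"}
    for name, sample in (("Ydel_06", ydel_06), ("Ydel_09", ydel_09), ("Ydel_11", ydel_11)):
        print(name, class_i_calls(sample))
```

These calls cover only DAZ1 and DAZ2; no class I marker is listed for DAZ3 or DAZ4, so the remaining family members have to be resolved with the class II markers.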
The human reference genome-based characteristics of the studied SFV positions and their variant ratios found in five partial duplication samples.
| Location | 972 | 1209 | 1702 | 1820 | 1926 | 2481 | 111 | 978 | 1005 | 1053 | 1636 | 1646 | 1952 | 1961 | 1964 | 1964 | 2071 | Duplicated | Identifier |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Fragment | I | I | I | I | I | I | II | II | II | II | II | II | II | II | II | II | II | | |
| Marker specificity | DAZ1 | DAZ2 | DAZ2 | DAZ4 | DAZ1 | - | - | DAZ4 | DAZ3 | DAZ3 | DAZ2 | DAZ3 | DAZ3 | DAZ3 | DAZ3 | DAZ4 | DAZ4 | | |
| Variants | A:G | T:C | T:C | C:A | G:A | G:T | G:C(:T) | C:T | G:A | C:T | G:T | A:G | T:A | T:C | C:(A+G) | A:(C+G) | G:C | | |
| Variant ratios | 0:6 | 1:5 | 1:5 | 0:6 | 1:5 | 2:4 | 2:4 | 1:5 | 3:3 | 3:3 | 1:5 | 3:3 | 3:3 | 3:3 | 3:(0+3) | 0:(3+3) | 1:5 | DAZ3/4 | Ydup_01 |
| | 0:6 | 2:4 | 2:4 | 0:6 | 2:4 | 2:4 | 4:2 | 1:5 | 1:5 | 1:5 | 2:4 | 1:5 | 1:5 | 1:5 | 1:(0+5) | 0:(1+5) | 1:5 | DAZ1/2 | Ydup_05 |
| | 1:5 | 2:4 | 2:4 | 2:4 | 1:5 | 1:5 | 3:3 | 0:6 | 3:3 | 3:3 | 2:4 | 3:3 | 3:3 | 3:3 | 3:(0+3) | 0:(3+3) | 0:6 | DAZ2/4 | Ydup_03 |
| | 1:5 | 2:4 | 2:4 | 2:4 | 1:5 | 1:5 | 3:3 | 0:6 | 3:3 | 3:3 | 2:4 | 3:3 | 3:3 | 3:3 | 3:(0+3) | 0:(3+3) | 0:6 | DAZ2/4 | Ydup_04 |
| | 0:6 | 1:5 | 1:5 | 0:6 | 1:5 | 1:5 | 2:4 | 2:4 | 2:4 | 2:4 | 1:5 | 2:4 | 2:4 | 2:4 | 2:(2+2) | 2:(2+2) | 2:4 | DAZ3/4 | Ydup_02 |
¹ Variants are arranged as family member-specific variant(s):non-specific variant(s) at each SFV position.
² At position 2481 in Fragment I, there is no specific variant (DAZ1/2: G, DAZ3/4: T according to the human reference assembly).
³ At position 111 in Fragment II, there is no specific variant (DAZ1/2: G, DAZ3/4: C according to the human reference assembly). In several of our samples, a T is substituted for one of the Cs.
⁴ At position 1964, there is a DAZ3-specific C and a DAZ4-specific A according to the human reference assembly.