| Literature DB >> 28378101 |
Andrea Massaia1,2, Yali Xue3.
Abstract
The human Y chromosome provides a fertile ground for structural rearrangements owing to its haploidy and high content of repeated sequences. The methodologies used for copy number variation (CNV) studies have developed over the years. Low-throughput techniques based on direct observation of rearrangements were developed early on, and are still used, often to complement array-based or sequencing approaches which have limited power in regions with high repeat content and specifically in the presence of long, identical repeats, such as those found in human sex chromosomes. Some specific rearrangements have been investigated for decades; because of their effects on fertility, or their outstanding evolutionary features, the interest in these has not diminished. However, following the flourishing of large-scale genomics, several studies have investigated CNVs across the whole chromosome. These studies sometimes employ data generated within large genomic projects such as the DDD study or the 1000 Genomes Project, and often survey large samples of healthy individuals without any prior selection. Novel technologies based on sequencing long molecules and combinations of technologies, promise to stimulate the study of Y-CNVs in the immediate future.Entities:
Mesh:
Year: 2017 PMID: 28378101 PMCID: PMC5418319 DOI: 10.1007/s00439-017-1788-5
Source DB: PubMed Journal: Hum Genet ISSN: 0340-6717 Impact factor: 4.132
Fig. 1Summary of CNVs on the human Y chromosome. a Male-specific euchromatic region of the Y chromosome. The Y-specific unique region is shown in yellow, the X–Y transposed region in red, Y-specific repeats in blue, heterochromatic segments in purple and other regions in grey. b Medically important Y-CNVs. c CNVs discovered from population studies. Deletions are shown in orange, duplications in green, and deletions/duplications in blue. d Y-CNV mutation events inferred from the available data. Single events are shown in yellow, recurrent events in blue and unknown ones in dark grey. e Y-CNVs associated with segmental duplications or other repeats are shown in dark orange, and non-repeat-associated ones in yellow
Fig. 2Shadowing effect for intensity data. The top half shows schematic representations of CNVs and the corresponding intensity data plots. a A unique region (left) or duplicated region (right) in the reference genome is shown in red. b Corresponding plots showing the intensity signal for each probe, here represented by a single dot, on the Y axis, and the position for the probe on the X axis. c A hypothetical duplication of the unique region (left), and of one of the copies of the duplicated region (right). d The unique region will show a stronger increase in signal (left), as compared to the duplicated region (right); in the duplicated region, moreover, the increase will be detected in both reference copies, as the method is unable to distinguish between them. The bottom half shows real examples for both CNVs in unique regions (left) and in a repeated region showing the shadowing effect (right) (from Wei et al. 2015). On the right, the RBMY gene copies all show co-ordinated intensity changes
Summary of CNV detection and follow-up methods used currently or in the past in Y chromosome studies
| Method | Resolution* | Throughput | Analysis procedure | Application |
|---|---|---|---|---|
| Karyotyping | 5 Mb | Low | Visual inspection | Genome-wide detection |
| Interphase FISH | 100 kb–1 Mb | Low | Visual inspection | Validation, detection of complex structures |
| Fibre-FISH | 10 kb–1 Mb | Low | Visual inspection | Validation, detection of complex structures |
| BAC array CGH | 100 kb | High | Intensity detection and processing | Genome-wide detection |
| Oligo array CGH | 500 bp | High | Intensity detection and processing | Genome-wide detection, validation |
| Short read WG sequencing | 1 bp | High | Read mapping and variant calling, de novo assembly | Genome-wide detection |
| Long read WG sequencing | 1 bp | High | Read mapping and variant calling, de novo assembly | Genome-wide detection |
| qPCR, paralogue ratio test (PRT) | 200 bp | Medium | Fluorescence detection | Validation, copy number quantification |
| Digital droplet PCR (ddPCR) | 200 bp | High | Fluorescence detection | Validation, copy number quantification |
| Sequence-tagged site (STS) | 200 bp | Medium | Electrophoresis: band presence/absence | Validation, targeted assay |
| Breakpoint PCR | 1 bp | Low | Electrophoresis: band presence/absence | Validation and refinement |
* Resolution indicates the (approximate) minimum size of variants each method is able to detect, except when a range is given, where the maximum is also indicated. Note that not all methods are suitable for all CNVs; further details are given in the text
Fig. 3Numbers of CNV mutational events inferred from the phylogenetic tree. a Phylogeny based on the 1000 Genomes Project phase 3 data (from Poznik et al. 2016). b Examples of single mutational event CNVs (light yellow background) and multiple event CNVs (light green background). The 7.4 deletion at 5.8 Mb CNV, with a thicker surround, indicates that all sampled members of this haplogroup carry this CNV, while in the other examples only some member(s) of the haplogroup carry the CNV. For each listed CNV, approximate chromosomal position in GRCh37 is given by, e.g. ‘at 16.3 Mb’