| Literature DB >> 25957587 |
Wei Wei1, Tomas W Fitzgerald, Tomas Fitzgerald, Qasim Ayub, Andrea Massaia, Blair H Smith, Blair B Smith, Anna F Dominiczak, Anna A Dominiczak, Andrew D Morris, Andrew A Morris, David J Porteous, David D Porteous, Matthew E Hurles, Chris Tyler-Smith, Yali Xue.
Abstract
We have assessed copy number variation (CNV) in the male-specific part of the human Y chromosome discovered by array comparative genomic hybridization (array-CGH) in 411 apparently healthy UK males, and validated the findings using SNP genotype intensity data available for 149 of them. After manual curation taking account of the complex duplicated structure of Y-chromosomal sequences, we discovered 22 curated CNV events considered validated or likely, mean 0.93 (range 0-4) per individual. 16 of these were novel. Curated CNV events ranged in size from <1 kb to >3 Mb, and in frequency from 1/411 to 107/411. Of the 24 protein-coding genes or gene families tested, nine showed CNV. These included a large duplication encompassing the AMELY and TBL1Y genes that probably has no phenotypic effect, partial deletions of the TSPY cluster and AZFc region that may influence spermatogenesis, and other variants with unknown functional implications, including abundant variation in the number of RBMY genes and/or pseudogenes, and a novel complex duplication of two segments overlapping the AZFa region and including the 3' end of the UTY gene.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25957587 PMCID: PMC4460274 DOI: 10.1007/s00439-015-1562-5
Source DB: PubMed Journal: Hum Genet ISSN: 0340-6717 Impact factor: 4.132
Fig. 4Haplogroup distribution of curCNVEs present in more than one individual. Left phylogeny of the Y-chromosomal haplogroups detected in 149 samples; branch lengths are arbitrary. Blue circles haplogroup frequency in the entire 149 individuals, with circle area proportional to frequency. Remaining circles haplogroup frequencies in individual curCNVEs
Fig. 1Validated examples of CNVs based on both CNV probe and SNP intensity data. Top panel male-specific euchromatic region of the Y chromosome derived from the UCSC genome browser showing gaps in the reference sequence (black bars), chromosomal regions (yellow Y-specific unique, red X–Y transposed, blue Y-specific repeated, purple heterochromatic), segmental duplications and the CNVEs illustrated in the rest of the figure. Remaining panels paired probe intensity (blue) and SNP intensity (red) plots for seven curCNVEs. Each rawCNVE is indicated by buff shading, chromosomal coordinates in Mb are shown at the bottom and overlapping protein-coding genes within the plot regions are included below the SNP intensities
Fig. 2Likely examples of CNVs based on CNV probe intensity data. a Male-specific euchromatic region of the Y chromosome derived from the UCSC genome browser showing gaps in the reference sequence (black bars), chromosomal regions (yellow Y-specific unique, red X–Y transposed, blue Y-specific repeated, purple heterochromatic), segmental duplications and the CNVEs illustrated in the rest of the figure (green duplications, orange deletions, purple both). b curCNVE1. c curCNVEs 10 and 11. d and f curCNVE5 in two different individuals showing the coordinated decrease or increase of rawCNVEs 5.1, 5.2 and 5.3. e and g similar plots for curCNVE16 and its corresponding rawCNVEs. Each rawCNVE is indicated by buff shading, chromosomal coordinates in Mb are shown at the bottom and overlapping protein-coding genes are plotted at the bottom of b and c, f and g
Summary of 34 rawCNVEs and 22 curCNVEs called in this study
| curCNVE | rawCNVE | rawCNVE | curCNVE | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Start | End | Size (bp) | SNP support | Known | Duplications | Deletions | Total frequency | Protein-coding gene contenta | ||
| curCNVE1 | rawCNVE1 | 2,891,036 | 2,903,671 | 12,635 | 1 | 0.0024 | ||||
| curCNVE2 | rawCNVE2 | 6,138,072 | 9,161,980 | 3,023,908 | + | Murphy et al. ( | 1 | 0.0024 |
| |
| curCNVE3 | rawCNVE3 | 7,659,321 | 8,497,398 | 838,077 | 1 | 0.0024 | ||||
| curCNVE4 | rawCNVE4 | 9,170,730 | 9,175,364 | 4634 | 53 | 11 | 0.1557 |
| ||
| curCNVE5 | rawCNVE5.1 | 9,196,977 | 9,198,235 | 1258 | 8 | 68 | 0.1849 |
| ||
| curCNVE5 | rawCNVE5.2 | 9,311,600 | 9,311,765 | 165 | ||||||
| curCNVE5 | rawCNVE5.3 | 9,383,079 | 9,384,475 | 1396 | ||||||
| curCNVE6 | rawCNVE6 | 9,196,977 | 9,382,943 | 185,966 | Oakey and Tyler-Smith ( | 2 | 29 | 0.0754 |
| |
| curCNVE7 | rawCNVE7 | 9,785,127 | 9,792,677 | 7550 | + | 1 | 0.0024 | |||
| curCNVE8 | rawCNVE8.1 | 14,588,389 | 14,745,226 | 156,837 | + | 1 | 0.0024 | |||
| curCNVE8 | rawCNVE8.2 | 15,034,145 | 15,475,430 | 441,285 | + |
| ||||
| curCNVE9 | rawCNVE9 | 15,144,435 | 15,146,222 | 1787 | 1 | 0.0024 | ||||
| curCNVE10 | rawCNVE10 | 15,869,445 | 16,096,260 | 226,815 | 1 | 0.0024 | ||||
| curCNVE11 | rawCNVE11 | 16,170,165 | 16,233,113 | 62,948 | 1 | 0.0024 | ||||
| curCNVE12 | rawCNVE12 | 18,733,053 | 18,762,614 | 29,561 | + | 1 | 0.0024 | |||
| curCNVE13 | rawCNVE13 | 21,032,549 | 21,074,621 | 42,072 | + | 1 | 0.0024 | |||
| curCNVE14 | rawCNVE14.1 | 23,441,081 | 23,649,415 | 208,334 | + | 1 | 0.0024 | |||
| curCNVE14 | rawCNVE14.2 | 23,756,420 | 24,005,801 | 249,381 | + | |||||
| curCNVE15 | rawCNVE15 | 24,218,723 | 24,218,783 | 60 | 18 | 1 | 0.0462 |
| ||
| curCNVE16 | rawCNVE16.1 | 23,660,808 | 23,709,077 | 48,269 | 31 | 76 | 0.2603 |
| ||
| curCNVE16 | rawCNVE16.2 | 24,005,497 | 24,062,091 | 56,594 |
| |||||
| curCNVE16 | rawCNVE16.3 | 24,316,281 | 24,327,019 | 10,738 |
| |||||
| curCNVE16 | rawCNVE16.4 | 24,551,695 | 24,562,435 | 10,740 |
| |||||
| curCNVE17 | rawCNVE17.1 | 24,551,695 | 24,658,825 | 107,130 | 4 | 19 | 0.0560 |
| ||
| curCNVE17 | rawCNVE17.2 | 24,551,695 | 24,795,554 | 243,859 |
| |||||
| curCNVE18 | rawCNVE18 | 25,130,433 | 27,895,495 | 2,765,062 | Fernandes et al. ( | 8 | 17 | 0.0608 |
| |
| curCNVE19 | rawCNVE19 | 24,658,743 | 25,428,575 | 769,832 | Repping et al. ( | 2 | 9 | 0.0268 |
| |
| curCNVE20 | rawCNVE20.1 | 25,284,428 | 25,428,580 | 144,152 | Saxena et al. ( | 7 | 6 | 0.0316 |
| |
| curCNVE20 | rawCNVE20.2 | 26,950,819 | 27,177,168 | 226,349 |
| |||||
| curCNVE21 | rawCNVE21.1 | 25,829,578 | 26,194,226 | 364,648 | Machev et al. ( | 1 | 1 | 0.0049 |
| |
| curCNVE21 | rawCNVE21.2 | 27,768,203 | 27,768,295 | 92 |
| |||||
| curCNVE22 | rawCNVE22.1 | 28,472,070 | 28,654,473 | 182,403 | + | 1 | 0.0024 | |||
| curCNVE22 | rawCNVE22.2 | 28,688,829 | 28,704,081 | 15,252 | + | |||||
| curCNVE22 | rawCNVE22.3 | 28,723,589 | 28,804,541 | 80,952 | + | |||||
Genome coordinates are based on GRCh37/hg19. Gene names are from GENCODE v20
aGenes showing CNV. For genes that are members of families, the copy that is actually duplicated or deleted is unknown because of shadowing effects
Fig. 3Size and frequency distribution of validated and likely CNVs in 238 individuals. a rawCNVE size distribution. b curCNVE frequency distribution; each frequency bar is coloured according to the proportion of duplications (green) and deletions (orange) among the total 381 curCNVE calls
Fig. 5Novel partial duplication of UTY. curCNVE8 showing the relationship with the AZFa deletion and the protein-coding genes in the region