| Literature DB >> 22844521 |
Yong-Biao Zhang1, Xin Li, Feng Zhang, Duen-Mei Wang, Jun Yu.
Abstract
Genetic features of Tibetans have been broadly investigated, but the properties of copy number variation (CNV) have not been well examined. To get a preliminary view of CNV in Tibetans, we scanned 29 Tibetan genomes with the Illumina Human-1 M high-resolution genotyping microarray and identified 139 putative copy number variable regions (CNVRs), consisting of 70 deletions, 61 duplications, and 8 multi-allelic loci. Thirty-four of the 139 CNVRs showed differential allele frequencies versus other East-Asian populations, with P values <0.0001. These results indicated a distinct pattern of CNVR allele frequency distribution in Tibetans. The Tibetan CNVRs are enriched for genes in the disease class of human reproduction (such as genes from the DAZ, BPY2, CDY, and HLA-DQ and -DR gene clusters) and biological process categories of "response to DNA damage stimulus" and "DNA repair" (such as RAD51, RAD52, and MRE11A). These genes are related to the adaptive traits of high infant birth weight and darker skin tone of Tibetans, and may be attributed to recent local adaptation. Our results provide a different view of genetic diversity in Tibetans and new insights into their high-altitude adaptation.Entities:
Mesh:
Year: 2012 PMID: 22844521 PMCID: PMC3402393 DOI: 10.1371/journal.pone.0041768
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Structural variation map of Tibetan samples.
The schematic summarizes the distribution of duplications, deletions and multi-allelic loci on each human chromosome.
Characteristic summary of CNVRs found in Tibetan and other populations.
| Population | Tibetan | Han Taiwanese | Han Chinese | Han Taiwanese | Korean | CHB+JPT | CEU | YRI |
| Sample size | 29 | 300 | 692 | 813 | 3578 | 90 | 90 | 90 |
| Type of chips | Illu. 1 M | Affy. 100 k | Affy. 500 k | Illu. 550 k | Affy. 5.0 | Affy. 6.0 | Affy. 6.0 | Affy. 6.0 |
| Number of CNVRs | 139 | 230 | 725 | 365 | 4003 | 646 | 657 | 916 |
| Average size of CNV (kb) | 313 | 186 | 299 | 107 | 18.9 | ND | ND | ND |
| Average number of CNVsper individual genome | 19.38 | 5.02 | 10 | 4.6 | 40.3 | ND | ND | ND |
| Average CNV coverage per genome (Mb) | 6.07 | 0.93 | 3.56 | 0.497 | ND | ND | ND | ND |
| Average size of CNVR (kb) | 370.6 | 322 | 356 | 235.1 | 60.4 | 35.5 | 35.12 | 25.5 |
| Number of overlappedCNVRs with Tibetan | 139 | 9 | 25 | 11 | 33 | 45 | 47 | 53 |
| Number of reportedCNVRs in DGV | 121 | 133 | 529 | 298 | 1926 | 646 | 657 | 916 |
Abbreviation: Illu., Illumina; Affy., Affymetrix; ND, No data; kb, kilo base; Mb, mega base; CNV, copy number variation; CNVR, copy number variation region; CHB, Han Chinese; JPT, Japanese; CEU, descendant of European; YRI, Yoruban.
CNVRs with Entrez gene and significant difference in allele frequency.
| Chr | Start-Stop (kb) | Size (bp) | Type | Allele frequency | Modified | Entrez gene list | |||
| Tibetan | EA | CEU | YRI | ||||||
| 4 | 128908–129039 | 130875 | deletion | 10.34 | NR | NR | NR | / |
|
| 4 | 69047–69210 | 162544 | deletion | 75.86 | 32.85 | 55.56 | 38.89 | 3×10−6 |
|
| 5 | 68845–70959 | 2113387 | deletion | 13.79 | NR | NR | NR | / |
|
| 6 | 32561–32767 | 205223 | deletion | 6.90 | 67.18 | 92.22 | 98.89 | 3×10−11 |
|
| 7 | 101901–102109 | 208600 | deletion | 20.69 | NR | NR | NR | / |
|
| 11 | 48346–49011 | 664649 | duplication | 41.38 | 2.17 | 5.56 | 0.00 | 2×10−26 |
|
| 15 | 18811–20093 | 1281179 | multi-allele | 27.59 | 71.36 | 71.11 | 52.22 | 1×10−06 |
|
| 17 | 18855–19017 | 162199 | deletion | 31.03 | NR | NR | NR | / |
|
| 22 | 21199–21591 | 391792 | deletion | 17.24 | NR | NR | NR | / |
|
| X | 61647–64531 | 2884335 | duplication | 44.83 | 4.09 | 55.56 | 58.89 | 2×10−24 |
|
| X | 47756–47870 | 113782 | deletion | 6.90 | 50.00 | 51.11 | 58.89 | 9×10−05 |
|
| Y | 23611–24132 | 520913 | duplication | 20.69 | 66.67 | 53.33 | 43.33 | 4×10−05 |
|
| Y | 24259–27226 | 2967515 | duplication | 17.24 | 67.78 | 53.33 | 43.33 | 5×10−06 |
|
| Y | 18179–19061 | 881529 | duplication | 34.48 | NR | NR | NR | / |
|
Abbreviation: NR, not reported; kb, kilo base; bp, base pair; EA, East-Asian.
the allele frequency was calculated with populations from East-Asian in Table 1 except Han Taiwanese [25] for data unavailable.
P value was estimate from Tibetan and East Asian.
Genes in Tibetan CNVRs that are associated with disease risk.
| CLASS | Genetic association diseases | Gene count |
| Gene list |
| reproduction | infertility, male | 10 | 0.0188 |
|
| pregnancy loss, recurrent | ||||
| birth weight diabetes | ||||
| type 1 head circumference at birth | ||||
| birth weight body mass | ||||
| birth weight gestational infections | ||||
| cancer | breast cancer | 13 | 0.0238 |
|
| ovarian cancer | ||||
| prostate cancer | ||||
| infection | HIV | 9 | 0.0439 |
|
| arthritis, psoriatic | ||||
| lupus erythematosus | ||||
| rheumatoid arthritis | ||||
| sclerosis, systemic |
GO term enrichment of genes within Tibetan CNVRs.
| GO term | GO category | Count |
| Genes |
| GO:0007600 | sensory perception | 20 | 1.12E-04 |
|
| GO:0042742 | defense response to bacterium | 11 | 5.22E-04 |
|
| GO:0006974 | response to DNA damage stimulus | 11 | 0.002 |
|
| GO:0007565 | female pregnancy | 7 | 0.003 |
|
| GO:0006281 | DNA repair | 9 | 0.004 |
|
| GO:0019882 | antigen processing and presentation | 5 | 0.007 |
|
| GO:0009617 | response to bacterium | 11 | 0.008 |
|
| GO:0033554 | cellular response to stress | 12 | 0.013 |
|
| GO:0007131 | reciprocal meiotic recombination | 3 | 0.020 |
|
| GO:0006312 | mitotic recombination | 2 | 0.044 |
|