| Literature DB >> 22413087 |
Hui-Qi Qu, Quan Li, Shuhua Xu, Joseph B McCormick, Susan P Fisher-Hoch, Momiao Xiong, Ji Qian, Li Jin.
Abstract
The population of Han Chinese is ∼1.226 billion people. Genetic heterogeneity between northern Han Chinese (N-Han) and southern Han Chinese (S-Han) has been demonstrated by recent genome-wide studies. As an initial step toward health disparities and personalized medicine in Chinese population, this study developed a set of ancestry informative markers (AIM) for Han Chinese population.Entities:
Keywords: Han Chinese; ancestry informative marker; genetic association study; population structure
Year: 2012 PMID: 22413087 PMCID: PMC3291503 DOI: 10.1534/g3.112.001941
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Geographic distribution of the 236 Han Chinese individuals
| Geographic Location | Number of Individuals | Historic Classification |
|---|---|---|
| Beijing | 22 | Northern Han |
| Gansu | 13 | Northern Han |
| Non-specific northern Han | 9 | Northern Han |
| Hebei | 39 | Northern Han |
| Heilongjiang | 7 | Northern Han |
| Henan | 10 | Northern Han |
| Jilin | 3 | Northern Han |
| Liaoning | 4 | Northern Han |
| Neimeng | 7 | Northern Han |
| Ningxia | 2 | Northern Han |
| Shandong | 26 | Northern Han |
| Shannxi | 3 | Northern Han |
| Shanxi | 10 | Northern Han |
| Tianjin | 1 | Northern Han |
| Xinjiang | 6 | Northern Han |
| Anhui | 1 | Southern Han |
| Guangdong | 24 | Southern Han |
| Guangxi | 1 | Southern Han |
| Hubei | 2 | Southern Han |
| Hunan | 1 | Southern Han |
| Jiangsu | 11 | Southern Han |
| Jiangxi | 2 | Southern Han |
| Shanghai | 14 | Southern Han |
| Sichuan | 3 | Southern Han |
| Yunnan | 2 | Southern Han |
| Zhejiang | 13 | Southern Han |
Figure 1 Maximum Matthews correlation coefficient (MCC) of principal component analysis (PCA) clustering using different number of ancestry informative markers (AIM). The clustering performance is compromised obviously when the number of AIMs decreases to 30. Horizontal axis: the number of SNPs with robust I. Vertical axis: maximum MCC of each set of AIMs.
Classification performance of different number of AIMs
| Number of AIMs | PC1 Cutoff | MCC | Specificity | Sensitivity |
|---|---|---|---|---|
| 15 | −0.04 | 0.810 | 0.966 | 0.901 |
| 20 | −0.03 | 0.871 | 0.930 | 0.952 |
| 25 | −0.03 | 0.901 | 0.971 | 0.952 |
| 30 | −0.02 | 0.902 | 0.932 | 0.969 |
| 35 | −0.03 | 0.951 | 0.986 | 0.976 |
| 40 | −0.03 | 0.961 | 0.986 | 0.982 |
| 45 | −0.04 | 0.951 | 1.000 | 0.970 |
| 50 | −0.03 | 0.961 | 1.000 | 0.976 |
| 60 | −0.03 | 0.971 | 1.000 | 0.982 |
| 70 | −0.03 | 0.990 | 1.000 | 0.994 |
| 80 | −0.03 | 0.990 | 1.000 | 0.994 |
| 90 | −0.02 | 0.990 | 0.987 | 1.000 |
| 100 | −0.03 | 1.000 | 1.000 | 1.000 |
| 110 | −0.04 | 0.990 | 1.000 | 0.994 |
| 120 | −0.04 | 0.990 | 1.000 | 0.994 |
| 130 | −0.03 | 1.000 | 1.000 | 1.000 |
| 140 | −0.04 | 1.000 | 1.000 | 1.000 |
| 150 | −0.03 | 1.000 | 1.000 | 1.000 |
AIM, ancestry informative marker; MCC, Matthews correlation coefficient; PC1, first principal component.