| Literature DB >> 26194999 |
Boon-Peng Hoh1,2,3, Lian Deng4, Mat Jusoh Julia-Ashazila5, Zakaria Zuraihan5, Ma'amor Nur-Hasnah5, Ab Rajab Nur-Shafawati6, Wan Isa Hatin6, Ismail Endom7, Bin Alwi Zilfalil8, Yusoff Khalid5,9, Shuhua Xu10,11,12.
Abstract
Fine scale population structure of Malays - the major population in Malaysia, has not been well studied. This may have important implications for both evolutionary and medical studies. Here, we investigated the population sub-structure of Malay involving 431 samples collected from all states from peninsular Malaysia and Singapore. We identified two major clusters of individuals corresponding to the north and south peninsular Malaysia. On an even finer scale, the genetic coordinates of the geographical Malay populations are in correlation with the latitudes (R(2) = 0.3925; P = 0.029). This finding is further supported by the pairwise FST of Malay sub-populations, of which the north and south regions showed the highest differentiation (FST [North-south] = 0.0011). The collective findings therefore suggest that population sub-structure of Malays are more heterogenous than previously expected even within a small geographical region, possibly due to factors like different genetic origins, geographical isolation, could result in spurious association as demonstrated in our analysis. We suggest that cautions should be taken during the stage of study design or interpreting the association signals in disease mapping studies which are expected to be conducted in Malay population in the near future.Entities:
Mesh:
Year: 2015 PMID: 26194999 PMCID: PMC4509480 DOI: 10.1186/s40246-015-0039-x
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Fig. 1Principle Component Analysis (PCA) (a) Global PCA including populations from HapMap3. GIH, Gujarati India Houston; CEU, Northern and Western European from CEPH collection; YRI, Yoruba Ibadan from Nigeria; CHB, Chinese Beijing; JPT, Japanese Tokyo; MEX, Mexican ancestry from Los Angeles; MAS, Metropolitan Malays from Singapore; PMM, Malays from Peninsular Malay. The Malay populations are of East Asian descendant. (b) PCA plot including samples categorized into North vs Centre vs South; (c) PCA plot which included only North vs South. Symbols in red represent the northern region; symbols in blue represent southern region. Several outliers were excluded from the PCA plot
Fig. 2ADMXITURE analysis of the Malay populations classified according to regions. The bottom plots represented by percentages (Y-axis) indicates the average ADMIXTURE values for each region
Fig. 3Average PC1 values of the Malay sub-populations from Peninsular Malaysia and Singapore. Standard error of each population is indicated. The PC1 values correlated well to the geographical locations of each population except for Johor
Fig. 4Correlation between PC1 and latitude coordinate (P = 0.029)
Pairwise FST bootstrap values of the Malay between the 3 regions of Peninsular Malaysia
| North | Centre | South | |
|---|---|---|---|
| North | - | 0.00083315 (CI =2.0684E-04) | 0.00111661 (CI = 2.68108E-06) |
| Centre | - | 0.00058556 (CI = 4.0972E-06) | |
| South | - |
Pairwise FST values calculated by bootstrap resampling 1,000 replications
Top 0.1 % SNPs that are highly differentiated between the Malays from northern and southern region of Peninsular (total SNP = 42633)
| rsID | Chr | Position | Minor allele | FST | MAF_North | MAF_South | Gene | Category |
|---|---|---|---|---|---|---|---|---|
| rs4149264 | 9 | 107,677,211 | C | 0.2256 | 0.4856 | 0.1682 |
| intronic |
| rs10102377 | 8 | 83,762,822 | T | 0.2251 | 0.4097 | 0.2336 | ||
| rs4148475 | 13 | 95,853,574 | A | 0.2242 | 0.4676 | 0.184 |
| intronic |
| rs1056836 | 2 | 38,298,203 | G | 0.2037 | 0.4757 | 0.1934 |
| coding |
| rs1126965 | 17 | 70,642,790 | G | 0.1931 | 0.5 | 0.1822 |
| 3utr |
| rs17769090 | 15 | 70,630,120 | A | 0.1636 | 0.4648 | 0.2381 | ||
| rs6974363 | 7 | 47,633,187 | G | 0.1421 | 0.493 | 0.2333 | ||
| rs837395 | 1 | 47,269,338 | A | 0.1400 | 0.4897 | 0.2383 |
| intronic |
| rs4646430 | 2 | 38,306,415 | G | 0.1384 | 0.4621 | 0.1981 | ||
| rs215101 | 16 | 16,052,973 | G | 0.1206 | 0.4752 | 0.271 |
| intronic |
| rs12920607 | 16 | 73,728,620 | C | 0.1183 | 0.475 | 0.2736 | ||
| rs837398 | 1 | 47,266,422 | A | 0.1124 | 0.4897 | 0.2664 |
| intronic |
| rs809367 | 10 | 89,741,806 | A | 0.1088 | 0.4307 | 0.2009 | ||
| rs316133 | 6 | 52,847,551 | C | 0.0957 | 0.4823 | 0.2594 |
| intronic |
| rs6130511 | 20 | 42,681,088 | A | 0.0916 | 0.2801 | 0.1 |
| intronic |
| rs2132845 | 4 | 140,587,125 | T | 0.0910 | 0.4255 | 0.215 |
| 5utr |
| rs5761313 | 22 | 26,313,745 | T | 0.0887 | 0.4964 | 0.2804 |
| intronic |
| rs10485805 | 20 | 54,945,783 | G | 0.0853 | 0.4397 | 0.2336 |
| intronic |
| rs10489142 | 1 | 7,363,310 | G | 0.0835 | 0.4507 | 0.3364 |
| intronic |
| rs2274928 | 13 | 24,044,546 | A | 0.0779 | 0.3601 | 0.4346 |
| intronic |
| rs11935505 | 4 | 145,226,422 | A | 0.0758 | 0.04643 | 0.1682 | ||
| rs1566869 | 12 | 52,266,348 | A | 0.0695 | 0.3406 | 0.1682 | ||
| rs1884897 | 20 | 6,612,832 | G | 0.0695 | 0.2812 | 0.1215 | ||
| rs4530975 | 7 | 104,415,415 | T | 0.0679 | 0.1862 | 0.05607 |
| intronic |
| rs6024831 | 20 | 54,938,464 | G | 0.0675 | 0.4161 | 0.3915 |
| intronic |
| rs1160798 | 6 | 112,438,446 | C | 0.0661 | 0.06897 | 0.1934 |
| intronic |
| rs2158196 | 4 | 114,416,596 | C | 0.0658 | 0.2391 | 0.09434 |
| intronic |
| rs16961766 | 13 | 103,899,499 | A | 0.0645 | 0.3143 | 0.1524 | ||
| rs10962015 | 9 | 15,387,949 | A | 0.0638 | 0.2482 | 0.1028 | ||
| rs6884962 | 5 | 172,682,382 | A | 0.0623 | 0.4896 | 0.3271 | ||
| rs17126776 | 12 | 39,311,625 | A | 0.0603 | 0.1759 | 0.3318 | ||
| rs2755209 | 13 | 41,137,804 | C | 0.0602 | 0.25 | 0.1075 |
| intronic |
| rs9783586 | 13 | 108,361,559 | T | 0.0601 | 0.2671 | 0.4393 |
| intronic |
| rs10968093 | 9 | 27,753,227 | A | 0.0594 | 0.06338 | 0.1776 | ||
| rs2458286 | 8 | 103,978,699 | T | 0.0591 | 0.4424 | 0.3774 | ||
| rs4875364 | 8 | 4,444,592 | C | 0.0583 | 0.3094 | 0.1557 |
| intronic |
| rs11145506 | 9 | 80,264,584 | T | 0.0578 | 0.3821 | 0.217 |
| intronic |
| rs11604366 | 11 | 28,887,766 | C | 0.0577 | 0.2695 | 0.4387 | ||
| rs9881633 | 3 | 112,881,539 | T | 0.0574 | 0.3 | 0.472 |
| intronic |
| rs5762448 | 22 | 28,408,444 | C | 0.0573 | 0.3514 | 0.1916 |
| intronic |
| rs6467991 | 7 | 83,954,737 | C | 0.0570 | 0.344 | 0.4811 | ||
| rs10089677 | 8 | 122,660,248 | A | 0.0569 | 0.2329 | 0.09813 | ||
| rs1923254 | 13 | 41,084,241 | G | 0.0567 | 0.3776 | 0.2143 | ||
| rs7813806 | 8 | 5,142,665 | C | 0.0567 | 0.2817 | 0.1355 | ||
| rs7625411 | 3 | 112,811,428 | A | 0.0564 | 0.3403 | 0.486 | ||
| rs2922249 | 6 | 127,954,614 | C | 0.0564 | 0.09441 | 0.2196 | ||
| rs2294088 | 8 | 124,526,607 | A | 0.0564 | 0.4514 | 0.2804 |
| intronic |
| rs12289262 | 11 | 12,894,758 | T | 0.0560 | 0.3986 | 0.2336 |
| intronic |
| rs4937523 | 11 | 130,347,190 | T | 0.0558 | 0.2937 | 0.4626 |
| intronic |
| rs10807768 | 7 | 13,662,014 | A | 0.0554 | 0.3169 | 0.1651 | ||
| rs976272 | 14 | 61,449,328 | A | 0.0551 | 0.4897 | 0.3178 |
| coding |
| rs13027801 | 2 | 143,602,503 | C | 0.0551 | 0.2862 | 0.4533 | ||
| rs17701834 | 19 | 22,121,458 | G | 0.0549 | 0.2172 | 0.08879 | ||
| rs7193843 | 16 | 54,677,292 | G | 0.0548 | 0.1448 | 0.285 | ||
| rs7097885 | 10 | 16,506,501 | C | 0.0542 | 0.2832 | 0.4486 |
| intronic |
| rs2791398 | 1 | 245,965,551 | G | 0.0540 | 0.05944 | 0.1651 |
| intronic |
| rs10486802 | 7 | 39,723,768 | A | 0.0531 | 0.1884 | 0.07009 |
| intronic |
| rs8031676 | 15 | 96,910,440 | C | 0.0529 | 0.4306 | 0.2664 | ||
| rs7186479 | 16 | 82,602,736 | C | 0.0527 | 0.2517 | 0.1168 | ||
| rs6054383 | 20 | 6,584,604 | T | 0.0526 | 0.3986 | 0.2383 | ||
| rs4460308 | 7 | 104,420,060 | C | 0.0521 | 0.1866 | 0.07009 |
| intronic |
| rs3775779 | 4 | 70,709,207 | A | 0.0520 | 0.476 | 0.3551 |
| intronic |
| rs9375877 | 6 | 132,690,239 | G | 0.0517 | 0.4862 | 0.3458 |
| intronic |
| rs2180691 | 20 | 54,964,361 | A | 0.0517 | 0.45 | 0.2857 |
| intronic |
| rs7778955 | 7 | 39,740,487 | G | 0.0517 | 0.1438 | 0.04206 |
| intronic |
| rs4608114 | 12 | 92,384,658 | A | 0.0517 | 0.4366 | 0.2736 |
| intronic |
| rs6946733 | 7 | 106,670,288 | A | 0.0514 | 0.4281 | 0.2664 | ||
| rs816650 | 10 | 601,089 | T | 0.0514 | 0.114 | 0.2406 |
| intronic |
| rs6490805 | 13 | 24,084,809 | C | 0.0507 | 0.08394 | 0.1981 | ||
| rs17171480 | 7 | 35,585,669 | A | 0.0506 | 0.09286 | 0.2103 | ||
| rs17015112 | 3 | 77,319,487 | G | 0.0505 | 0.4545 | 0.3785 |
| intronic |
| rs573186 | 3 | 124,178,276 | C | 0.0503 | 0.2483 | 0.1168 |
| intronic |
| rs10752609 | 1 | 154,791,128 | A | 0.0501 | 0.3147 | 0.1698 |
| intronic |
| rs1862737 | 16 | 75,281,964 | C | 0.0500 | 0.4155 | 0.257 |
| 5utr |
Fig. 5The geographical map of Peninsular Malaysia. The sampling locations are shown in red dots
Regional categorization of the Peninsular Malaysia states according to geographical locations and final number of sample included after QC
| Region | States | Latitude coordinate* | No. subjects |
|---|---|---|---|
| North | Perlis | 6°23′40.06″N (6.394462) | 7 |
| Kedah | 5°19′45.2″N (5.329221) | ||
| Pulau Pinang | 5°19′45.2″N (5.329221 | ||
| Perak | 4°37′8.46″N (4.619018) | 5 | |
| Kelantan (Kota Bharu) | 5°42′55.13″N (5.715314) | 56 | |
| Kelantan (Jeli) | 5°29′49.4″N (5.497056) | 74 | |
| Terengganu | 5°19′30.33″N (5.325092) | 4 | |
| Centre | Pahang (Pekan) | 3°29′32″N (3.492092) | 51 |
| Selangor | 2°50′34.7″N (2.842971) | 98 | |
| South | Negeri Sembilan | 2°33′12.75″N (2.553541) | 9 |
| Melaka | 2°12′11.81″N (2.203281) | 5 | |
| Johor | 1°27′41.98″N (1.461662) | 4 | |
| Singapore (Malay) | 1°17′13.35″N (1.287043) | 89 |
*Latitude coordinate from Yandex (http://map.yandex.com)