| Literature DB >> 25811879 |
Emile R Chimusa1, Ayton Meintjies2, Milaine Tchanga2, Nicola Mulder2, Cathal Seoighe, Cathal Seioghe3, Himla Soodyall4, Rajkumar Ramesar5.
Abstract
We report a study of genome-wide, dense SNP (∼ 900K) and copy number polymorphism data of indigenous southern Africans. We demonstrate the genetic contribution to southern and eastern African populations, which involved admixture between indigenous San, Niger-Congo-speaking and populations of Eurasian ancestry. This finding illustrates the need to account for stratification in genome-wide association studies, and that admixture mapping would likely be a successful approach in these populations. We developed a strategy to detect the signature of selection prior to and following putative admixture events. Several genomic regions show an unusual excess of Niger-Kordofanian, and unusual deficiency of both San and Eurasian ancestry, which were considered the footprints of selection after population admixture. Several SNPs with strong allele frequency differences were observed predominantly between the admixed indigenous southern African populations, and their ancestral Eurasian populations. Interestingly, many candidate genes, which were identified within the genomic regions showing signals for selection, were associated with southern African-specific high-risk, mostly communicable diseases, such as malaria, influenza, tuberculosis, and human immunodeficiency virus/AIDs. This observation suggests a potentially important role that these genes might have played in adapting to the environment. Additionally, our analyses of haplotype structure, linkage disequilibrium, recombination, copy number variation and genome-wide admixture highlight, and support the unique position of San relative to both African and non-African populations. This study contributes to a better understanding of population ancestry and selection in south-eastern African populations; and the data and results obtained will support research into the genetic contributions to infectious as well as non-communicable diseases in the region.Entities:
Mesh:
Year: 2015 PMID: 25811879 PMCID: PMC4374865 DOI: 10.1371/journal.pgen.1005052
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1(A) Admixture analysis in southern African and other sub-Saharan African populations compared with Europeans and Asians.
From Fig. 1A, ‡Khomani (KHO), Xhosa (XHS) and Sandawe (SAW) are 3-way, 2-way and 3-way admixed, respectively consistent with previous results [16, 33, 36]. (B-D) Admixture analyses using best proxy ancestral populations of each admixed southern African population. (B) For ‡Khomani (KHO) (C) for Sandawe (SAW) and (D) for Xhosa (XHS).
Fig 2Relationship between genetic distances from southern African populations and their corresponding geographic distances.
We identify 10 outlier points (points situated at 0.05 units from the regression line), suggesting possible obstacles to migration (see S4 Table).
Fig 3(A) Mean block sizes per chromosome and analysis panel.
Error bars are twice the standard error of the mean. (B) Decay of linkage disequilibrium with physical distance along chromosome 1 for each analysis panel.
Fig 4Proportion of shared recombination hotpots between the southern African and HapMap populations (CEU, YRI).
List of SNPs whose unusual differentiation between pair-wise indigenous southern African populations are genome-wide significant.
| CHR | SNP | A1/A2 | P values | Gene | Pathway | Associated Disease | ||||
|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||
| 1q41 | rs11118642 | C/A | 1.1e-13 | HLX | Metabolic | Fryns syndrome, Hernia, acute myeloid leukemia | ||||
|
| ||||||||||
| 17q25.1 | rs2279053 | C/A | 5.5e-10 | QRICH2 | Metabolic | Drug metabolism other enzymes | ||||
| 12q24.21 | rs4767374 | C/A | 1.3e-09 | MED13L | Obesity/Transposition-of-Great-Arteries | Drug metabolism other enzymes | ||||
|
| ||||||||||
| 12p11 | rs3816834 | C/A | 2.9e-08 | ITPR2 | Long term depression | Amyotrophic lateral Sclerosis, arrhythmia, rheumatism, Alzheimer's disease, hypertension, liver cancer, hepatitis b and pancreatitis | ||||
| 11q22.1 | rs1943760 | C/A | 4.0e-09 | PGR | Immune response MIF-JAB1 signalling, Oocyte meiosis | Thyroiditis, breast carcinoma, tumors, carcinoma ductal and Breast cancer | ||||
| 21q22.3 | rs2839439 | C/A | 9.6e-09 | C21orf121 | Metabolic | Choroiditis and Down Syndrome | ||||
| 2p21 | rs4588165 | A/C | 1.5e-09 | CRIM1 | MAPK signalling pathway | Neuronitis, ataxia and macular degeneration | ||||
| 6q22.2 | rs2049923 | C/A | 9.3e-10 | MARCKS | Fc gamma R-mediated phagocytosis | Hepatitis, malignant syringoma, bipolar disorder, brain disease, Alzheimer's disease, asthma, and colorectal cancer | ||||
| 7q21 | rs4730838 | C/A | 2.5e-10 | MAGI2 | Tight junction | Ulcerative and Colitis-and-Crohn's-Disease | ||||
| 6q25.1 | rs9384458 | A/C | 2.9e-08 | ARID1B | Drug metabolism and other enzymes | Coffin-Siris and Syndrome | ||||
| 12p13.3 | rs11062658 | C/A | 1.6e-09 | PRMT8 | Drug metabolism other enzymes | Malaria, peripheral primitive neuroectodermal tumor, primitive neuroectodermal tumor, and neuroectodermal tumors | ||||
| 6q23.2 | rs9478984 | A/C | 3.5e-08 | RPS12 | Ribosome | Malaria, Carcinoma and Tuberculosis | ||||
| 3q21.2 | rs1373606 | A/C | 2.9e-08 | KALRN | Drug metabolism and other enzymes | Neuronitis Human Immunodeficiency Virus Infectious disease | ||||
| 4q34.3 | rs1567475 | C/A | 1.3e-08 | AGA | Glycan degradation | Influenza and Aspartylglucosaminuria | ||||
| 2q21.2 | rs1561019 | C/A | 9.6e-09 | 0.23 | 0.04 | LRP1B | Metabolic | Cholesterol Thyroiditis | ||
| 9q31.2 | rs7039618 | C/A | 9.6e-09 | 0.23 | 0.04 | TMEM38B | Metabolic | Cleft Lip | ||
| 7q31.1 | rs2037048 | C/A | 2.9e-08 | 0.23 | 0.04 | C7orf66 | - | - | ||
| 14q21 | rs2054492 | C/A | 5.2e-08 | 0.23 | 0.04 | PELI2 | Metabolic | Ataxia | ||
| 5q11.2 | rs1075420 | C/A | 9.6e-09 | 0.23 | 0.04 | MAP3K1 | GnRH signaling | Breast Cancer | ||
| 12q24.32 | rs10773557 | A/C | 9.6e-09 | 0.23 | 0.04 | TMEM132C | - | - | ||
|
| ||||||||||
| 3p11 | rs4858960 | A/C | 7.6e-09 | 0.25 | 0.04 | POU1F1 | Metabolic | Combined Pituitary Hormone deficiency, growth hormone deficiency | ||
|
| ||||||||||
| 14q21.1 | rs10148725 | C/A | 1.8e-08 | 0.24 | 0.04 | FBXO33 | Metabolic | Osteoporosis | ||
| 3p11 | rs4858960 | A/C | 5.3e-10 | 0.24 | 0.04 | POU1F1 | Metabolic | Combined Pituitary Hormone deficiency, growth hormone deficiency | ||
| 1q25 | rs234654 | C/A | 1.8e-08 | 0.24 | 0.04 | FAM129A | Metabolic | Carcinoma | ||
| 3q28 | rs260559 | A/C | 1.8e-08 | 0.24 | 0.04 | TPRG1 | Metabolic | Parkinson's Disease | ||
| 14q13.2 | rs10132268 | C/A | 1.8e-08 | 0.24 | 0.04 | INSM2 | Metabolic | Insulinoma | ||
|
| ||||||||||
| 3p11 | rs4858960 | A/C | 9.5e-10 | 0.25 | 0.04 | POU1F1 | Metabolic | Combined Pituitary Hormone deficiency, growth hormone deficiency | ||
| 3q13.32 | rs1521293 | C/A | 3.2e-08 | 0.25 | 0.05 | IGSF11 | Metabolic | Carcinoma | ||
We obtained the associated disease genes using the MalaCards Database, an integrate compendium for diseases and their annotations [44, 53].
Four regions showing excess of YRI ancestry and three regions of reduced CEU and KHS ancestry in ‡Khomani.
|
|
|
|
|
|
|
|
|
|
| 12q24.1 | 4 | 112,842,994–112,856,642 | 13,649 | YRI | 4.2e-9 | RPL6,RPL11,RPS3,RPS15A,RPL4,RPL7 | Ribosome | Malaria, Noonan syndrome, t-cell leukemia, colorectal cancer, gastric cancer,carcinoma, lupus erythematosus |
| 13q14.3 | 2 | 58,513,521–58,515,045 | 1,525 | YRI | 3.1e-08 | IHNRNPA1,HNRNPA1L2, | Spliceosome | Malaria |
| 18p11.23 | 12 | 6,729,821–6,915,715 | 185,895 | YRI | 2.2e-08 | LAMA1,ARHGAP28,ITGA1,ITGA2,C3,LAMB1,PLAT | Signalling by Rho GTPases,Signal Transduction,Pathways in cancer,Focal adhesion | Meningioma, Lung cancer, Congenital muscular dystrophy, Neuromuscula, Alzheimer's and Hirschsprung's diseases |
| 18p11.31 | 18 | 5,954,705–6,414,910 | 460,206 | YRI | 2.9e-08 | L3MBTL4,LOC100130480 | - | Parkinson's disease, Breast cancer |
|
|
|
|
|
|
|
|
|
|
| 12p13.31 | 1 | 7,864,050–7,870,155 | 6,106 | KHS | 3.7e-08 | DPPA3,IPO5 | - | Seminoma, Testicular germ cell tumor, Teratocarcinoma, Germ cell tumor,Carcinoma |
| 14q13.2 | 14 | 36,007,558–36,278,510 | 270,953 | KHS,CEU | 3.4.e-09 | GARNL1,BRMS1L,RALGAPA1,NFKBIA,INSM2,NFKB1,PTCSC3,KIAA0391,RELA,CHUK,IKBKB,IKBKG,TCF3,MYC,ZSCAN1,SAP30,BRMS1,RBBP4,RBBP7,ING2 | Toll-Like Receptors Pathway | Tuberous sclerosis, Prader-Willi syndrome, Breast cancer, Lung cancer, Tumors inflammation, Leukemia T-cell, Diabetes mellitus, Parkinson's disease |
| Molecular Mechanisms of Cancer | ||||||||
| NF-kappa B Activation by Viruses, Itk and Tcr Signalling | ||||||||
| 14q13.3 | 7 | 36,985,602–36,990,354 | 4,753 | KHS,CEU | 1.1e-08 | NKX2–1,NKX2–8,PTCSC3,SFTA3,CCDC59,NCK1,MAPK1,NCOA2,RARA | Cell adhesion Tight junctions | Chorea benign Hereditary, Hepatocellular carcinoma, Lung cancer, Adenocarcinoma lung |
We obtained the associated disease genes using the MalaCards database [44, 53] (Materials and Methods).
Three regions showing excess of YRI ancestry and other three showing deficiency of CEU and KHS ancestry in Sandawe.
|
|
|
|
|
|
|
|
|
|
| 12q24.1 | 4 | 112,842,994–112,856,642 | 13,649 | YRI | 1.4e-10 | RPL6,RPL11,RPS3,RPS15A,RPL4,RPL7 | Ribosome | Malaria, Noonan syndrome, t-cell leukemia, colorectal cancer, gastric cancer, carcinoma, lupus erythematosus |
| 18p11.31 | 30 | 5,954,705–6,414,910 | 460,206 | YRI | 1.9e-13 | LOC645355,L3MBTL4,MIR3976,ARHGAP28,LOC100130480 | Signalling by Rho GTPases, Signal Transduction | Benign meningioma, Meningioma, Parkinson's disease, Hamartoma, Retinitis, Acute myeloid leukemia |
| 18p11.23 | 13 | 6,941,743–7,117,813 | 176,071 | YRI | 2.0e-12 | LAMA1 | Cell adhesion Endothelial cell contacts by non-junctional mechanisms and Cytoskeleton remodelling Integrin outside-in signalling | Muscular dystrophy, Myopia, Choriocarcinoma, Congenital muscular dystrophy, Alport syndrome, Hirschsprung and Alzheimer's disease |
|
|
|
|
|
|
|
|
|
|
| 12p13.31 | 1 | 7,864,050–7,870,155 | 6,106 | KHS,CEU | 3.3e-08 | DPPA3, IPO5 | - | Seminoma, Testicular germ cell tumor, Teratocarcinoma, Germ cell tumor, Carcinoma |
| 14q13.2 | 11 | 36,007,558–36,278,510 | 270,953 | KHS,CEU | 1.4.e-10 | GARNL1,BRMS1L,RALGAPA1,NFKBIA,INSM2,NFKB1,PTCSC3,KIAA0391,RELA,CHUK,IKBKB,IKBKG,TCF3,MYC,ZSCAN1,SAP30,BRMS1,RBBP4,RBBP7,ING2 | Toll-Like Receptors Pathway | Tuberous sclerosis, Prader-Willi syndrome, Breast cancer, Lung cancer, Tumours inflammation, Leukemia T-cell, Diabetes mellitus, Parkinson's disease |
| Molecular Mechanisms of Cancer | ||||||||
| NF-kappa B Activation by Viruses | ||||||||
| ITK and TCR Signalling | ||||||||
| RANK Pathway | ||||||||
| 14q13.3 | 5 | 36,985,602–36,990,354 | 4,753 | KHS,CEU | 2.3e-08 | NKX2–1,NKX2–8,PTCSC3,SFTA3,CCDC59,NCK1,MAPK1,NCOA2,RARA | Cell adhesion Tight junctions | Chorea (Benign Hereditary), Hepatocellular carcinoma, Lung cancer, Adenocarcinoma lung |
We obtained the associated disease genes using the MalaCards Database [44, 53] (Materials and Methods).
Number of known copy number polymorphisms (of a total of 1130 autosomal CNPs) that are polymorphic in each analysis panel.
| Number of polymorphic CNPs | Proportion of CNPs polymorphic | |
|---|---|---|
|
| 577 | 0.63 |
|
| 837 | 0.92 |
|
| 486 | 0.50 |
|
| 636 | 0.67 |
|
| 561 | 0.57 |
|
| 482 | 0.49 |
|
| 338 | 0.38 |
Copy number variants shared among study populations and with previously reported structural variants.
| CNVs | Novel CNVs | ||
|---|---|---|---|
| HapMap & Southern African | 279 | 39 | |
| African only | 315 | 51 | |
| Southern African only | 323 | 51 | |
| Southern African Bantu | 429 | 61 | |
|
| 1873 | 397 | |
|
| |||
| CEU | 210 | 65 | |
| YRI | 145 | 40 | |
| STS | 37 | 11 | |
| XHS | 69 | 14 | |
| ZUL | 32 | 8 | |
| HER | 28 | 11 | |
| KHS | 47 | 8 | |
*compared to Database of Genomic Variants
Number of singleton copy number variants (CNVs) in each population.
| Population | Singletons | Deletions | Duplications |
|---|---|---|---|
| CEU | 63 | 63 | 0 |
| YRI | 24 | 0 | 24 |
| STS | 8 | 8 | 0 |
| XHS | 23 | 14 | 9 |
| ZUL | 12 | 2 | 10 |
| HER | 2 | 0 | 2 |
| KHS | 5 | 0 | 5 |
|
|
|
|