| Literature DB >> 26484159 |
Sumi Elsa John1, Gaurav Thareja1, Prashantha Hebbar1, Kazem Behbehani1, Thangavel Alphonse Thanaraj1, Osama Alsmadi1.
Abstract
Kuwaiti native population comprises three distinct genetic subgroups of Persian, "city-dwelling" Saudi Arabian tribe, and nomadic "tent-dwelling" Bedouin ancestry. Bedouin subgroup is characterized by presence of 17% African ancestry; it owes it origin to nomadic tribes of the deserts of Arabian Peninsula and North Africa. By sequencing whole genome of a Kuwaiti male from this subgroup at 41X coverage, we report 3,752,878 SNPs, 411,839 indels, and 8451 structural variations. Neighbor-joining tree, based on shared variant positions carrying disease-risk alleles between the Bedouin and other continental genomes, places Bedouin genome at the nexus of African, Asian, and European genomes in concordance with geographical location of Kuwait and Peninsula. In congruence with participant's medical history for morbid obesity and bronchial asthma, risk alleles are seen at deleterious SNPs associated with obesity and asthma. Many of the observed deleterious 'novel' variants lie in genes associated with autosomal recessive disorders characteristic of the region.Entities:
Keywords: Arabian Peninsula; Intergenome distances; Kuwaiti population; Nomadic Bedouin ancestry; Whole genome sequence; “Tent-dwelling” Bedouins
Year: 2014 PMID: 26484159 PMCID: PMC4535864 DOI: 10.1016/j.gdata.2014.11.016
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Fig. 1Scatter plot representing the first two principal components of merged data sets of the three Kuwaiti subgroups. The nomadic Bedouin sample considered for whole genome sequencing in this study is color-coded.
Deleterious SNPs annotated for association with diseases in OMIM database and/or in GWAS Catalog.
| SNPs | Geno-type | Strongest SNP-risk allele (GWAS Catalog, ClinVar, OMIM) | Mapped gene (OMIM ID) | Disease/trait; (phenotype MIM #); [inheritance] |
|---|---|---|---|---|
| rs2297950 | het | T | CHIT1(600031) | Chitotriosidase deficiency; (#614122 ); [?] |
| rs1056827 | hom | ? | CYP1B1(601771) | Glaucoma 3A, primary open angle, congenital, juvenile, or adult onset; (#231300,#604229); [AR] |
| rs34231037 | het | G | KDR(191306) | Hemangioma, capillary infantile, susceptibility to; (#602089 ); [AD] |
| rs1573496 | het | ? | ADH7(103720) | Aerodigestive tract cancer, squamous cell, alcohol-related, protection against; (#103780); [MF] |
| rs1801394 | het | G | MTRR(602568) | Neural tube defects, folate-sensitive, susceptibility todown syndrome, susceptibility to, included; (#601634); [AR] |
| rs351855 | hom | A | FGFR4(134935) | Cancer progression and tumor cell motility; (no OMIM Id); [?] |
| rs3807153 | het | G | ATP6V0A4(605239) | Renal tubular acidosis, distal; (#602722); [AR] |
| rs1801968 | het | G | TOR1A(605204) | Dystonia-1, modifier of Dystonia-1, torsion; (#128100); [AD] |
| rs1800450 | het | T | MBL2(154545) | Chronic infections, due to MBL deficiency; (#614372); [?] |
| rs3135506 | het | C | APOA5(606368) | Hypertriglyceridemia, susceptibility to; (#145750); [AD} |
| rs7308720 | het | G | LRRK2(609007) | Parkinson disease 8; (#607060); [AD] |
| rs2232387 | het | T | KRT75(609025) | Pseudofolliculitis barbae, susceptibility to; (#612318); [?] |
| rs10151259 | het | T | RPGRIP1(605446) | Cone-rod dystrophy 13; (#608194); [AR] |
| rs3743930 | het | G | MEFV(608107) | Familial Mediterranean fever, AD, familial Mediterranean fever, AR; (#134610,#249100); [AD; AR] |
| rs4673 | het | G | CYBA(608508) | Chronic granulomatous disease, (#233690); [AR] |
| rs6504649 | het | G | XYLT2(608125) | Pseudoxanthoma elasticum, modifier of severity of; (#264800); [AR] |
| rs1545 | het | ? | MKKS(605552) | Abdominal obesity—metabolic syndrome; (%605552); [?] |
| rs1801265 | hom | A | DPYD(612779) | Dihydropyrimidine dehydrogenase deficiency; |
| rs486907 | hom | T | RNASEL(180435) | Prostate cancer 1; (#601518); [AD] |
| rs2286963 | het | G | ACADL | Metabolite levels; [?] |
| rs6180 | het | C | GHR(600946) | Hypercholesterolemia, familial, modification of; (#143890); [AD] |
| rs1051931 | hom | G | PLA2G7(601690) | Asthma, susceptibility to, Atopy;(#600807, #147050); [AD, MF] |
| rs7133914 | het | A | LRRK2(609007) | Parkinson disease 8; (#607060); [AD] |
| rs10246939 | hom | C | TAS2R38(607751) | Phenylthiocarbamide tasting; (#171200); [AD] |
| rs61751507 | Het | T | CPN1(603103) | Carboxypeptidase N deficiency; (#212070); [AR] |
| rs3827103 | Het | ? | MC3R(155540) | Mycobacterium tuberculosis, protection against; (%612929); [?] |
| rs3811444 | hom | A | CERS2 | Platelet counts, red blood cell traits; [?] |
| rs676210 | het | G | APOB | LDL (oxidized), lipid metabolism phenotypes; [?] |
| rs6756629 | het | G | ABCG5 | Cholesterol total, LDL cholesterol (protective effect?); [?] |
| rs2043112 | het | A | RICTOR | Obesity-related traits; [?] |
| rs240768 | het | T | ASCC3 | Economic and political preferences (immigration/crime); [?] |
| rs11042023 | hom | C | RPL27A | Obesity; [?] |
| rs11820589 | het | A | BUD13 | Metabolic syndrome (bivariate traits); [?] |
| rs3213764 | het | G | ATF7IP | Prostate-specific antigen levels; [?] |
| rs2297067 | het | T | EXOC3L4 | Platelet counts; [?] |
| rs2303759 | het | C | DKKL1 | Multiple sclerosis; [?] |
| rs267738 | het | A | CERS2 | Rhegmatogenous retinal detachment; [?] |
| rs1919128 | het | A | C2orf16 | Waist circumference—triglycerides (WC-TG); [?] |
| rs2275848 | hom | T | NINJ1 | Obesity (early onset extreme); [?] |
| rs874628 | het | A | MPV17L2 | Multiple sclerosis; [?] |
| rs2239785 | het | G | APOL1 | Glomerulosclerosis; [?] |
| rs11887534 | het | C | ABCG89(605460) | Gallstones, gallbladder disease 4; (#611465); [?] |
| rs2227564 | hom | C | ABCG8(191840, 605526) | Inflammatory bowel disease; Alzheimer disease, late-onset, susceptibility to;(#104300); [AD] |
| rs1799853 | het | ? | CYP2C9(601130) | Warfarin maintenance dose, warfarin sensitivity; (#122700); [AD] |
| rs4149056 | het | T,C | SLCO1B1(604843) | Sex hormone-binding globulin levels, response to statin therapy; rotor type hyperbilirubinemia,; |
| rs1801272 | het | T | CYP2A6(122720) | Smoking behavior, coumarin resistance (#122700); [AD] |
| rs1799990 | het | A | PRNP(176640) | Prion diseases, Creutzfeldt–Jakob disease (#606688); [AD] |
| rs738409 | het | G | PNPLA3(609567) | Nonalcoholic fatty liver disease; (%613282); [MF] |
Abbreviations:
Variants for which the associated phenotype traits are seen with the participant (or his family) that provided sample for genome sequencing.
The alternate allele seen in the Bedouin genome corresponds to the risk allele.
AD: Autosomal dominant; AR: Autosomal recessive; MF: Multi-factorial; DR: digenic recessive;?: not known or multi-factorial.
Fig. 2Neighbor-joining tree based on intergenome distances calculated using genome-wide variant positions shared between the KWB genome, KWS genomes, and representative genomes from intercontinental populations.
Fig. 3Neighbor-joining tree based on intergenome distances calculated using variant positions associated with OMIM disease genes and shared between the KWB genome, KWS genomes, and representative genomes from intercontinental populations.
Fig. 4Summary of analysis of genomes from Kuwait subgroup of Bedouin ancestry. Tracks (from outer to inner): karyotype of human genome; density (in every window of 1 Mb) of ‘known’ SNPs (i.e. annotated in dbSNP 138); density of ‘novel’ SNPs (i.e. not annotated in dbSNP138); Density of ‘known’ indels; density of ‘novel’ indels; density of long deletions; density of long insertions; density of inversions; density of duplications; links representing intra- and interchromosomal translocations.