| Literature DB >> 35945619 |
Gerald van Eeden1, Caitlin Uren1,2, Evlyn Pless3, Mira Mastoras3, Gian D van der Spuy1,2,4, Gerard Tromp1,2,4, Brenna M Henn3, Marlo Möller5,6.
Abstract
BACKGROUND: Recombination maps are important resources for epidemiological and evolutionary analyses; however, there are currently no recombination maps representing any African population outside of those with West African ancestry. We infer the demographic history for the Nama, an indigenous Khoe-San population of southern Africa, and derive a novel, population-specific recombination map from the whole genome sequencing of 54 Nama individuals. We hypothesise that there are no publicly available recombination maps representative of the Nama, considering the deep population divergence and subsequent isolation of the Khoe-San from other African groups.Entities:
Keywords: Genetic map; Khoe-San; Recombination map; Recombination rate; Selection scan
Mesh:
Year: 2022 PMID: 35945619 PMCID: PMC9361568 DOI: 10.1186/s13059-022-02744-5
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 17.906
Fig. 1A The inferred effective population size history for the Nama plotted on a log10 scale with SMC++ results on the right and AS-IBDNe results for the Nama on the left. B–D The AS-IBDNe results for the LWK (B), GBR (C) and Nama (D) ancestral components in the Nama
Fig. 2Heatmap indicating the Spearman rank correlation between the genetic maps of 27 populations, including the Nama, at a 2-kilobase resolution. The colour of the population labels represent distinct super-population groups, with the Nama highlighted in red. There is clear clustering according to super-population groups and the Nama recombination map correlates the best with other African populations
Fig. 3The recombination rate of the combined Phase II HapMap recombination map and the inferred recombination map for the Nama plotted over a segment of chromosome 1. There is a high degree of overlap between the maps across this region, but there are positions with recombination hotspots indicated by the Nama map that are not indicated by the combined Phase II HapMap map, e.g. at 24.2 Mb
Fig. 4Venn diagram of the candidate genes found using the 1.0% highest selection scan results (absolute value iHS) for the selection scan using the combined Phase II HapMap map (white) and the selection scan using the Nama map (grey)
Fig. 5A brief overview of the methods used in effective population size inference and the subsequent recombination rate inference
Fig. 6An overview of the AS-IBDNe pipeline. Input SNP array data in plink binary format is split by chromosomes using plink v1.9. Each chromosome is then phased and converted to vcf format by SHAPEIT2. IBD segments are next inferred using RefinedIBD, and merge-ibd-segments is used to remove gaps between them. Meanwhile, RFMix2.0 is run to estimate the ancestry of differently sized genomic segments. Finally, RFMix-produced ancestries are assigned to each IBD segment, and IBDNe is run to produce ancestry-specific effective population size estimates