| Literature DB >> 27853181 |
Charlotte E Inchley1, Cynthia D A Larbey1, Nzar A A Shwan2,3, Luca Pagani1,4, Lauri Saag4, Tiago Antão5, Guy Jacobs6, Georgi Hudjashov4,7, Ene Metspalu4, Mario Mitt8,9, Christina A Eichstaedt1,10, Boris Malyarchuk11, Miroslava Derenko11, Joseph Wee12, Syafiq Abdullah13, François-Xavier Ricaut14, Maru Mormina15, Reedik Mägi8, Richard Villems4,16,17, Mait Metspalu4, Martin K Jones1, John A L Armour2, Toomas Kivisild1,4.
Abstract
Humans have more copies of amylase genes than other primates. It is still poorly understood, however, when the copy number expansion occurred and whether its spread was enhanced by selection. Here we assess amylase copy numbers in a global sample of 480 high coverage genomes and find that regions flanking the amylase locus show notable depression of genetic diversity both in African and non-African populations. Analysis of genetic variation in these regions supports the model of an early selective sweep in the human lineage after the split of humans from Neanderthals which led to the fixation of multiple copies of AMY1 in place of a single copy. We find evidence of multiple secondary losses of copy number with the highest frequency (52%) of a deletion of AMY2A and associated low copy number of AMY1 in Northeast Siberian populations whose diet has been low in starch content.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27853181 PMCID: PMC5112570 DOI: 10.1038/srep37198
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Distribution of estimates of maximum pairwise Fst per 200 kb non-overlapping segment among 12 human population groups.
‘AMY locus’ refers to data from three neighbouring 200 kb segments from the Chr1:103,800,000-104,400,000 region. The distribution of the maximum FST scores of the 200 kb regions is shown by bins of 0.05. The only 5% significant (red dotted line) FST estimate for the AMY locus comes from the African and Northeast Siberian comparison at the Chr1:104,000,000-104,200,000 segment.
Genome-wide significance of the window containing AMY genes by three selection tests in 12 regional population groups.
| AMY WINDOW (Chr1 104-104.2 Mb) | iHS | nSL | TD |
|---|---|---|---|
| Africa | 0.99 | 0.97 | |
| Middle East | 0.12 | ||
| South & West Europe | 0.08 | 0.25 | 0.07 |
| East & North Europe | 0.93 | 0.06 | 0.08 |
| Volga Uralic | 0.19 | 0.28 | 0.14 |
| South Asia | 0.16 | 0.23 | |
| West Siberia | 0.88 | 0.41 | 0.24 |
| South Siberia & Mongolia | 0.21 | 0.10 | 0.11 |
| Central Siberia | 0.12 | 0.19 | |
| Northeast Siberia | 0.38 | 0.36 | 0.06 |
| Southeast Asia Mainland | 0.91 | 0.19 | 0.08 |
| Island Southeast Asia | 0.06 | 0.18 |
*Indicates cases where the AMY genes containing window had the relevant selection test score ranking in the top 5% of the windows (p < 0.05), bold
**indicates cases where the AMY windows was within the top 1% of windows for the given positive selection test (p < 0.01).
Figure 2Distribution of genetic diversity on chromosome 1 in human populations.
Presented on the y axis is the sum of derived allele frequency over all polymorphic loci in 1000 Genomes African data, estimated by 50,000 bp non-overlapping segments of chromosome 1, relative to the divergence of the human reference sequence from the ancestral sequence (determined by the 6 primate sequence consensus). Red dotted line indicates the 1% cut-off considering the empirical genome-wide distribution. Only those 50,000 bp segments which had >90% sites covered in human, Altai Neanderthal and Denisovan data were considered.
Figure 3Phylogenetic tree and Bayesian Skyline Plot based on the analyses of a ~66 kb region downstream of the AMY locus with BEAST.
The analyses were restricted to bi-allelic SNP variants from a ~66 kb long high LD region (Chr1:104,303,310-104,369,301) which is downstream of (centromeric to) AMY1C gene (Figure S1). Coalescent time estimates are shown near branching points and assume mutation rate of 5.5 × 10−10 per bp per year34. BSP–Bayesian Skyline Plot; thick brown line shows the median estimates of human effective population size (Ne) x generation time (y-axis) over time (x-axis) as estimated from the ~66 kb region data in the global sample of 480 individuals. The grey shaded area around the brown line shows the 95% higher posterior density intervals of the BSP estimates. A dotted blue arrow highlights the branch of the human phylogeny that has been the likely source of an introgression of a chunk of ~16 kb (Chr1:104,350,432-104,366,871) into the genome of the Altai Neanderthal.
Geographic distribution of haplotypes in the Chr1:104,303 K-104,369 K region.
| N individuals | Average | Average diploid copy number (σ2) | Haplotype distribution (Heterozygote counting as 1, homozygote as 2 copies) | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| |latitude| | A | B | C | D | E | F | G | H | I | J | K | L | M | |||||
| Africa | 33 | 3.6 | 6.4 (3.4) | 2.3 (0.2) | 2.1 (0.1) | 5 | 10 | 4 | 3 | 4 | 8 | 4 | 3 | 12 | 11 | 2 | ||
| West Asia/Caucasus | 59 | 39.2 | 7.2 (6.2) | 2.2 (0.5) | 2.3 (0.3) | 15 | 25 | 13 | 3 | 10 | 6 | 25 | 21 | |||||
| Southwest Europe | 23 | 47.3 | 6.5 (3.9) | 1.8 (0.2) | 2 (0) | 6 | 5 | 13 | 4 | 6 | 1 | 11 | ||||||
| Northeast Europe | 80 | 57.3 | 6.5 (6.5) | 1.8 (0.3) | 2.1 (0) | 25 | 40 | 36 | 8 | 9 | 5 | 11 | 4 | 22 | ||||
| Volga-Ural region | 25 | 54.9 | 7.4 (5.8) | 1.7 (0.6) | 2 (0) | 10 | 10 | 11 | 3 | 3 | 3 | 10 | ||||||
| South Asia | 29 | 23 | 7.2 (3.3) | 2.2 (0.3) | 2.1 (0.2) | 8 | 16 | 20 | 1 | 2 | 11 | |||||||
| Central Asia | 24 | 40.9 | 7.6 (6.5) | 2.2 (0.5) | 2.1 (0.2) | 9 | 7 | 12 | 2 | 5 | 1 | 12 | ||||||
| West Siberia | 18 | 63.7 | 5.9 (5.9) | 1.4 (0.6) | 2 (0) | 8 | 6 | 9 | 2 | 1 | 2 | 8 | ||||||
| South Siberia | 34 | 52.8 | 7.6 (4.2) | 1.9 (0.3) | 2 (0) | 16 | 20 | 15 | 1 | 2 | 14 | |||||||
| Central Siberia | 27 | 64.1 | 6.5 (13) | 2.1 (0.3) | 2 (0) | 11 | 9 | 16 | 2 | 16 | ||||||||
| Northeast Siberia | 23 | 62.6 | 6.2 (14.8) | 1 (0.6) | 2 (0) | 1 | 7 | 9 | 29 | |||||||||
| South America | 28 | 24.2 | 9.3 (9.3) | 1.9 (0.1) | 2 (0) | 10 | 31 | 15 | ||||||||||
| East Asia | 26 | 22.9 | 7.7 (4.3) | 2 (0) | 2 (0) | 12 | 20 | 11 | 9 | |||||||||
| Island Southeast Asia | 45 | 9.4 | 7.1 (2.8) | 2.1 (0.1) | 2 (0) | 19 | 26 | 16 | 1 | 28 | ||||||||
| Papua New Guinea | 6 | 7 | 6.3 (0.3) | 2.5 (0.3) | 2 (0) | 12 | ||||||||||||
| Total | 480 | 39.5 | 7.1 (6.5) | 1.9 (0.4) | 2.1 (0.1) | 5 | 10 | 154 | 225 | 197 | 21 | 33 | 16 | 12 | 55 | 12 | 12 | 208 |
| 0.01 | 0.01 | 0.16 | 0.23 | 0.21 | 0.02 | 0.03 | 0.02 | 0.01 | 0.06 | 0.01 | 0.01 | 0.22 | ||||||
Figure 4Distributions of AMY1 and AMY2A copy number across the major continental groups.
Geographic distribution of the AMY2A deletion.
| N | Haplotype distribution of | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| count | frequency | A | B | C | D | E | F | G | H | I | J | K | L | M | |||
| 1 | Africa | 36 | 0 | 0.00 | |||||||||||||
| 2 | West Asia/Caucasus | 110 | 5 | 0.04 | 1 | 4 | |||||||||||
| 3 | South Asia | 58 | 1 | 0.02 | 1 | ||||||||||||
| 4 | Southwest Europe | 46 | 5 | 0.11 | 1 | 1 | 1 | 2 | |||||||||
| 5 | Northeast Europe | 158 | 16 | 0.10 | 2 | 3 | 11 | ||||||||||
| 6 | Central Asia | 48 | 4 | 0.08 | 1 | 3 | |||||||||||
| 7 | Volga Ural region | 44 | 8 | 0.18 | 1 | 3 | 4 | ||||||||||
| 8 | West Siberia | 36 | 10 | 0.28 | 1 | 2 | 7 | ||||||||||
| 9 | South Siberia | 68 | 6 | 0.09 | 1 | 2 | 3 | ||||||||||
| 10 | Central Siberia | 54 | 3 | 0.06 | 3 | ||||||||||||
| 11 | Northeast Siberia | 46 | 24 | 0.52 | 1 | 23 | |||||||||||
| 12 | South America | 56 | 2 | 0.04 | 2 | ||||||||||||
| 13 | East Asia | 52 | 1 | 0.02 | 1 | ||||||||||||
| 14 | Island Southeast Asia | 90 | 1 | 0.01 | 1 | ||||||||||||
| 15 | Papua New Guinea | 6 | 0 | 0.00 | |||||||||||||
| Deletions | 0 | 0 | 8 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 11 | 64 | ||||
| Total count | 2 | 4 | 151 | 218 | 195 | 20 | 30 | 15 | 4 | 54 | 11 | 11 | 199 | ||||
| Deletion frequency | 0.00 | 0.00 | 0.05 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | 0.32 | ||||
Note: N – number of phased chromosomes with information for AMY2A and AMY2B copy numbers.
Correlations between amylase copy numbers and geographic latitude and longitude.
| Spearman rank-order | Copy number of | ||
|---|---|---|---|
| correlation coefficients rs | |||
| Absolute latitude | −0.19 (p = 2 × 10−5) | −0.33 (p < 10−6) | −0.05 (p = 0.29) |
| Longitude | 0.001 (p = 0.98) | 0.001 (p = 0.98) | −0.098 (p = 0.035) |
| 0.21 (p = 9 × 10−6) | |||
| −0.15 (p = 0.0015) | 0.48 (p < 10−6) | ||
Note significance for the two-tailed Spearman rank-order correlation test is shown in parenthesis.