| Literature DB >> 19629189 |
Pascale Gerbault1, Céline Moret, Mathias Currat, Alicia Sanchez-Mazas.
Abstract
BACKGROUND: The lactase enzyme allows lactose digestion in fresh milk. Its activity strongly decreases after the weaning phase in most humans, but persists at a high frequency in Europe and some nomadic populations. Two hypotheses are usually proposed to explain the particular distribution of the lactase persistence phenotype. The gene-culture coevolution hypothesis supposes a nutritional advantage of lactose digestion in pastoral populations. The calcium assimilation hypothesis suggests that carriers of the lactase persistence allele(s) (LCT*P) are favoured in high-latitude regions, where sunshine is insufficient to allow accurate vitamin-D synthesis. In this work, we test the validity of these two hypotheses on a large worldwide dataset of lactase persistence frequencies by using several complementary approaches.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19629189 PMCID: PMC2711333 DOI: 10.1371/journal.pone.0006369
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Correlation coefficients between genetics and geography.
| Region | n | rgen,geo | P val. | rLCT*P-lat | P val. | rLCT*P-long | P val. |
| World Scale | 115 | 0.050 | 0.0400 | 0.274 | <0.0001 | −0.080 | 0.003 |
| Europe | 33 | 0.260 | 0.0001 | 0.400 | 0.0005 | −0.010 | 0.568 |
| Europe (indo-europeans only) | 26 | 0.420 | 0.0000 | 0.840 | <0.0001 | −0.310 | 0.006 |
| Africa | 38 | 0.016 | 0.1700 | 0.329 | 0.0010 | 0.134 | 0.048 |
| Near East &East Asia | 34 | 0.016 | 0.3600 | −0.041 | 0.2560 | −0.046 | 0.230 |
n: number of populations; r gen,geo: correlation coefficient between genetic and geographic distances; r LCT*P-lat: correlation coefficient between LCT*P frequency and latitude; rLCT*P-long:correlation coefficient between LCT*P frequency and longitude; P.val.: P-value for significance. Population used are listed in Table S1.
Comparison of LCT*P with other genetic polymorphisms (Europe).
| Locus | n | rgen,geo | P val. | rloc-LCT*P | P val. |
| HLA-A | 22 | 0.528 | <0.0001 | 0.377 | 0.0003 |
| HLA-B | 22 | 0.440 | 0.0003 | 0.476 | <0.0001 |
| HLA-DRB1 | 14 | 0.460 | 0.003 | 0.620 | <0.0001 |
| RH | 17 | 0.420 | 0.001 | 0.200 | 0.014 |
n: number of populations; r gen,geo: correlation coefficient between genetic and geographic distances; r loc-LCT*P: correlation coefficient between each locus and LCT*P genetic distances; P.val.: P-value for significance.
Figure 1Map location of European and Near-Eastern populations used for the computer simulation.
Dots locate the populations used for computer simulation (Note that the Iran sample should be located farther East, outside of the map). When more than one population from the same country were used, the name of the corresponding city is given (see Table 3). Below each population name, the approximate date of domestication and its standard error (years BP) are mentioned. When simulating the Demic Diffusion hypothesis (DD), dotted arrows show the simulated relations between population source and "new founded" populations (regarding initial frequency of LCT*P, see Material and Methods and Supplementary Information). To facilitate reading, the dotted area regroups populations for which we use Greece as the source.
Parameters for simulated populations.
| N | Population (location) | Lat | Long | Lactase persistence frequency | Simulated number of generation | L values |
| 105 | Iranian | 32.0 | 53.0 | 0.171 | 370 | 0.000 |
| 225 | Lebanese | 33.8 | 35.8 | 0.111 | 370 | 0.075 |
| 75 | Syrian | 35.0 | 38.0 | 0.046 | 370 | 0.126 |
| 67 | Cypriot | 35.0 | 33.0 | 0.152 | 355 | 0.126 |
| 100 | Sicilian | 37.5 | 14.0 | 0.157 | 278 | 0.230 |
| 200 | Greek | 39.0 | 22.0 | 0.134 | 315 | 0.293 |
| 119 | Spanish (Valencia) | 39.5 | −0.4 | 0.471 | 281 | 0.314 |
| 100 | Sardinian (Sassari) | 40.7 | 8.6 | 0.073 | 280 | 0.364 |
| 178 | Italian (Napoli) | 40.8 | 14.3 | 0.082 | 289 | 0.368 |
| 839 | Italian (Roma) | 41.9 | 12.5 | 0.171 | 285 | 0.414 |
| 338 | Spanish (Santiago de Compostela) | 42.9 | −8.6 | 0.417 | 282 | 0.456 |
| 55 | French (Nice) | 43.7 | 7.3 | 0.352 | 270 | 0.490 |
| 208 | Italian (Brescia) | 45.6 | 10.2 | 0.286 | 275 | 0.569 |
| 153 | Slovenian (Ljubljana) | 46.1 | 14.5 | 0.490 | 275 | 0.590 |
| 51 | Swiss (Geneva) | 46.2 | 6.2 | 0.423 | 274 | 0.594 |
| 102 | French (Nantes) | 47.2 | −1.6 | 0.516 | 243 | 0.636 |
| 528 | Austrian (Innsbruck) | 47.3 | 11.4 | 0.551 | 262 | 0.640 |
| 221 | German (Munchen) | 48.1 | 11.6 | 0.632 | 273 | 0.674 |
| 136 | German (Stuttgart) | 48.4 | 9.1 | 0.515 | 264 | 0.686 |
| 200 | Czech (Plzen) | 49.8 | 13.4 | 0.646 | 270 | 0.745 |
| 275 | Polish | 52.0 | 20.0 | 0.388 | 276 | 0.837 |
| 246 | German (Berlin) | 52.5 | 13.4 | 0.527 | 265 | 0.858 |
| 162 | English (Birmingham) | 52.5 | −1.9 | 0.776 | 224 | 0.858 |
| 441 | German (Bremen) | 53.1 | 8.8 | 0.714 | 257 | 0.883 |
| 50 | Irish (Dublin) | 53.3 | −6.3 | 0.800 | 212 | 0.891 |
| 761 | Danish (Copenhagen) | 55.9 | 12.4 | 0.827 | 204 | 1.000 |
n: sample size; Lat: latitude; Long: longitude; L values: coefficient used to increase the strength of selection with latitude as follow s = sL where s is the simulated selection coefficient in one given population and s the coefficient of selection tested overall (see Supplementary Information for further details).
Figure 2Selection coefficients required to fit the observed estimates of lactase persistence frequencies.
Each graph corresponds to one of the four scenario simulated: DD/gcc; DD/cal; CD/gcc; CD/cal (see Material and Methods). Bars represent the 95% CI of the selection coefficient estimated for the population and the central point is the MLE (Maximum Likelihood Estimate). Populations are ordered from the highest (right) to the lowest (left) latitude: Danish (Dan), Irish (Iri), German from Bremen (Bre), German from Berlin (Ber), English (Eng), Polish (Pol), Czech (Cze), German from Stuttgart (Stu), German from Munchen (Mun), Austrian (Aus), French from Nantes (Nan), Swiss (Swi), Slovenian (Slo), Italian from Brescia (Bre), French from Nice (Nic), Spanish from Santiago de Compostela (Com), Italian from Roma (Rom), Italian from Napoli (Nap), Sardinian (Sas), Spanish from Valencia (Val), Greek (Gre), Sicilian (Sic), Cypriot (Cyp), Lebanese (Leb) and Iranian (Ira).