| Literature DB >> 20465856 |
Angelica B W Boldt1, Iara J Messias-Reason, Diogo Meyer, Carlos G Schrago, Florian Lang, Bertrand Lell, Klaus Dietz, Peter G Kremsner, Maria Luiza Petzl-Erler, Jürgen F J Kun.
Abstract
BACKGROUND: Polymorphisms of the mannose-binding lectin gene (MBL2) affect the concentration and functional efficiency of the protein. We recently used haplotype-specific sequencing to identify 23 MBL2 haplotypes, associated with enhanced susceptibility to several diseases.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20465856 PMCID: PMC2885306 DOI: 10.1186/1471-2156-11-38
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Nucleotide changes and haploypes of MBL2.
| 259 | 273 | 311 | 388 | 396 | 456 | 474 | 477 | 478 | 482 | 487 | 495 | 578 | 598 | 602 | 658 | 659 | 712 | 753 | 788 | 797 | 826 | 925 | 926 | 965 | 1045 | 1052 | 1061 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ( | ( | ( | ( | ( | ( | ( | ( | ( | ( | ( | ||||||||||||||||||
| C | C | G | G | A | G | A | C | G | A | A | AAAGAG | G | C | G | C | C | A | C | T | C | C | C | T | G | C | G | G | |
| . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | T | . | . | |
| . | G | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| . | G | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | T | . | . | |
| . | G | . | . | . | . | . | . | . | . | . | . | . | A | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| . | . | . | . | . | . | . | . | . | . | . | . | . | . | C | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| . | . | C | . | . | . | . | . | . | . | . | . | . | . | C | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| T | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | G | G | . | . | . | . | |
| . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | A | . | |
| . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | C | . | . | . | . | . | . | A | . | |
| . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | A | . | . | . | . | . | . | . | . | . | . | . | . | |
| . | . | . | A | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| . | . | . | A | . | . | . | T | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| . | . | . | . | . | . | . | . | A | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| . | . | . | . | . | . | . | . | . | . | G | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | . | |
| . | . | . | . | . | . | G | . | . | . | G | ------ | . | . | . | . | . | . | T | . | . | T | . | . | . | . | . | . | |
| . | . | . | . | C | . | G | . | . | . | G | ------ | . | . | . | . | . | . | T | . | . | T | . | . | . | . | . | . | |
| . | . | . | . | C | . | G | . | . | . | G | ------ | . | . | . | . | . | . | T | . | . | T | . | . | C | . | . | . | |
| . | . | . | . | C | T | G | . | . | . | G | ------ | . | . | . | . | . | . | T | . | . | T | . | . | . | . | . | . | |
| . | . | . | . | C | . | G | . | . | . | G | ------ | . | . | . | . | T | . | T | . | . | T | . | . | . | . | . | . | |
| . | . | . | . | C | . | G | . | . | . | G | ------ | A | . | . | . | . | . | T | . | . | T | . | . | . | . | . | . | |
| . | . | . | . | C | . | G | . | . | . | G | ------ | . | . | . | . | . | . | T | . | . | T | . | . | . | . | . | A | |
| . | . | . | . | C | . | G | . | . | . | G | ------ | . | . | . | . | . | . | T | . | A | T | . | . | . | . | . | A | |
| . | . | . | . | C | . | G | . | . | G | G | ------ | . | . | . | . | . | . | T | . | A | T | . | . | . | . | . | A | |
| . | . | . | . | C | . | G | . | . | . | G | ------ | . | . | . | . | . | T | T | . | . | T | . | . | . | . | . | A |
The positions corresponding to the SNPs and to the deletion (position of the first deleted nucleotide) are shown in the first row (Reference sequence: Y16577). Haplotype nomenclature was given according to others [17]. The common nomenclature of formerly known SNPs and haplotypes (excluding LYPF and LYQE), is given in parentheses. *1B3, *1C2 and *1F2 were not investigated for MBL concentration, and thus did not receive the designation "l" (for low MBL levels) or "h" (for high levels). For all other haplotypes, MBL levels have been reported by us and by others [2,7,9-13]. In gray: coding region of exon 1. In SNP database: g.259C > T as rs35451939, g.273G > C as rs11003125, g.311G > C as ss107796309, g.388G > A as rs7100749, g.396A > C as rs11003124, g.456G > T as rs35615810, g.474A > G as rs7084554, g.477C > T as ss107796300, g.478G > A as ss107796301, g.482A > G as ss107796302, g.487A > G as rs36014597, g.495delAAAGAG as rs10556764, g.578G > A as rs35236971, g.598C > A as ss107796311, g.602G > C as rs7096206, g.658C > A as ss107796303, g.659C > T as ss107796304, g.712A > T as ss107796305, g.753C > T as rs11003123, g.788T > C as ss107796312, g.797C > A as rs45602536, g.826C > T as rs7095891, g.925C > G as ss107796306, g.926T > G as ss107796307, g.965G > C as ss107796308, g.1045C > T as rs5030737, g.1052G > A as rs1800450, g.1061G > A as rs1800451.
Figure 1Maximum parsimony tree with suggested phylogenetic nomenclature (see text). The recombinant haplotype *1A1.1B2-l (LYPD) was excluded. Bootstrap values are given at nodes of the tree.
MBL2 haplotype frequencies (%) in diverse populations.
| Haplotypes | N = 856 (N = 64) | N = 48 | N = 208 (N = 48) | N = 62 | N = 262 |
|---|---|---|---|---|---|
| 16.8 (9.5) | 27.1 | 4.81 (8.3) | 1.61 | 3.44 | |
| 0 | 0 | 0 | 0 | 0.38 | |
| 5.37 (6.3) | 6.25 | 29.8 (41.7) | 35.5 | 29.39 | |
| 0.12 (0) | 2.08 | 8.17 (10.4) | 9.68 | 6.49 | |
| 0 | 0 | 0 | 0 | 0.38 | |
| 14.6 (18.8) | 14.6 | 21.6 (8.3) * | 16.1 | 20.23 | |
| 0 | 0 | 0.48 | 0 | 0 | |
| 3.09 (0) | 0 | 0 | 0 | 0 | |
| 0.12 (0) | 0 | 0 | 0 | 0 | |
| 2.45 (1.6) | 2.08 | 11.1 (10.4) | 12.9 | 14.89 | |
| 0 | 0 | 0 | 0 | 0 | |
| 0.12 (0) | 0 | 0 | 0 | 0 | |
| 7.01 (3.1) | 6.25 | 0 | 0 | 1.15 | |
| 0.12 (0) | 0 | 0 | 0 | 0.38 | |
| 0.82 (3.1) | 0 | 0 | 0 | 0 | |
| 0.7 (0) | 0 | 0 | 0 | 0 | |
| 0.12 (0) | 0 | 0 | 0 | 0 | |
| 25.6 (40.6) ** | 18.8 | 23.1 (20.9) | 24.2 | 20.99 | |
| 0.23 (0) | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0.48 (0) | 0 | 0.76 | |
| 0.23 (1.6) | 0 | 0 | 0 | 0 | |
| 5.37 (1.6) | 6.25 | 0 | 0 | 0 | |
| 16.7 (12.5) | 14.6 | 0 | 0 | 0.76 | |
| 1.75 (1.6) | 0 | 0.48 (0) | 0 | 0.76 | |
| 0.58 (0) | 0 | 0 | 0 | 0 | |
| 1.17 (0) | 2.08 | 0 | 0 | 0 | |
| 0 | 6.52 | 2.08 | 0.63 | 1.59 (0) | |
| 0 | 0 | 0 | 0 | 0 | |
| 54.9 (31.3) ** | 28.3 | 45.8 | 48.1 | 52.4 (65.4) | |
| 0 | 4.35 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | |
| 14.1 (22.9) | 19.6 | 10.4 | 0 | 0.79 (0) | |
| 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | |
| 14.1 (29.2)&** | 13.0 | 16.7 | 47.5 | 26.2 (23.1) | |
| 0 | 0 | 0 | 0 | 1.59 (0) | |
| 0 | 0 | 0 | 0 | 0 | |
| 2.87 (4.2) | 2.17 | 16.7 | 0 | 15.9 (11.5) | |
| 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | |
| 14.1 (6.3) | 19.6 | 8.33 | 3.8 | 0.79 (0) | |
| 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | |
| 0 | 2.17 | 0 | 0 | 0 | |
| 0 | 2.17 | 0 | 0 | 0 | |
| 0 | 2.17 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0.79 (0) | |
N number of chromosomes (N = 162 for 1D1-h in Afro-Gabonese, and this is why the sum of frequency values in this group did not equal exactly 100%). The distribution of MBL2 genotypes in the Gabonese adult sample as a whole was homogeneous with the distribution that we formerly found in a sample of 284 Gabonese children (39 malaria-free, 136 with uncomplicated malaria and 109 with severe malaria but not with cerebral malaria and/or severe hypoglycemia) [2]. In parentheses: differing values found in samples of smaller size but similar ethnic background (Gabonese Bantu, Danish, Chinese and Colombian Piapoco, respectively) [19]. & the Chinese investigated by others presented an additional LYPB haplotype with 6.3% frequency and a g.634G > A SNP (called by them -258 G > A) [19]. In italics: this work; 1 [19]; 2 [18]; 3 [28]; 4 [22].* p < 0.05 ** p < 0.01
Performance of haplotyping algorithms.
| Population | n | Ambiguous genotypes | Expectation maximization (EM) | Pseudo-Bayesian (ELB) | ||
|---|---|---|---|---|---|---|
| Wrongly assembled haplotypes | Spurious "new" haplotypes | Wrongly assembled haplotypes | Spurious "new" haplotypes | |||
| Afro-Gabonese | 428 | 59.6% | 1.1% | 7 | 2.2% | 11 |
| German Europeans | 104 | 63.5% | 1.4% | 1 | 1.4% | 2 |
| Euro-Brazilians | 131 | 66.4% | 1.1% | 2 | 0.8% | 1 |
| Guarani | 79 | 61% | 0 | 0 | 0 | 0 |
| Kaingang | 63 | 71% | 0 | 0 | 0.8% | 0 |
n = number of individuals.
Figure 2Mutational network of . The size of each node is proportional to the haplotype frequency in the pooled sample (this work and [18,28]). Variant nucleotide positions are indicated in red. In black: African; red: European; orange: North Chinese; blue: Guarani; green: Kaingang.
Figure 3Global Fst values distributed along the sequence. Fsts were calculated using the data of Table 2 and of [19]. Nucleotide positions corresponding to variant sites are shown on the x-axis. *** p < 0.001, ** p < 0.01.
Figure 4Sliding window plot of . Statistics were calculated for overlapping windows of 60 bp, placed at 15 bp intervals along the sequence. * P < 0.05.
MBL2 sequence diversity parameters of several populations.
| N | 856 | 48 | 470 | 62 | 348 | 46 | 48 | 126 | 158 | |
| S | 22 | 13 | 16 | 9 | 9 | 13 | 9 | 11 | 7 | |
| π | 44.1 | 43.7 | 37.6 | 37.6 | 27.8 | 40.4 | 25.2 | 17.4 | 16.9 | |
| θW | 37.0 | 36.1 | 29.3 | 23.6 | 17,3 | 36.5 | 25.0 | 27.4 | 15.3 | |
| DT | 0.459 | 0.63 | 0.672 | 1.591 | 1.297 | 0.325 | 0.024 | -0.934 | 0.23 | |
| DF | -1.157 | -0.004 | -0.404 | 1.343 | 1.23 | 0.012 | 1.353 | -0.498 | 1.159 | |
| F | -0.583 | 0.245 | 0.028 | 1.68* | 1.51 | 0.137 | 1.085 | -0.782 | 0.996 | |
| H | -1.28 | 0.743 | -0.276 | -0.387 | -1.718 | 0.039 | -2.163# | -3.27* | -3.282* | |
| S | 16 | 10 | 13 | 7 | 8 | 10 | 8 | 9 | 6 | |
| π | 52.1 | 51.8 | 43.4 | 43.1 | 32.8 | 46.9 | 28.7 | 1.62 | 1.42 | |
| θW | 35.6 | 36.8 | 31.5 | 24.3 | 20.3 | 37.1 | 29.4 | 30.2 | 17.4 | |
| DT | 1.044 | 1.177 | 0.858 | 1.963 | 1.268 | 0.762 | -0.063 | -1.146 | -0.383 | |
| DF | 0.129 | 0.791 | -0.783 | 1.227 | 1.166 | -0.406 | 1.302 | -0.1 | 1.083 | |
| F | 0.606 | 1.08 | -0.181 | 1.73* | 1.446 | -0.03 | 1.015 | -0.56 | 0.693 | |
| H | -1.579 | 0.434 | -0.619 | -0.74 | -1.921# | -0.32 | -2.39# | -3.534* | -3.33* | |
| S | 6 | 3 | 3 | 2 | 1 | 3 | 1 | 2 | 1 | |
| π | 19.3 | 19.6 | 19.6 | 21.7 | 13.00 | 21.5 | 15.2 | 22.5 | 26.8 | |
| θW | 41.3 | 36.1 | 22.5 | 22.8 | 8.3 | 36.5 | 12.0 | 19.8 | 9.5 | |
| DT | -0.92 | -0.953 | -0.183 | -0.079 | 0.52 | -0.866 | 0.352 | 0.202 | 1.925 | |
| DF | -2.55* | -1.7 | 0.717 | 0.726 | 0.43 | 0.9 | 0.547 | -1.124 | 0.466 | |
| F | -2.386 | -1.718 | 0.499 | 0.564 | 0.544 | 0.432 | 0.567 | -0.833 | 1.071 | |
| H | 0.30 | 0.308 | 0.343 | 0.353 | 0.203 | 0.359 | 0.227 | 0.265 | 0.048 |
Am. American European include the German Europeans and Euro-Brazilians, which were homogeneous at the genotypic level N. Chin. North Chinese Pac. Pacific; N number of chromosomes; S number of segregating sites; π (×10-4) nucleotide diversity per site, θW (×10-4) Watterson's Theta per site from S; DT Tajma's D; DF Fu and Li's D without an outgroup (D*); F Fu and Li's F without an outgroup (F*); H Fay and Wu's H with the chimpanzee (Genbank: AY970685.1) as an outgroup. In italics: this work; 2 [18]; 3 [28]; 4 [22]. * P < 0.05 # P < 0.10
Figure 5Distribution and origins of the . Arrows denote the mutational steps between haplotypes (six between *1 and *4) and when dotted, the ancient migratory routes with their approximate ages [51]. The haplotypes which could have been lost by natural selection and/or genetic drift were denoted by '?'. In bold: haplotypes generated before human out-of-Africa migration. Squared: more recent haplotypes, with geographically restricted distribution. KYA thousand years ago.