| Literature DB >> 24288652 |
Masaya Itou1, Mitsuharu Sato, Takashi Kitano.
Abstract
The human ABO blood group gene consists of three main alleles (A, B, and O) that encode a glycosyltransferase. The A and B alleles differ by two critical amino acids in exon 7, and the major O allele has a single nucleotide deletion (Δ261) in exon 6. Previous evolutionary studies have revealed that the A allele is the most ancient, B allele diverged from the A allele with two critical amino acid substitutions in exon 7, and the major O allele diverged from the A allele with Δ261 in exon 6. However, a recent phylogenetic network analysis study showed that the A allele of humans emerged through a recombination between the B and O alleles. In the previous study, a restricted dataset from only two populations was used. In this study, therefore, we used a large single nucleotide polymorphism (SNP) dataset from the HapMap Project. The results indicated that the A101-A201-O09 haplogroup was a recombinant lineage between the B and O haplotypes, containing the intact exon 6 from the B allele and the two critical A type sites in exon 7 from the major O allele. Its recombination point was assumed to be located just behind Δ261 in exon 6.Entities:
Year: 2013 PMID: 24288652 PMCID: PMC3830805 DOI: 10.1155/2013/406209
Source DB: PubMed Journal: Int J Evol Biol ISSN: 2090-052X
Figure 1Explanation of a recombination event in a phylogenetic network using model data. A nucleotide sequence (a) and a corresponding phylogenetic network (b) are shown. Modified from [6]. See text for the details.
Figure 2The location of SNP sites on the human ABO blood group genes obtained by the HapMap Project. A dashed line indicates Δ261, and blue vertical lines indicate the two critical sites that determine the blood group A and B specificities. The SNP number is shown in the Supplementary Material.
Figure 3The phylogenetic network constructed from 3pop_data_1 (a). A circle graph presented on each node indicates the proportion of three populations, and the size of the circle graph indicates frequency of each haplotype. The numbers on each branch are nucleotide positions responsible for those branches. The underlined haplotypes were used for the PNarec program. The reticulation of the phylogenetic network for a recombinant and its parental haplogroups is shown by thick lines ((b) and (c)).
Recombinants expected from phylogenetic networks and PNarec.
| Data | Recombinant | Parent 1 (forward) | Parent 2 (backward) | Region of recombination point |
|---|---|---|---|---|
| 3pop_data_1 | O01 | O02 | B101 | 32 and 42 |
| A101-A201-O09* | B101 | O01 (O02) | 15 and Δ261 (15 and 36) | |
| 3pop_data_2 | XV* (O02/B101) | O02 | B101 | 12 and Δ261 (12 and 16) |
| O01 | O02 | B101 | 32 and 36 | |
| A101-A201-O09* | B101 | O01 | 15 and Δ261 (15 and 36) | |
| 11pop_data_1 | O01 | O02 | B101 | 20 and 26 |
| A101-A201-O09* | B101 | O01 (O02) | 10 and Δ261 (1 and 11) | |
| 11pop_data_2 | O01 | O02 | B101 | 20 and 26 |
| A101-A201-O09* | B101 | O01 (O02) | 10 and Δ261 (1 and 11) |
Recombinants estimated not only from a phylogenetic network but also from the PNarec method are indicated by asterisks. Notation in parentheses indicates the result expected from PNarec.
Figure 4The phylogenetic network constructed from 3pop_data_2. Details are as in Figure 3.
Figure 5The phylogenetic network constructed from 3pop_data_2, except for the XV (O02/B101) haplotype. Details are as in Figure 3.
Figure 6The phylogenetic network constructed from 11pop_data_1. Details are as in Figure 3.
Figure 7The phylogenetic network constructed from 11pop_data_2. Details are as in Figure 3.
Minor haplotypes which were not included in the phylogenetic network constructed by 11pop_data_2.
| Haplotype (possible haplogroup) | Number of sequences | Estimated cause (recombination point and its parental haplotypes*) |
|---|---|---|
| n (A101-A201-O09/O47) | 5 | Recombination event (b, 22–26, r) |
| q (O02/O47) | 11 | Recombination event (e, 26–28, w) |
| t (B101/A101-A201-O09) | 6 | Recombination event (f, 19–24, a/b/m) |
| u (B101/A101-A201-O09) | 5 | Recombination event (f, 2-3, a) |
| x (O01/A101-A201-O09) | 4 | Recombination event (g/h, 22–26, a/b/m) |
| y (B101/O02) | 13 | Recombination event (f, 7-8, e) |
| aa (O01/O02) | 5 | Recombination event (g/h, 25–27, c/d/e) |
| ab (B101 with O01) | 4 | Gene conversion (26–29 of g/h/i was converted into f) |
| ac (A101-A201-O09-O01/O47) | 6 | Recombination event and one nucleotide substitution (a/b/g/h, 10–14, r, and 1 substitution) |
| af (B101) | 3 | Recombination event (a/b/c/d/e/g/h/i/j/k/l/m, 2-3, f) |
| ag (O02/A101-A201-O09) | 7 | Recombination event (d/e, 20–23, a/b/m) |
| ai (A101-A201-O09-O01/O02) | 7 | Recombination event (a/b/g/h, 13-14, j) |
| ah (O47/B101) | 7 | Recombination event and one nucleotide substitution (p/r/w, 8–14, f, and 1 substitution) |
*For example, haplotype q is a recombinant between e (forward) and w (backward) between sites 26 and 28.
Figure 8A scheme of the evolutionary pathway of some haplogroups of the human ABO blood group genes. Red square indicates the A type gene. Black vertical bar indicates Δ261 in exon 6. The blue portion of the B gene indicates the two B type-specific critical amino acid sites; containing region in exon 7. Stars indicate recombination event. A rough time scale is shown by following the results of [6]; the divergence time of the lineage leading to B101: 2.08 million years ago, the divergence time of the lineage leading to O01: 1.98 million years ago, and the divergence time of the lineage leading to A101: 0.26 million years ago. Because times of divergence between A and O02, the Δ261 deletion, transfer of Δ261, two amino acid substitutions for B101, and recombination for O02/B101 cannot be estimated, these positions are located arbitrarily.