| Literature DB >> 19533213 |
Joana Abrantes1, David Posada, Patrice Guillon, Pedro J Esteves, Jacques Le Pendu.
Abstract
The alpha-2-fucosyltransferases (alpha2FTs) are enzymes involved in the biosynthesis of alpha2fucosylated glycan structures. In mammalian genomes, there are three alpha2FT genes located in tandem-FUT1, FUT2, and Sec1-each contained within a single exon. It has been suggested that these genes originated from two successive duplications, with FUT1 being generated first and FUT2 and Sec1 second. Despite gene conversion being considered the main mechanism of concerted evolution in gene families, previous studies of primates alpha2FTs failed to detect it, although the occurrence of gene conversion between FUT2 and Sec1 was recently reported in a human allele. The primary aim of our work was to initiate a broader study on the molecular evolution of mammalian alpha2FTs. Sequence comparison leads us to confirm that the three genes appeared by two rounds of duplication. In addition, we were able to detect multiple gene-conversion events at the base of primates and within several nonprimate species involving FUT2 and Sec1. Gene conversion involving FUT1 and either FUT2 or Sec1 was also detected in rabbit. The extent of gene conversion between the alpha2FTs genes appears to be species-specific, possibly related to functional differentiation of these genes. With the exception of rabbits, gene conversion was not observed in the region coding the C-terminal part of the catalytic domain. In this region, the number of amino acids that are identical between FUT1 and FUT2, but different in Sec1, is higher than in other parts of the protein. The biologic meaning of this observation may be related to functional constraints.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19533213 PMCID: PMC2706377 DOI: 10.1007/s00239-009-9239-0
Source DB: PubMed Journal: J Mol Evol ISSN: 0022-2844 Impact factor: 2.395
List of the coding sequences of the α2FTs genes, FUT1, FUT2 and Sec1, retrieved from GenBank and used in this study
| Species name | Common name | Gene | GenBank accession no. | Abbreviation |
|---|---|---|---|---|
| Human | U17894 | Human_FUT2 | ||
| M35531 | Human_FUT1 | |||
| U17895 | Human_Sec1 | |||
| Common chimpanzee | AF080604 | Chimpanzee_FUT2 | ||
| AF080603 | Chimpanzee_FUT1 | |||
| AB006612 | Chimpanzee_Sec1 | |||
| Orangutan | AB015636 | Orangutan_FUT2 | ||
| AB006610 | Orangutan_Sec1 | |||
| Gorilla | AF080606 | Gorilla_FUT2 | ||
| AF080605 | Gorilla_FUT1 | |||
| Lar gibbon | AF136648 | Gibbon_FUT2 | ||
| Agile gibbon | AB006609 | Gibbon_Sec1 | ||
| Green monkey | D87934 | Green_monkey_FUT2 | ||
| D87932 | Green_monkey_FUT1 | |||
| D87933 | Green_monkey_Sec1 | |||
| Cynomolgus | AF080608 | Cynomolgus_Sec1 | ||
| Cow | X99620 | Cow_FUT2 | ||
| NM_177499 | Cow_FUT1 | |||
| AF187851 | Cow_Sec1 | |||
| Mouse | AF064792 | Mouse_FUT2 | ||
| U90553 | Mouse_FUT1 | |||
| Y09882 | Mouse_Sec1 | |||
| Rat | AB006138 | Rat_FUT2 | ||
| AB015637 | Rat_FUT1 | |||
| AF131239 | Rat_Sec1 | |||
| Pig | U70881 | Pig_FUT2 | ||
| U70883 | Pig_FUT1 | |||
| U70882 | Pig_Sec1 | |||
| Rabbit | X91269 | Rabbit_FUT2 | ||
| X80226 | Rabbit_FUT1 | |||
| X80225 | Rabbit_Sec1 | |||
| Frog | NM_001004772 | Frog_FUT1 | ||
| Opossum | XM_001362239 | Opossum_FUT2_like |
Fig. 1cAIC model-averaged support for recombination break points as detected by GARD. Nucleotide position 1 in the graph corresponds to nucleotide position 184 of the human FUT2 sequence
Fig. 2ML trees for (a) segment A (nucleotides 184–720), (b) segment B (nucleotides 721–912), and (c) segment C (nucleotides 913–1032). Nucleotide positions are according to human FUT2 sequence
List of global inner fragments (484 polymorphisms and 849 aligned bases) obtained with Geneconv where inner fragments are runs of matching sites with penalties
| Sequences compared | Sim | BC KA | Aligned offsets | No. polymorphicc | No. differencesd | Total differencese | Mismatch penaltyf | ||
|---|---|---|---|---|---|---|---|---|---|
| Begin | End | Length | |||||||
| Cow_FUT2; Cow_Sec1 | 0.0000 | 0.00000 | 193 | 608 | 415 | 242 | 3 | 112 | 5 |
| Rabbit_FUT1; Rabbit_Sec1 | 0.0000 | 0.00000 | 755 | 931 | 177 | 93 | 3 | 198 | 3 |
| Rabbit_FUT1; Rabbit_FUT2 | 0.0000 | 0.00000 | 755 | 1023 | 269 | 152 | 14 | 180 | 3 |
| Rat_FUT2; Rat_Sec1 | 0.0000 | 0.00000 | 208 | 725 | 521 | 291 | 18 | 112 | 5 |
| Orangutan_FUT2; Orangutan_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 11 | 134 | 4 |
| Mouse_FUT2; Mouse_Sec1 | 0.0000 | 0.00000 | 307 | 728 | 422 | 241 | 9 | 110 | 5 |
| Orangutan_FUT2; Gibbon_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 16 | 140 | 4 |
| Gorilla_FUT2; Orangutan_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 11 | 131 | 4 |
| Orangutan_FUT2; Green_monkey_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 18 | 143 | 4 |
| Orangutan_FUT2; Human_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 14 | 135 | 4 |
| Orangutan_FUT2; Chimpanzee_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 14 | 135 | 4 |
| Human_FUT2; Orangutan_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 13 | 133 | 4 |
| Green_monkey_FUT2; Green_monkey_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 18 | 142 | 4 |
| Green_monkey_FUT2; Gibbon_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 18 | 142 | 4 |
| Green_monkey_FUT2; Orangutan_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 13 | 132 | 4 |
| Gorilla_FUT2; Gibbon_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 18 | 141 | 4 |
| Gorilla_FUT2; Human_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 12 | 130 | 4 |
| Gorilla_FUT2; Chimpanzee_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 12 | 130 | 4 |
| Human_FUT2; Human_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 14 | 132 | 4 |
| Human_FUT2; Chimpanzee_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 14 | 132 | 4 |
| Mouse_FUT2; Rat_Sec1 | 0.0000 | 0.00000 | 241 | 728 | 488 | 279 | 29 | 131 | 4 |
| Chimpanzee_FUT2; Orangutan_Sec1 | 0.0000 | 0.00000 | 232 | 559 | 328 | 193 | 12 | 131 | 4 |
| Human_FUT2; Gibbon_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 20 | 143 | 4 |
| Green_monkey_FUT2; Human_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 16 | 134 | 4 |
| Green_monkey_FUT2; Chimpanzee_Sec1 | 0.0000 | 0.00000 | 208 | 559 | 352 | 201 | 16 | 134 | 4 |
| Green_monkey_FUT2; Cynomolgus_Sec1 | 0.0000 | 0.00001 | 208 | 559 | 352 | 201 | 18 | 136 | 4 |
| Orangutan_FUT2; Cynomolgus_Sec1 | 0.0000 | 0.00001 | 208 | 559 | 352 | 201 | 18 | 136 | 4 |
| Gorilla_FUT2; Green_monkey_Sec1 | 0.0000 | 0.00001 | 208 | 559 | 352 | 201 | 21 | 143 | 4 |
| Chimpanzee_FUT2; Chimpanzee_Sec1 | 0.0000 | 0.00001 | 208 | 559 | 352 | 201 | 14 | 129 | 4 |
| Chimpanzee_FUT2; Human_Sec1 | 0.0000 | 0.00002 | 208 | 559 | 352 | 201 | 15 | 130 | 4 |
| Chimpanzee_FUT2; Gibbon_Sec1 | 0.0000 | 0.00002 | 232 | 559 | 328 | 193 | 19 | 141 | 4 |
| Human_FUT2; Green_monkey_Sec1 | 0.0000 | 0.00003 | 208 | 559 | 352 | 201 | 23 | 147 | 4 |
| Chimpanzee_FUT2; Green_monkey_Sec1 | 0.0000 | 0.00011 | 232 | 559 | 328 | 193 | 22 | 145 | 4 |
| Human_FUT2; Cynomolgus_Sec1 | 0.0000 | 0.00012 | 208 | 559 | 352 | 201 | 22 | 140 | 4 |
| Gorilla_FUT2; Cynomolgus_Sec1 | 0.0000 | 0.00024 | 208 | 559 | 352 | 201 | 22 | 138 | 4 |
| Chimpanzee_FUT2; Cynomolgus_Sec1 | 0.0000 | 0.00044 | 232 | 559 | 328 | 193 | 21 | 138 | 4 |
| Gorilla_FUT2; Cow_Sec1 | 0.0000 | 0.00044 | 207 | 569 | 363 | 207 | 26 | 145 | 4 |
| Human_FUT2; Cow_Sec1 | 0.0000 | 0.00059 | 207 | 569 | 363 | 207 | 27 | 148 | 4 |
| Orangutan_FUT2; Cow_Sec1 | 0.0000 | 0.00099 | 207 | 551 | 345 | 196 | 25 | 147 | 4 |
| Rat_FUT2; Mouse_Sec1 | 0.0000 | 0.00181 | 307 | 728 | 422 | 241 | 23 | 121 | 4 |
| Cow_FUT2; Orangutan_Sec1 | 0.0001 | 0.00335 | 208 | 551 | 344 | 195 | 31 | 162 | 3 |
| Green_monkey_FUT2; Cow_Sec1 | 0.0002 | 0.00554 | 365 | 569 | 205 | 118 | 12 | 153 | 4 |
| Chimpanzee_FUT2; Cow_Sec1 | 0.0002 | 0.00657 | 232 | 569 | 338 | 198 | 27 | 147 | 4 |
| Cow_FUT2; Chimpanzee_Sec1 | 0.0005 | 0.01069 | 208 | 551 | 344 | 195 | 33 | 164 | 3 |
| Pig_FUT2; Cow_Sec1 | 0.0009 | 0.01627 | 208 | 569 | 362 | 206 | 27 | 137 | 4 |
| Pig_FUT2; Chimpanzee_Sec1 | 0.0032 | 0.03161 | 253 | 551 | 299 | 176 | 30 | 166 | 3 |
| Pig_FUT2; Orangutan_Sec1 | 0.0056 | 0.05048 | 208 | 551 | 344 | 195 | 35 | 165 | 3 |
| Pig_FUT2; Human_Sec1 | 0.0093 | 0.10173 | 208 | 551 | 344 | 195 | 35 | 162 | 3 |
| Cow_FUT2; Gibbon_Sec1 | 0.0112 | 0.11739 | 208 | 551 | 344 | 195 | 37 | 170 | 3 |
| Cow_FUT2; Human_Sec1 | 0.0117 | 0.12367 | 208 | 426 | 219 | 127 | 17 | 160 | 4 |
| Pig_FUT2; Gibbon_Sec1 | 0.0126 | 0.14306 | 208 | 551 | 344 | 195 | 37 | 169 | 3 |
| Cow_FUT2; Green_monkey_Sec1 | 0.0211 | 0.22221 | 349 | 551 | 203 | 122 | 19 | 168 | 3 |
Only fragments with Sim p ≤ 0.05 are listed
a,bCorrected for multiple comparisons
bBonferroni-corrected KA (BLAST-like) p, where KA p is not corrected for multiple pairwise comparison. BC p is KA p × 465
cNumber of polymorphic sites in the fragment
dNumber of mismatches within the fragment
eTotal number of mismatches between two sequences
fPenalty per mismatch for the two sequences
Additional pairwise fragments obtained by Geneconv with BC Pairwise SimPval <0.05 or listed global fragments with significantly better BC SimPval (≤3 pairwise fragments considered per pair)
| Sequences compared | BC Sim | BC KA | Aligned offsets | No. polymorphicc | No. differencesd | Tot differencese | Mismatch penaltyf | ||
|---|---|---|---|---|---|---|---|---|---|
| Begin | End | Length | |||||||
| Orangutan_FUT2; Pig_Sec1 | 0.0465 | >1.0 | 207 | 479 | 273 | 154 | 19 | 127 | 4 |
| Rabbit_FUT2; Rabbit_Sec1 | 0.0465 | >1.0 | 184 | 881 | 698 | 393 | 5 | 39 | 13 |
| Pig_FUT2; Gibbon_Sec1 | 0.0465 | 0.14306 | 208 | 551 | 344 | 195 | 37 | 169 | 3 |
| Pig_FUT2; Pig_Sec1 | 0.0465 | 0.26279 | 208 | 929 | 722 | 397 | 2 | 42 | 12 |
| Cow_FUT2; Human_Sec1 | 0.0465 | 0.12367 | 208 | 426 | 219 | 127 | 17 | 160 | 4 |
| Cow_FUT2; Gibbon_Sec1 | 0.0465 | 0.11739 | 208 | 551 | 344 | 195 | 37 | 170 | 3 |
aPairwise Sim p × 465
bBonferroni-corrected KA (BLAST-like) p where KA p is not corrected for multiple pairwise comparisons. BC p is KA p × 465
cNumber of polymorphic sites in the fragment
dNumber of mismatches within the fragment
eTotal number of mismatches between two sequences
fPenalty per mismatch for the two sequences
Fig. 3Amino acid alignment of the catalytic domain of the α2FTs proteins of a the N-terminal region and b the C-terminal region. The homologous regions between FUT1, FUT2, and Sec1 that may have resulted by gene conversion are highlighted in light grey. Amino acid substitutions shared between FUT1 and FUT2, but not shared with Sec1, are highlighted in black. Closed square = stop codon; dashes = alignment gaps; dots = identity with the consensus sequence; and question marks = consensus sequence represents nonconsensual amino acids. The consensus sequence is based on the amino acid sequences of the α2FTs proteins of the mammals presented