| Literature DB >> 17072863 |
Petr Vyletal1, Jitka Sokolová, David N Cooper, Jan P Kraus, Michael Krawczak, Guglielmina Pepe, Olga Rickards, Hans G Koch, Michael Linnebank, Leo A J Kluijtmans, Henk J Blom, Godfried H J Boers, Mette Gaustadnes, Flemming Skovby, Bridget Wilcken, David E L Wilcken, Generoso Andria, Gianfranco Sebastio, Eileen R Naughten, Sufin Yap, Toshihiro Ohura, Ewa Pronicka, Aranka Laszlo, Viktor Kozich.
Abstract
Homozygosity or compound heterozygosity for the c.833T>C transition (p.I278 T) in the cystathionine beta-synthase (CBS) gene represents the most common cause of pyridoxine-responsive homocystinuria in Western Eurasians. However, the frequency of the pathogenic c.833C allele, as observed in healthy newborns from several European countries (q(c.833C) approximately equals 3.3 x 10(-3)), is approximately 20-fold higher than expected on the basis of the observed number of symptomatic homocystinuria patients carrying this mutation (q(c.833C) approximately equals 0.18 x 10(-3)), implying clinical underascertainment. Intriguingly, the c.833C mutation is also present in combination with a 68-bp insertion, c.[833C; 844_845ins68], in a substantial proportion of chromosomes from nonhomocystinuric individuals worldwide. We have sought to study the relationship between the pathogenic and nonpathogenic c.833C-bearing chromosomes and to determine whether the pathogenic c.[833C; -] chromosomes are identical-by-descent or instead arose by recurrent mutation. Initial haplotype analysis of 780 randomly selected Czech and sub-Saharan African wild-type chromosomes, employing 12 intragenic markers, revealed 29 distinct CBS haplotypes, of which 10 carried the c.[833C; 844_845ins68] combination; none carried an isolated c.833C or c.844_845ins68 mutation. Subsequent examination of 69 pathogenic c.[833C; -] chromosomes, derived from homocystinuria patients of predominantly European origin, disclosed three unrelated haplotypes that differed from their wild-type counterparts by virtue of the presence of c.833C, thereby indicating that c.833T>C transition has occurred repeatedly and independently in the past. Since c.833T does not reside within an obvious mutational hotspot, we surmise that the three pathogenic and comparatively prevalent c.[833C; -] chromosomes may have originated by recurrent gene conversion employing the common nonpathogenic c.[833C; 844_845ins68] chromosomes as templates.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17072863 PMCID: PMC2630376 DOI: 10.1002/humu.20430
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
FIGURE 1Structure of the wild-type c. [833T; –] and c. [833C; 844_845ins68] CBS chromosomes. A:The genomic organization of the CBS gene, with exons depicted as numbered black boxes and the initiator ATG in exon 1 indicated by an upward-pointing arrow. Markers genotyped in this study are denoted by downward-pointing arrows. B: Boxed uppercase letters represent the sequence of wild-type exon 8. The lowercase sequence corresponds to flanking intronic regions. The thymidine residue at nucleotide position 833 is marked by an arrow. C:The sequence of the variant c. [833C; 844_845ins68] chromosome. The 68-bp insertion between nucleotides 844 and 845 is bracketed. Both the mutation c.833C in exon 8 and the wild-type thymidine 833 within the 68-bp insertion sequence are indicated by arrows. Motifs corresponding to DNA polymerase α/β frameshift hotspots and inverted repeats are shaded and underlined, respectively.
CBS MarkerAllele Frequencies in Controls*
| Allele frequencies in different populations | ||||||
|---|---|---|---|---|---|---|
| African control populations | ||||||
| Marker | Czech controls | African controls | Fon | Dendi | Berba | Bariba |
| Number of chromosomes | 400 | 380 | 100 | 100 | 80 | 100 |
| c.-3792G>A | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| c.699C>T | 0.32 | 0.20 | 0.17 | 0.24 | 0.15 | 0.24 |
| c.833T>C | 0.07 | 0.33 | 0.33 | 0.30 | 0.36 | 0.24 |
| c.844_845ins68 | 0.07 | 0.33 | 0.33 | 0.30 | 0.36 | 0.24 |
| C.1080C>T | 0.42 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 |
| c.1444_1467+7(16) | 0.002 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| C.1444_1467+7(17) | 0.07 | 0.22 | 0.29 | 0.15 | 0.24 | 0.21 |
| C.1444_1467+7(18) | 0.77 | 0.26 | 0.23 | 0.27 | 0.20 | 0.33 |
| C.1444_1467+7(19) | 0.11 | 0.19 | 0.12 | 0.16 | 0.15 | 0.13 |
| C.1444_1467+7(21) | 0.05 | 0.38 | 0.36 | 0.42 | 0.41 | 0.33 |
| c.*543T>C (known as c.1985T>C) | 0.46 (C) | 0.12 (T) | 0.14 (T) | 0.14 (T) | 0.13 (T) | 0.07 (T) |
With the exception of the VNTR polymorphism, the marker frequencies are given as minor allele frequencies. The cDNA sequence numbering is based on GenBank reference sequence NM_000071.1 (the first adenosine in the initiator ATG is designated as + 1), genomic DNA numbering is based on GenBank reference sequence NC_000021.7 and the numbering of the 31-bp c.1444_1467+7(16.21) VNTR is based on a published report [Lievers et al., 2001]. For an overview of polymorphisms in the CBS gene, see [Kožich and Kraus, 2001].
The length of the individual VNTR motifs is invariably 31 bp; however, different single-nucleotide substitutions may be present within the repeat units, as described previously by Lievers et al. [2001].
CBS Haplotypes Observed in Control Chromosomes
| Population | ||||
|---|---|---|---|---|
| Haplotype | Detection method | X, number of c.1444_1467+7(16_21) VNTR repeats | Czech | African |
| c.[833T;-] haplotypes | ||||
| [G; T; T; –; C; α; X; C] | P | 21 | − | + |
| [G; C; T; –; C; β X; C] | H | 19 | − | + |
| [G; T; T; –; C; β X; C] | H | 19 | − | + |
| [G; T; T; –; C; γ; X; C] | H | 19 | + | − |
| [G; T; T; –; C; γ; X; T] | H | 18 | + | + |
| same as above | H | 19 | + | + |
| [G; C; T; –; C; δ; X; C] | H | 18 | + | + |
| same as above | H | 21 | + | + |
| [G; C; T; –; C; δ; X; T] | H | 17 | + | + |
| same as above | H | 18 | + | + |
| same as above | H | 19 | + | + |
| same as above | H | 21 | + | + |
| [G; T; T; –; C; δ; X; T] | H | 17 | + | + |
| same as above | H | 18 | + | + |
| same as above | H | 21 | + | + |
| [G; C; T; –; T; δ; X; C] | H | 18 | + | − |
| [G; T; T; –; T; δ; X; C] | P | 16 | + | − |
| same as above | H | 18 | + | − |
| [G; C; T; –; T; δ; X; T] | H | 18 | + | − |
| c.[833C; 844_845ins68] haplotypes | ||||
| [G; C; C; ins; C; α; X; C] | P | 18 | − | + |
| [G; C; C; ins; C; β; X; T] | H | 17 | + | + |
| same as above | H | 18 | + | + |
| same as above | H | 21 | + | + |
| [A; C; C; ins; C; β; X; T] | P | 17 | + | − |
| [G; C; C; ins; T; β; X; T] | P | 17 | − | + |
| [G; T; C; ins; C; β; X; T] | H | 17 | − | + |
| [G; C; C; ins; C; δ; X; C] | P | 21 | − | + |
| [G; C; C; ins; C; δ; X; T] | H | 17 | − | + |
| same as above | H | 19 | − | 1 |
Haplotypes were determined either by homozygosity of at least 11 markers (detection method H) or by physical haplotyping (detection method P).
Haplotypes are described in relation to the reference CBS haplotype- NM_000071.1: c.[-3792G>A; 699C>T; 833T>C; 844_845ins68; 1080C>T; intron 12 cluster; c.1444_1467+7(16_21); *543T>C], for details see Patients and Methods. The intron 12 variants C.1358+161G>A, c.1358 + 264A>G, c.1359-219C>T, c.1359-134G>A and c.1359-30C>T were clustered as follows: α, [G; G; C; G; C]; β, [G; G; C; A;T]; γ [G; G; T; A; T] and δ, [G; A; C; G; C].
+, haplotype observed in the studied population sample; −, haplotype not detected in the studied population sample.
Population Frequency of Pathogenic CBS Chromosomes*
| Frequency calculated from number of diagnosed patients with homocystinuria | Frequency determined in healthy controls | ||
|---|---|---|---|
| Country | All pathogenic | c. [833C; –] chromosomes | c. [833C; –] chromosomes |
| Czech Republic | 1.1 (0.86–1.5) | 0.30 (0.23–0.40) | 2.0 (0.83; 4.6) |
| Denmark | 1.6 (1.2–2.1) | 0.81 (0.62–1.0) | 7.0 (3.4–14.6) |
| France | 0.52 (0.40–0.66) | 0.048 (0.038–0.062) | |
| Germany | 0.76 (0.66–0.88) | 0.21 (0.19–0.25) | 7.5 (2.6–22.0) |
| Hungary | 1.1 (0.83–1.5) | 0.046 (0.035–0.061) | |
| Ireland | 3.3 (2.8–3.8) | 0.12 (0.099–0.13) | |
| Italy | 0.58 (0.46–0.72) | 0.14 (0.11–0.17) | |
| Norway | 1.5 (1.1–2.0) | 0.15 (0.11–0.20) | 3.1 (1.5–6.4) |
| Netherlands | 1.5 (1.2–1.7) | 0.80 (0.68–0.94) | 2.0 (0.5–7.3) |
| Poland | 0.66 (0.52–0.84) | 0.020 (0.015–0.025) | |
| Slovakia | 1.2 (0.87–1.7) | 0.076 (0.054–0.11) | |
| Spain | 0.58 (0.45–0.76) | 0 (0.0–0.0) | |
| United Kingdom | 0.49 (0.38–0.63) | 0.14 (0.11–0.18) | |
| Europe | 0.82 (0.77–0.87) | 0.18 (0.17–0.19) | 3.3 (2.2–5.0) |
| Australia-New South Wales | 2.4 (2.1–2.9) | 0.31 (0.26–0.36) | 0 (0.0–4.0) |
| Japan | 0.58 (0.50–0.67) | 0.013 (0.012–0.016) | |
Frequencies are given as number per 1,000 chromosomes (i.e., x 10 −3).
Raw data for calculating allele frequencies are given in Supplementary Table S2; the calculations are described in detail in the Patients and Methods section.
Population frequencies of the c.[833C; –] chromosome were calculated from original data published for Czech [Sokolová et al., 2001], Danish [Gaustadnes et al., 1999], German [Linnebank et al., 2001a], Norwegian [Refsum et al., 2004], and Australian newborns (personal communication by B. Wilcken; n = 1,160 consecutive newborns), and for Dutch healthy controls [Griffioen et al., 2005]. The 95% confidence intervals are given in parentheses.
FIGURE 2Distribution of c.[833C; –] haplotypes in Europe.Three different pathogenic haplotypes ([G; C; C; –; C; δ; 21;T], [G; C; C; –;T; δ; 18; C], and [G; T; C; –; C; γ; 19; C]) were observed among European homocystinuria patients. Haplotypes are described in relation to the reference CBS haplotype, for details see Patients and Methods. The number of chromosomes is proportional to the size of the column in each country, normalized by the black square representing one allele; it may also be found in Supplementary Table S3. [Color figure can be viewed in the online issue, which is available at www.interscience.wiley.com.]
FIGURE 3Gene conversion as a possible mechanism for recurrent mutation. A hypothetical mechanism by which the c. [833C; 844_845ins68] chromosomes could serve as template for the generation of the pathogenic c. [833C; –] chromosomes. The upper two strands represent the wild-type c.[833T; –] chromosomes and the two bottom strands represent a variant wild-type c. [833C; 844_845ins68] chromosome. Intron 7 is shown as a black line, exon 8 is a gray line, and the insertion of 68 bp is shown in green. The pathogenic mutation c.833C is shown as a red square. In step A, a double strand break occurs at the DNA polymerase α/β frameshift hotspot followed by 5′→3′ exonuclease activity on the c.[833T; –] chromosome in step B. Subsequent strand invasion occurs in step C and results in the formation of a Holliday junction in step D. In step E, strandrepair leads to the occurrence of a loop that is subsequently cleaved, generating the neutral c. [833C; 844_845ins68] and pathogenic c.[833C; –] chromosomes during gametogenesis (step F).