Literature DB >> 18996919

Mutation and polymorphism spectrum in osteogenesis imperfecta type II: implications for genotype-phenotype relationships.

Dale L Bodian1, Ting-Fung Chan, Annie Poon, Ulrike Schwarze, Kathleen Yang, Peter H Byers, Pui-Yan Kwok, Teri E Klein.   

Abstract

Osteogenesis imperfecta (OI), also known as brittle bone disease, is a clinically and genetically heterogeneous disorder primarily characterized by susceptibility to fracture. Although OI generally results from mutations in the type I collagen genes, COL1A1 and COL1A2, the relationship between genotype and phenotype is not yet well understood. To provide additional data for genotype-phenotype analyses and to determine the proportion of mutations in the type I collagen genes among subjects with lethal forms of OI, we sequenced the coding and exon-flanking regions of COL1A1 and COL1A2 in a cohort of 63 subjects with OI type II, the perinatal lethal form of the disease. We identified 61 distinct heterozygous mutations in type I collagen, including five non-synonymous rare variants of unknown significance, of which 43 had not been seen previously. In addition, we found 60 SNPs in COL1A1, of which 17 were not reported previously, and 82 in COL1A2, of which 18 are novel. In three samples without collagen mutations, we found inactivating mutations in CRTAP and LEPRE1, suggesting a frequency of these recessive mutations of approximately 5% in OI type II. A computational model that predicts the outcome of substitutions for glycine within the triple helical domain of collagen alpha1(I) chains predicted lethality with approximately 90% accuracy. The results contribute to the understanding of the etiology of OI by providing data to evaluate and refine current models relating genotype to phenotype and by providing an unbiased indication of the relative frequency of mutations in OI-associated genes.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18996919      PMCID: PMC2638801          DOI: 10.1093/hmg/ddn374

Source DB:  PubMed          Journal:  Hum Mol Genet        ISSN: 0964-6906            Impact factor:   6.150


INTRODUCTION

Osteogenesis imperfecta (OI), also known as brittle bone disease, is a heterogeneous disorder characterized by susceptibility to fracture. The disease varies in severity from mild (OI type I) to perinatal lethal (OI type II) and exhibits both autosomal dominant and recessive inheritance patterns. The dominantly inherited forms that account for ∼90% of infants and adults with OI result from heterozygous mutations in COL1A1 or COL1A2, the genes that encode the proαl(I) and proα2(I) chains of type I collagen, the major structural protein of bone. Recessively inherited OI results in many cases from homozygous or compound heterozygous mutations in CRTAP and LEPRE1 (1–3), which encode cartilage-associated protein and leucine proline-enriched proteoglycan (leprecan; prolyl-3-hydroxylase-1), respectively. These two proteins are part of a complex that hydroxylates a single proline residue at position 986 of the triple helical domain of the proα1(I) chain of collagen during its biosynthesis. Mutations in each of the four genes, COL1A1, COL1A2, CRTAP and LEPRE1, lead to the production of type I collagen molecules that undergo excessive post-translational modification, at least in cultured dermal fibroblasts (1,2,4). The relationship between the clinical severity of OI and the causative mutations is only partially understood, despite the availability of mutation sequences from >800 individuals with OI (5). The mildest forms of OI generally result from heterozygous mutations that lead to the loss of mRNA from one COL1A1 allele due to the presence of premature termination codons in the coding sequence of that allele (6–8). Different classes of mutations (single codon nonsense mutations, frameshift mutations and splice site mutations that result in use of out-of-frame cryptic splice sites) can all lead to activation of the nonsense-mediated mRNA decay pathway (6). In the more severe forms of OI in which there is often bone deformity, increased fracture rate, dentinogenesis imperfecta and short stature, the most frequent mutations result in substitution of the glycine residue in almost any of the 338 Gly-X-Y tripeptide motifs found in the triple helical region of each of the proα chains. For these mutations, it appears that the chain in which the mutation occurs, the position of the substituted glycine and the substituting residue all contribute to phenotypic outcome (9). Yet the nature of these relationships is not yet evident. This may reflect the possibility that different mutations result in lethality by distinct pathways (10–12). Given the diversity of mechanisms underlying the lethal forms of OI, understanding the relationship between genotype and phenotype requires mutation detection and clinical data from a larger number of individuals with OI type II than currently available. To initiate this process, we identified the disease-associated mutations in 62 subjects with OI type II. In 59 cases we found heterozygous mutations in the type I collagen genes, of which 23 of the distinct causative mutations in COL1A1 and 15 in COL1A2 are novel. In three samples without collagen mutations, we found homozygous inactivating mutations in CRTAP (one) and LEPRE1 (two individuals homozygous for the same West African allele). Since the subjects were selected based only on phenotype and the production of abnormal type I procollagen molecules, this provides an unbiased indication of the frequency of mutations in these genes in OI type II. The novel collagen mutations provide a valuable addition to the set of known mutations available for studying genotype–phenotype relationships.

RESULTS

COL1A1 and COL1A2 SNPs

A total of 203 distinct genomic sequence variations were identified in the type I collagen genes among the 63 subjects with OI type II, 98 in COL1A1 and 105 in COL1A2 (Table 1). This includes 142 SNPs (Supplementary Material, Tables S1 and S2) and 61 mutations and non-synonymous sequence variants of unknown significance (Table 2). Of the 142 SNPs, 35 are novel: 17 in COL1A1 and 18 in COL1A2. Of the 76 common SNPs with minor allele frequency (MAF) >0.05 listed in dbSNP which are located within the genomic regions of the resequencing effort (25 in COL1A1 and 51 in COL1A2), all but three (rs41317349 and rs2586494 in COL1A1 and rs17073 in COL1A2) are seen in our sample set. The SNPs rs41317349 and rs2586494 are particularly common in the African-American and Asian populations, respectively (13), so their absence may reflect the ethnicities in our OI patient cohort that is largely Caucasian. Approximately 90% of the identified SNPs in each gene are intronic (53 of 60 in COL1A1 and 73 of 82 in COL1A2), and only one non-synonymous SNP was recognized in each gene, rs1800215 (p.Ala1075Thr) in COL1A1 and rs42524 (p.Ala549Pro) in COL1A2. The rs42524 SNP is present at frequencies of 17.7, 26.0, 9.38 and 30.2% in African-American, Hispanic, Chinese and Caucasian populations of healthy individuals, respectively (13). In the same study, rs1800215 was found at a frequency of 6.25% in African-American population (negligible in other ethnic groups).
Table 1.

Distribution of distinct genomic sequence variations in type I collagen genes

COL1A1COL1A2
Previously reportedNovelPreviously reportedNovel
MutationsCodinga1121712
Splicing0604
SNPsSynonymous4253
Non-synonymous1010
Intronic38155815

aIncludes causative mutations and sequence variants of unknown significance.

Table 2.

COL1A1 and COL1A2 mutations

SubjectCOL1A1
COL1A2
Referencesa
cDNAProteinTriple helixExoncDNAProteinTriple helixExon
Missense mutations
F1c.1058G>Ap.Gly353Asp17517
E6c.1103G>Tp.Gly368Val19017
E1c.1273G>Ap.Gly425Ser24719(5,28)b
H2c.1364G>Ap.Gly455Asp27721c.700C>Tp.Arg234Cys14415
F5c.1409G>Tp.Gly470Val29221
D8c.1526G>Tp.Gly509Val33123
G4c.1643G>Cp.Gly548Ala37024
A1c.1804G>Ap.Gly602Arg42426(29)
G1c.1804G>Ap.Gly602Arg42426(29)
D7c.1814G>Ap.Gly605Asp42726
G7c.1840G>Cp.Gly614Arg43627
A7c.2218G>Cp.Gly740Arg56232
c.1168G>Ap.Ala390Thr21218
E4c.2425G>Ap.Gly809Ser63136(5,30)
E8c.2470G>Cp.Gly824Arg64637
H4c.2533G>Cp.Gly845Arg66737(5,31)
H5c.2542G>Cp.Gly848Arg67037
B1c.2596G>Ap.Gly866Ser68838(5,32,33)b
B7c.2623G>Ap.Gly875Ser69739(5)
c.863A>Tp.Glu288Ala11013
E5c.2650G>Ap.Gly884Ser70639
G2c.2650G>Ap.Gly884Ser70639
E3c.2687G>Ap.Gly896Asp71840
C5c.2839G>Tp.Gly947Cys76941(5)
c.2563A>Cp.Asn855His67738
G3c.2930G>Ap.Gly977Asp79941
B3c.3001G>Tp.Gly1001Cys82342
B2c.3065G>Tp.Gly1022Val84443(5,34)
F3c.3065G>Tp.Gly1022Val84443(5,34)
B5c.3164G>Ap.Gly1055Asp87744
G8c.3280G>Ap.Gly1094Ser91646
C6c.3299G>Ap.Gly1100Asp92246(5)
c.436C>Ap.Pro146Thrna5
D6c.4237G>Ap.Asp1413Asnna51(17)
F2c.847G>Cp.Gly283Arg19317
F8c.1190G>Ap.Gly397Glu30721
D4c.1360G>Tp.Gly454Cys36424(5)
A3c.1369_1370GG>CTp.Gly457Leu36724
F7c.1577G>Ap.Gly526Glu43627(5,35)c
E7c.1685G>Tp.Gly562Val47229
A6c.2215G>Cp.Gly739Arg64937(5)c
F6c.2243G>Tp.Gly748Val65837
H3c.2369G>Ap.Gly790Asp70039(36)
B8c.2567G>Tp.Gly856Val76641
E2c.2845G>Ap.Gly949Ser85944(5,37,38)b
H7c.2864G>Ap.Gly955Asp86544
A4c.3080G>Ap.Gly1027Glu93746(5)
Insertions and deletions
H6c.3148_3156dupGCTCCTGGTp.1050_1052dupAlaProGly87444
C7c.1380_1397delCCCCp.461_466delProAlaG371_37624
GCTGGAAAAGAAGGlyLysGluGly
G5c.2113_2121delp.705_707delAlaGlyPro615_61735(16)
GCTGGTCCT
F4c.2391_2393dupCCCp.Pro798dup70839
C8c.2415_2432delCCCTp.806_811delProProG
CCTGGTCCCCCTGGlyProProGly716_72140
H1c.3171_3188delTCCTp.1058_1062delProSe
TCTGGCCCTGCTGGrGlyProAlaGly968_97248
Splicing
C4c.957+5G>AIVS14+5G>ASkip exon 14
A5c.2509_2559+9delSkip exon 37
B6c.3261+1G>AIVS45+1G>ASkip exon 45
H8c.3423+2T>AIVS47+2T>ASkip exon 47
A2c.3424-1G>CIVS47-1G>CSkip exon 48
C2Genomic deletion of exons 40–48
B4c.1701_1719+6delSkip exon 29
G6c.1720-2A>GIVS29-2A>GUnknown–?skip exon 30
C1c.2673+1G>AIVS41+1G>ASkip exon 41
A8Not resolvedModerate level of skip exon 46

aReferences are listed for previously reported mutations; this field is left blank for novel mutations.

bReported in both lethal and non-lethal OI cases.

cReported as OI type III.

Distribution of distinct genomic sequence variations in type I collagen genes aIncludes causative mutations and sequence variants of unknown significance. COL1A1 and COL1A2 mutations aReferences are listed for previously reported mutations; this field is left blank for novel mutations. bReported in both lethal and non-lethal OI cases. cReported as OI type III. Three intronic SNPs in COL1A2 with MAF ∼0.05 or greater among the OI individuals were not seen previously in our healthy cohort (13) (Table 3). One, with an MAF of 4.1%, is a novel SNP, whereas the other two were reported previously. rs34026686 has only one submitter listed in dbSNP and no allele frequency information, suggesting that it could be a rare allele. However, in our OI sample set it has an MAF of 14.4%. rs10228528 has an MAF of 4.8% among the OI subjects and is listed with an MAF of only 0.8% among the Caucasians (CEU), although it is known to be more common in Han Chinese (CHB) and Yoruba African (YRI) populations (14).
Table 3.

Common SNPs in COL1A2

Genomic positionrefSNP cluster IDaMajor alleleMinor alleleMAF in OI subjectsMAF in CEUaMAF in CHBaMAF in YRIa
93863673NovelAG0.041No dataNo dataNo data
93878223rs34026686GA0.144No dataNo dataNo data
93883010rs10228528GA0.0480.0080.310.25

aAccording to dbSNP build 129.

Common SNPs in COL1A2 aAccording to dbSNP build 129.

COL1A1 and COL1A2 mutations

We identified mutations in type I collagen genes in 59 of the 63 samples (Table 2). In one sample (C2) in which protein studies indicated a rearrangement in COL1A1, no mutation was detected in genomic DNA but cDNA analysis revealed a nine-exon deletion (exons 40–48). Of the 59 samples in which collagen mutations were identified, 37 had mutations in COL1A1 and 22 in COL1A2. Among the 34 distinct COL1A1 causative mutations, 26 resulted in substitution for a glycine within the Gly-X-Y triplet domain of the triple helix, four altered splice sites and resulted in exon skipping, one resulted in exon skipping from a deletion spanning coding and intronic sequence, one was a nine-exon deletion from genomic DNA, one was a duplication of nine nucleotides and the last (p.Asp1413Asn) altered a single residue in the carboxyl-terminal propeptide. Our unpublished studies suggest that this last mutation interferes with chain association. Among the 22 causative mutations we identified in COL1A2, 13 resulted in substitution for triple helix glycine residues, three were deletions of different 18 bp fragments, two were splice site mutations that resulted in exon skipping, one was a 3 bp duplication and one a 9 bp deletion. Two additional mutations resulted in exon skipping: one deletion spanning coding and intronic sequence and the other one not yet resolved. In sample H2, we identified a mutation in COL1A1 that is most likely responsible for the OI type II phenotype c.1364G>A, p.Gly455Asp, Gly277Asp in the triple helix and, in addition, a second mutation, c.700C>T, p.Arg234Cys, Arg144Cys in the triple helical domain of COL1A2. Substitutions of arginine by cysteine in the triple helical domain of proα1(I) chains has been associated with classical forms of Ehlers–Danlos syndrome (15). We have not been able to study parental samples to determine if there is a phenotype associated with the COL1A2 sequence alteration. The 38 novel, causative mutations represent a 24% addition to the set of 159 distinct mutations in COL1A1 and COL1A2 associated with lethal forms of OI published by the OI consortium (99 and 44 unique substitutions for glycine in αl(I) and α2(I), respectively, and seven and nine unique splicing mutations in COL1A1 and COL1A2, respectively) (5). As seen previously, the most common mutations are missense mutations that result in substitution for triple helical glycine residues and represent 80 and 60% of the observed mutations in COL1A1 and COL1A2, respectively, in this set. Twice as many were observed in COL1A1 as in COL1A2 (26 versus 13), consistent with the set of mutations reported by the OI consortium, in which 67% of the lethal glycine missense mutations occurred in COL1A1 (5). We also identified mutations affecting splice sites, insertions and deletions in the triple helical region of multiples of three amino acids and mutations in repetitive regions, all of which have been discussed previously regarding OI (5,16). Sixteen of the 18 mutations previously seen were reported in at least one case of lethal OI. Although c.3423+2T>A (IVS47+2T>A) in COL1A1 has not been described previously, mutation of the adjacent G of the obligate GT at the donor site (c.3423+1G>A, IVS47+1G>A) was found in one patient with lethal OI (5). The data set also includes some unique features. The COL1A2 mutation c.1369_1370GG>CT, p.Gly457Leu is the first observed leucine for glycine substitution in a type I collagen gene and the only known example of a dinucleotide mutation in a glycine codon in this gene. The c.700C>T, p.Arg234Cys mutation is the first mutation in which arginine is replaced by cysteine in the triple helical domain of proα2(I). Four additional individuals with substitutions for glycine in proα1(I) chains also had a second amino substitution that affected a non-glycine residue in proα1(I) chains, three in the triple helical region (p.Glu288Ala, Glu110Ala in the triple helix, p.Ala390Thr, Ala212Thr in the triple helix and p.Asn855His, Asn677His in the triple helix) and one in the N-terminal propeptide (p.Pro146Thr). None of these variations have been reported previously although the p.Ala390Thr was seen in one other individual in the diagnostic setting (unpublished data). The contribution of the non-glycine substitution to the phenotype is not known; however, p.Glu288Lys (c.862G>A) was observed in a patient with OI type I who also had a p.Asp1219Glu in the C-propeptide cleavage site, as well as in the patient's unaffected parent (17), suggesting that substitution of the glutamic acid, at least with lysine, does not cause lethal disease. It is not known whether the pairs of mutations are co-allelic.

Phenotype prediction for glycine substitutions in proα1(I)

We are developing a computational method for predicting OI phenotype of collagen mutations. The mutations reported here provide an important opportunity to evaluate the current implementation of the model, which was trained on the OI consortium mutations (5), on new data. Of the 29 patients with mutations resulting in substitution of a triple helical glycine in proα1(I) chains, the clinical outcome in 26 (90%) is predicted correctly (Table 4). Of the 17 unique, novel mutations not included in the training set, 15 (88%) were classified correctly. The model can also be applied to serine and cysteine substitutions for glycine in the triple helical region of proα2(I) but no novel mutations of this type were identified. The three COL1A1 mutations with incorrect predictions are (i) p.Gly353Asp, Gly175Asp in the triple helix, which lies one glycine N-terminal to the model's cutoff between lethal and non-lethal, (ii) a rare lethal alanine substitution and (iii) p.Gly866Ser, Gly688Ser in the triple helix, which is non-lethal in five of six previously observed cases.
Table 4.

Prediction of lethality for COL1A1 glycine missense mutations

SubjectMutationTriple helix positionLethality in training setaPredictionCorrect?
F1p.Gly353Asp175naNon-lethalNo
E6p.Gly368Val190naLethalYes
E1p.Gly425Ser247LethalLethalYes
H2p.Gly455Asp277naLethalYes
F5p.Gly470Val292naLethalYes
D8p.Gly509Val331naLethalYes
G4p.Gly548Ala370naNon-lethalNo
A1p.Gly602Arg424naLethalYes
G1p.Gly602Arg424naLethalYes
D7p.Gly605Asp427naLethalYes
G7p.Gly614Arg436naLethalYes
A7p.Gly740Arg562naLethalYes
E4p.Gly809Ser631LethalLethalYes
E8p.Gly824Arg646naLethalYes
H4p.Gly845Arg667LethalLethalYes
H5p.Gly848Arg670naLethalYes
B1p.Gly866Ser688Non-lethalNon-lethalNo
B7p.Gly875Ser697LethalLethalYes
E5p.Gly884Ser706naLethalYes
G2p.Gly884Ser706naLethalYes
E3p.Gly896Asp718naLethalYes
C5p.Gly947Cys769LethalLethalYes
G3p.Gly977Asp799naLethalYes
B3p.Gly1001Cys823LethalLethalYes
B2p.Gly1022Val844LethalLethalYes
F3p.Gly1022Val844LethalLethalYes
B5p.Gly1055Asp877naLethalYes
G8p.Gly1094Ser916naLethalYes
C6p.Gly1100Asp922LethalLethalYes

ana, not applicable.

Prediction of lethality for COL1A1 glycine missense mutations ana, not applicable.

LEPRE1 and CRTAP mutations

We found no mutations in type I collagen genes in four samples and in three of those we identified mutations in genes in which mutations were recently recognized to give rise to recessive forms of severe/lethal OI. In two of those samples, D1 and D3, there is homozygosity for a recently recognized splice site mutation in the LEPRE1 gene (c.1080+1G>T, IVS5+1G>T) that is of West African origin and is a cause of lethal OI in the African-American community in the USA (1). The ethnic origin of the two families was not stated. In a third infant, C3, there was homozygosity for a splice site mutation in the CRTAP gene (c.471+2C>A, IVS1+2C>A). No consanguinity was reported in the parents. Based on the small number of patients in our set with mutations in the LEPRE1 and CRTAP genes, an estimate of the proportion of OI type II patients with recessive mutations in these two genes is 4.8% (90% confidence interval 1.5–12.4%). This extends an earlier estimate that CRTAP mutations cause ∼2–3% of cases of lethal OI (2). Because of the measured frequency of LEPRE1 heterozygosity in the African-Americans in the USA (18,19), these mutations may account for a significant proportion of the severe forms of OI in the USA among that group.

DISCUSSION

We have identified mutations in samples from 62 subjects with OI type II in whose DNA we sequenced the complete coding regions and flanking intronic regions of the two type I collagen genes, COL1A1 and COL1A2. In these samples, 37 had causative mutations in COL1A1 and 22 in COL1A2, of which 38 distinct mutations had not been reported previously, a 24% addition to the published set of unique missense and splicing mutations in these two genes associated with a lethal phenotype (5). Among the four individuals in whom we did not identify mutations in type I collagen, we found one who was homozygous for a CRTAP inactivating mutation and two who were homozygous for a known LEPRE1 mutation common in the African-American population derived from West Africa. One sample remains incompletely characterized but these cells make abnormal type I collagen molecules. Since the samples were selected by two criteria, a clinical picture consistent with a lethal OI phenotype (although some were identified by ultrasound in the second trimester) and cells that made overmodified type I collagens, these results provide an estimate of the relative frequencies of the spectrum of mutations associated with lethal OI. The data suggest that there are multiple pathways that lead to lethality. We identified mutations in OI type II subjects likely to cause impaired post-translational modification of procollagen either because of (i) altered sequences in the triple helical domain of the proα chains of type I collagen, (ii) impaired chain association as a result of mutations in the carboxyl-terminal propeptide or (iii) alterations in helix propagation that result from failure to bring the prolyl cis–trans isomerase to the molecule (e.g. CRTAP and LEPRE1 mutations) (1–3,18,20). Mutations that alter the sequences of the triple helical domains of the chains of type I procollagen can affect the thermal stability of the molecules (12,21), interfere with secretion efficiency (22), disrupt the chain register (23) or interfere with ligand binding or other interaction sites (5,24). Models relating genotype to phenotype must capture the diversity of these mechanisms. We allow for this complexity by using a composite predictive model (12). The high accuracy in predicting the lethality of the newly sequenced mutations is consistent with the hypothesis that substitutions of glycine by serine and mutations that substitute bulky amino acids for glycine carboxy-terminal to residue 178 in the triple helical region of proα1(I) differ in the manner or extent to which they disrupt collagen structure or function. Triple helix position 178 was modeled as the end of the predominantly non-lethal N-terminal region based on available data. The newly identified lethal mutation Gly175Asp suggests that position 175 may be a better cutoff. Because the current data set is limited to OI type II mutations, a more complete evaluation of the model requires non-lethal cases. Additional mutations are needed to refine other imprecise aspects of the model including whether the cutoff position depends on the residue replacing glycine and the determinant of lethality of substitutions of glycine by cysteine in the proα1(I) chains. The use of the composite model framework will allow incorporation of other features that may contribute to lethality, such as individual variation and sites of functional importance, when sufficient data are available. Exceptions to this model may have implications for our understanding of collagen biology. For example, the model misclassifies Gly370Ala in the triple helix of proα1(I) as non-lethal. This suggests that a region around residue 370 may have important function and that its disruption can be lethal. Indeed, substitution of the glycine at 376 by alanine is lethal (5). This region overlaps a binding site for a small proteoglycan (decorin) on collagen molecules (25,26), but it seems unlikely that disturbance of binding of decorin, by itself, is lethal as a decorin knockout mouse does not have OI, despite dramatic alterations in fibrillogenesis (27). It was proposed recently that some regions of the triple helical domain of type I collagen molecules may lack mutations or have exclusively lethal mutations because disruption interferes with a critical function (although these functions have not been clearly identified) (5). Four of the novel mutations in COL1A1, which result in the substitutions Gly706Ser, Gly718Asp, Gly799Asp and Gly916Ser in the triple helical domain, lie within ‘lethal-only’ regions that overlap the proposed major ligand binding regions MLBR2 and MLBR3 (5,26). Interestingly, Gly688Ser, present in both lethal and non-lethal cases, lies near the edge of MLBR2. In COL1A2, three novel mutations were identified, which fall into lethal-only regions 2 (Gly472Val), 6 (Gly766Val) and 7 (Gly865Asp). In contrast, mutations in COL1A1 that alter residues in triple helix regions 328–346 and 418–436, proposed to be of critical importance due to a lack of observed mutations (5), contain four mutations in our data set (Gly331Val, Gly424Arg, Gly427Arg and Gly436Arg), suggesting that the previous absence of mutations was due to sampling. It is clear that to this point sampling of the almost 2000 possible substitutions of triple helical glycines by other residues in each chain is sufficiently incomplete that it is difficult to draw broad conclusions about the function of particular regions of the triple helical domain based on the absence of mutations. Of the substitutions for glycine in the triple helical domain encoded by COL1A1 identified here, 16 of the 26 distinct mutations are novel. According to published models (5,9,12), almost all substitutions of glycine by valine, aspartic acid, glutamic acid, arginine and tryptophan between residues 178 and 1012 of the triple helix in proα1(I) chains should be lethal. Using codon substitution tables, this suggests that a minimum of 900 of the possible non-synonymous mutations in this region will have a lethal effect. Only 100 distinct lethal glycine substitutions in this region were reported previously, including 49 replacements by alanine, serine and cysteine (5). With the relatively low coverage of these mutations to date, most newly studied infants with lethal OI will have previously unrecognized mutations. Mutations in COL1A2 are less likely to be lethal, and it has been more difficult to locate the domains in which they occur. Of the mutations identified here, 13 of the COL1A2 mutations resulted in substitutions for glycine. Of these, six have been previously encountered in infants with lethal OI but the remaining seven were seen for the first time and six of them fell in domains of the triple helix not previously thought to carry lethal mutations. This suggests that there are still too few lethal mutations, or mutations overall, in the COL1A2 gene to accurately predict the outcome on the basis of any proposed model. Nearly 10% of the subjects in this study (five of 59) with a type I collagen mutation have a second missense alteration, including one individual with a mutation in both COL1A1 and COL1A2. Although the glycine substitution itself is probably sufficient for lethality in these cases, the additional variations may influence the phenotype resulting from less disruptive mutations. Complete sequencing of both genes and reporting of all sequence variations found will be important for assessing the role of multiple mutations in the development of disease. These studies indicate that recurrence in families in which a child with lethal OI is born is increased as a result of parental carrier status for recessive mutations in addition to the well-known risk of parental mosaicism for dominant mutations. For mutations in the COL1A1 gene, it is clear that substitution of glycine residues by large amino acids carboxyl-terminal to position 178 in the triple helical domain of the proα1(I) chains carries a high risk of lethal outcome, something that can be transferred to clinical utility. It is clear that many effects of mutations on behavior of collagen molecules need to be taken into account when determining how they result in lethal mutations. Even in the absence of well-defined pathways and mechanisms, the accumulation of mutations with associated clinical data is an important part of providing the basis for rational counseling of families and represents the first step in forming a truly informative model for how mutations result in phenotypes.

MATERIALS AND METHODS

Subject population

We selected 64 DNA samples from unrelated individuals from the Connective Tissue Biopsy Program Repository, an IRB approved activity at the University of Washington that allows further analysis of stored samples. The referral diagnosis for all selected samples was a lethal form of OI. For all samples we had screened cultured fibroblasts for the production of and structure of type I procollagen (22) and cells from all made abnormal proteins. One sample had DNA that failed to amplify (D5). Of the remaining 63, 27 were from fetal samples of <24 weeks gestation. In all instances, the ultrasound picture or post-termination or post-spontaneous delivery radiographs were consistent with the diagnosis of OI type II–minimal calvarial mineralization, very short and bowed limbs, multiple rib fractures. Among the remaining 36 samples, there was no clinical information for one, one that left the hospital at 15 days and for whom there was no follow-up information, and for the remainder there were radiographs or clinical photographs consistent with the diagnosis of a lethal form of OI, with the usual diagnosis of OI type II. Patient descriptions are summarized in Supplementary Material, Table S3. Data on ethnic background were absent for virtually all samples. For two of the samples (C3 and D1) there was a record of familial recurrence of lethal OI. For two others (A3, C2), it was noted that the father had a milder form of OI, OI type III or III/IV. No consanguinity was noted in either family. No prior history of OI was noted in any of the remaining instances.

Collagen gene sequencing and variant detection

Genomic DNA was isolated from peripheral blood using standard extraction protocols. For each patient, a total of 14 and 23 kb of genomic DNA for COL1A1 and COL1A2, respectively, were sequenced including all coding regions, intron sequences for at least 100 bp on either side of each exon and evolutionarily conserved domains in non-coding regions (>75% sequence identity between mouse and human). Primer sequences, PCR conditions and sequence coordinates of each fragment are published elsewhere (13). All primers were synthesized by Integrated DNA Technologies (Coralville, IA, USA). Sequence traces were aligned with the GenBank reference sequences of the COL1A1 genomic DNA (AF017178) and cDNA (NM_000088.3), and the COL1A2 genomic DNA (AF004877.1) and cDNA (NM_000089.3). Sequence variations were identified and confirmed by detection with two different analysis programs: Sequencher v4.8 (GeneCodes Corp., Ann Arbor, MI, USA) and Mutation Surveyor v3.1 (SoftGenetics LLC., State College, PA, USA). Automatic calls were verified by inspection. All newly identified variants were deposited in dbSNP (14) under the submitter handle UCSF_HG. All mutations identified were confirmed by resequencing of genomic DNA taken from reserved samples. For those samples in which mutations were not detected in the initial phases of analysis, collagen cDNA was prepared, amplified in overlapping fragments of about 1–1.5 kb and examined by PAGE. For sample C2, the cDNA was amplified in two fragments to identify the extent of the genomic deletion. If no mutation was identified in cDNA, genomic DNA of CRTAP and LEPRE1 was amplified as described previously (18) and sequenced. Collagen mutations are numbered following the HGVS-approved convention (http://www.hgvs.org/mutnomen/recs.html), which starts with the translation initiator methionine as amino acid +1, and the A of the ATG codon as nucleotide +1. Mutations affecting intronic sequence are referenced to the cDNA sequence. Triple helix positions are provided for amino acids within the triple helical region of each proα chain (residues encoded by codons 179–1192 of the COL1A1 transcript and 91–1104 of COL1A2). The exons of the COL1A1 gene are numbered consecutively from 1–32. Exon 33 of the gene is referred to as exon 33–34 by tradition, to allow the codons of the helical domain to be similar between the two genes and exons thereafter continue from 35 to 52. Exons of COL1A2 are numbered consecutively. Each identified variation was uniquely classified as either a SNP, causative mutation, or non-synonymous sequence variant of unknown significance. All substitutions for glycine within the triple helical domain, mutations that alter consensus splice sites, insertion or deletion of residues within the triple helical domain, and mutations that result in sequence alterations in the carboxyl-terminal propeptides and that had been previously identified in individuals with OI or shown to alter molecular behavior were considered causative mutations. p.Ala390Thr, p.Glu288Ala, p.Asn855His and p.Pro146Thr in the proα1(I) chain are categorized as rare non-synonymous sequence variants of unknown significance. Variations classified as SNPs are detailed in Supplementary Material, Tables S1 and S2. SNPs include synonymous variations and the non-synonymous rs1800215 (p.Ala1075Thr) in COL1A1 and rs42524 (p.Ala549Pro) in COL1A2. SNPs are considered novel if they are not listed in build 129 of dbSNP (14); novel mutations are those not reported in collagen mutation databases (28, Bodian and Klein, manuscript in preparation).

CRTAP and LEPRE1 mutation detection

Each exon of the CRTAP and LEPRE1 genes was amplified and then sequenced as previously described (3,18). All mutations were confirmed by reamplification and sequencing of a new genomic DNA sample.

Lethality prediction

Lethality was predicted using a published method applicable to single missense mutations altering the Gly-X-Y glycines in the triple helical region of the proα1(I) chain (12). One component of the model is a decision tree that classifies mutations within the N-terminal 178 residues of the triple helix as non-lethal. Mutations C-terminal to position 178 are classified by the amino acid substituting for glycine. Arginine, valine, aspartic acid and glutamic acid substitutions are predicted to be lethal, whereas alanine replacements are predicted to be non-lethal. Cysteine substitutions are tentatively classified as lethal C-terminal to triple helix position 688. Serine substitutions for glycine are classified by a second component based on an estimate of the thermostability of the Gly-X-Y triplet C-terminal to the triplet with the substitution. The models were constructed using the OI triple helical glycine substitutions mutations published by the OI consortium (5) as a training set. Predictions for p.Gly884Ser (Gly706Ser) and p.Gly1094Ser (Gly916Ser) based on preliminary sequencing were published previously (12). For subjects with multiple variations, the triple helical glycine substitutions are assumed to be the primary determinant of OI lethality.

Statistical analysis

Confidence intervals were computed using exact distributions as implemented in R (29).

SUPPLEMENTARY MATERIAL

Supplementary Material is available at . Conflict of Interest statement. None declared.

FUNDING

This work was supported by the National Institutes of Health (AR051582 to T.E.K and T32HL007731 to T.F.C); and the Osteogenesis Imperfecta Foundation and the Children's Brittle Bone Foundation (to D.L.B.). Funding to Pay the Open Access Charge was provided by the National Institutes of Health AR051582.
  36 in total

1.  dbSNP: the NCBI database of genetic variation.

Authors:  S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Deletions and duplications of Gly-Xaa-Yaa triplet repeats in the triple helical domains of type I collagen chains disrupt helix formation and result in several types of osteogenesis imperfecta.

Authors:  J M Pace; M Atkinson; M C Willing; G Wallis; P H Byers
Journal:  Hum Mutat       Date:  2001-10       Impact factor: 4.878

3.  Altered triple helical structure of type I procollagen in lethal perinatal osteogenesis imperfecta.

Authors:  J Bonadio; K A Holbrook; R E Gelinas; J Jacob; P H Byers
Journal:  J Biol Chem       Date:  1985-02-10       Impact factor: 5.157

4.  Thirty-three novel COL1A1 and COL1A2 mutations in patients with osteogenesis imperfecta types I-IV.

Authors:  L M Ward; L Lalic; P J Roughley; F H Glorieux
Journal:  Hum Mutat       Date:  2001-05       Impact factor: 4.878

5.  Nonsense mutations in the COL1A1 gene preferentially reduce nuclear levels of mRNA but not hnRNA in osteogenesis imperfecta type I cell strains.

Authors:  R L Slayton; S P Deschenes; M C Willing
Journal:  Matrix Biol       Date:  2000-02       Impact factor: 11.583

6.  Destabilization of osteogenesis imperfecta collagen-like model peptides correlates with the identity of the residue replacing glycine.

Authors:  K Beck; V C Chan; N Shenoy; A Kirkpatrick; J A Ramshaw; B Brodsky
Journal:  Proc Natl Acad Sci U S A       Date:  2000-04-11       Impact factor: 11.205

7.  Mapping the ligand-binding sites and disease-associated mutations on the most abundant protein in the human, type I collagen.

Authors:  Gloria A Di Lullo; Shawn M Sweeney; Jarmo Korkko; Leena Ala-Kokko; James D San Antonio
Journal:  J Biol Chem       Date:  2001-11-09       Impact factor: 5.157

8.  CRTAP and LEPRE1 mutations in recessive osteogenesis imperfecta.

Authors:  Dustin Baldridge; Ulrike Schwarze; Roy Morello; Jennifer Lennington; Terry K Bertin; James M Pace; Melanie G Pepin; Maryann Weis; David R Eyre; Jennifer Walsh; Deborah Lambert; Andrew Green; Haynes Robinson; Melonie Michelson; Gunnar Houge; Carl Lindman; Judith Martin; Jewell Ward; Emmanuelle Lemyre; John J Mitchell; Deborah Krakow; David L Rimoin; Daniel H Cohn; Peter H Byers; Brendan Lee
Journal:  Hum Mutat       Date:  2008-12       Impact factor: 4.878

9.  Heterozygosity for a large deletion in the alpha 2(I) collagen gene has a dramatic effect on type I collagen secretion and produces perinatal lethal osteogenesis imperfecta.

Authors:  M C Willing; D H Cohn; B Starman; K A Holbrook; C R Greenberg; P H Byers
Journal:  J Biol Chem       Date:  1988-06-15       Impact factor: 5.157

10.  Subtle structural alterations in the chains of type I procollagen produce osteogenesis imperfecta type II.

Authors:  J Bonadio; P H Byers
Journal:  Nature       Date:  1985 Jul 25-31       Impact factor: 49.962

View more
  38 in total

1.  Deficiency of CRTAP in non-lethal recessive osteogenesis imperfecta reduces collagen deposition into matrix.

Authors:  M Valli; A M Barnes; A Gallanti; W A Cabral; S Viglio; M A Weis; E Makareeva; D Eyre; S Leikin; F Antoniazzi; J C Marini; M Mottes
Journal:  Clin Genet       Date:  2011-10-19       Impact factor: 4.438

2.  Mutations in PPIB (cyclophilin B) delay type I procollagen chain association and result in perinatal lethal to moderate osteogenesis imperfecta phenotypes.

Authors:  Shawna M Pyott; Ulrike Schwarze; Helena E Christiansen; Melanie G Pepin; Dru F Leistritz; Richard Dineen; Catharine Harris; Barbara K Burton; Brad Angle; Katherine Kim; Michael D Sussman; Maryann Weis; David R Eyre; David W Russell; Kevin J McCarthy; Robert D Steiner; Peter H Byers
Journal:  Hum Mol Genet       Date:  2011-01-31       Impact factor: 6.150

Review 3.  New perspectives on osteogenesis imperfecta.

Authors:  Antonella Forlino; Wayne A Cabral; Aileen M Barnes; Joan C Marini
Journal:  Nat Rev Endocrinol       Date:  2011-06-14       Impact factor: 43.330

4.  Genetic analysis of osteogenesis imperfecta in the Palestinian population: molecular screening of 49 affected families.

Authors:  Osama Essawi; Sofie Symoens; Maha Fannana; Mohammad Darwish; Mohammad Farraj; Andy Willaert; Tamer Essawi; Bert Callewaert; Anne De Paepe; Fransiska Malfait; Paul J Coucke
Journal:  Mol Genet Genomic Med       Date:  2017-11-18       Impact factor: 2.183

Review 5.  Genotype-phenotype correlations in pathology caused by collagen type IV alpha 1 and 2 mutations.

Authors:  Marion Jeanne; Douglas B Gould
Journal:  Matrix Biol       Date:  2016-10-26       Impact factor: 11.583

6.  COL1A1 C-propeptide mutations cause ER mislocalization of procollagen and impair C-terminal procollagen processing.

Authors:  Aileen M Barnes; Aarthi Ashok; Elena N Makareeva; Marina Brusel; Wayne A Cabral; MaryAnn Weis; Catherine Moali; Emmanuel Bettler; David R Eyre; John P Cassella; Sergey Leikin; David J S Hulmes; Efrat Kessler; Joan C Marini
Journal:  Biochim Biophys Acta Mol Basis Dis       Date:  2019-05-02       Impact factor: 5.187

7.  Genotype-phenotype correlations in nonlethal osteogenesis imperfecta caused by mutations in the helical domain of collagen type I.

Authors:  Frank Rauch; Liljana Lalic; Peter Roughley; Francis H Glorieux
Journal:  Eur J Hum Genet       Date:  2010-01-20       Impact factor: 4.246

8.  Differential response to intracellular stress in the skin from osteogenesis imperfecta Brtl mice with lethal and non lethal phenotype: a proteomic approach.

Authors:  Laura Bianchi; Assunta Gagliardi; Roberta Gioia; Roberta Besio; Chiara Tani; Claudia Landi; Maria Cipriano; Anna Gimigliano; Antonio Rossi; Joan C Marini; Antonella Forlino; Luca Bini
Journal:  J Proteomics       Date:  2012-02-18       Impact factor: 4.044

9.  Generalized connective tissue disease in Crtap-/- mouse.

Authors:  Dustin Baldridge; Jennifer Lennington; MaryAnn Weis; Erica P Homan; Ming-Ming Jiang; Elda Munivez; Douglas R Keene; William R Hogue; Shawna Pyott; Peter H Byers; Deborah Krakow; Daniel H Cohn; David R Eyre; Brendan Lee; Roy Morello
Journal:  PLoS One       Date:  2010-05-11       Impact factor: 3.240

10.  A missense mutation in the SERPINH1 gene in Dachshunds with osteogenesis imperfecta.

Authors:  Cord Drögemüller; Doreen Becker; Adrian Brunner; Bianca Haase; Patrick Kircher; Frank Seeliger; Michael Fehr; Ulrich Baumann; Kerstin Lindblad-Toh; Tosso Leeb
Journal:  PLoS Genet       Date:  2009-07-24       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.