| Literature DB >> 31680973 |
Jingru Shi1, Meng Ren1, Jinmeng Jia1, Muxue Tang1, Yongli Guo2,3,4, Xin Ni2,3,4, Tieliu Shi1,2.
Abstract
Osteogenesis imperfecta (OI), mainly caused by structural abnormalities of type I collagen, is a hereditary rare disease characterized by increased bone fragility and reduced bone mass. Clinical manifestations of OI mostly include multiple repeated bone fractures, thin skin, blue sclera, hearing loss, cardiovascular and pulmonary system abnormalities, triangular face, dentinogenesis imperfecta (DI), and walking with assistance. Currently, 20 causative genes with 18 subtypes have been identified for OI, of them, variations in COL1A1 and COL1A2 have been demonstrated to be major causative factors to OI. However, the complexity of the bone formation process indicates that there are potential new pathogenic genes associated with OI. To comprehensively explore the underlying mechanism of OI, we conducted association analysis between genotypes and phenotypes of OI diseases and found that mutations in COL1A1 and COL1A2 contributed to a large proportion of the disease phenotypes. We categorized the clinical phenotypes and the genotypes based on the variation types for those 155 OI patients collected from literature, and association study revealed that three phenotypes (bone deformity, DI, walking with assistance) were enriched in two variation types (the Gly-substitution missense and groups of frameshift, nonsense, and splicing variations). We also identified four novel variations (c.G3290A (p.G1097D), c.G3289C (p.G1097R), c.G3289A (p.G1097S), c.G3281A (p.G1094D)) in gene COL1A1 and two novel variations (c.G2332T (p.G778C), c.G2341T (p.G781C)) in gene COL1A2, which could potentially contribute to the disease. In addition, we identified several new potential pathogenic genes (ADAMTS2, COL5A2, COL8A1) based on the integration of protein-protein interaction and pathway enrichment analysis. Our study provides new insights into the association between genotypes and phenotypes of OI and novel information for dissecting the underlying mechanism of the disease.Entities:
Keywords: genotype; novel candidate pathogenic genes; novel candidate pathogenic variations; osteogenesis imperfecta; phenotype
Year: 2019 PMID: 31680973 PMCID: PMC6803541 DOI: 10.3389/fphar.2019.01200
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Figure 1Detailed flow charts for the analysis process. (A) A flowchart for OI-related pathogenic variations prediction. (B) A flowchart for OI-related genes prediction.
Statistical analysis result on genotype–phenotype correlation and phenotype–phenotype correlation.
| (A) Relationship between clinical characteristics and different variation types (Gly-substitution mutation vs. Frameshift, nonsense, and splicing mutation) in | ||||
|---|---|---|---|---|
| Phenotype | Gly-substitution missense (n = 59) | Other variation (Frameshift, nonsense, and splicing) (n = 70) | Total case (Gly/other) |
|
|
| 23.7%/23.7%/52.6% | 80%/2%/18% | 38/50 |
|
| Gender(F/M) | 54.2%/45.8% | 58.6%/41.4% | 59/70 | 0.7218 |
|
| 40.7%/59.3% | 97.1%/2.9% | 58/70 |
|
|
| 80% | 58.50% | 45/53 |
|
| Hypermobile joints | 83.3% | 100% | 12/3 | 1 |
| Dense metaphyseal bands | 81.8% | 50.0% | 11/2 | 0.4231 |
| Vertebral anomalies | 51.1% | 31.3% | 45/48 | 0.06021 |
| Vertebral fracture | 40.0% | 42.1% | 20/19 | 1 |
| Osteopenia | 100% | 100% | 1/2 | 1 |
|
| 56.9% | 35.2% | 51/54 |
|
| Blue sclera | 70.7% | 77.1% | 58/72 | 0.4243 |
| Hearing loss | 14.0% | 18.5% | 50/54 | 0.6024 |
|
| 31.0% | 0% | 29/42 |
|
| Triangular face | 25.8% | 16.7% | 31/42 | 0.3889 |
| Popcorn calcif | 25.0% | 0% | 12/2 | 1 |
| (B) Relationship between dentinogenesis imperfecta and bone deformity/vertebral anomalies. Phenotype with significant difference ( | ||||
| Phenotype | Dentinogenesis imperfecta (Y) | Dentinogenesis imperfecta (N) | Total case (DI(Y)/DI(N)) |
|
|
| 85.10% | 52.90% | 47/51 |
|
|
| 53.50% | 30% | 43/50 |
|
Figure 2OI-related candidate mutations in COL1A1 and COL1A2. (A) OI-related candidate mutations in COL1A1 (“*”: currently unknown). (B) OI-related candidate mutations in COL1A2 (“*”: currently unknown). (C) Localization of variants c.G2332T (p.G778C) and c.G2341T (p.G781C) in COL1A2 protein sequence. Predicted pathogenic variations are represented in bold font (Eight orange rectangles with black border denote eight lethal spaces).
Figure 3(A) Network of OI pathogenic genes and candidate genes. 93 interactions between 20 OI pathogenic genes and 18 interacted genes. The circles and triangles represent 20 OI pathogenic genes and 18 interacted genes (≥1 interaction), respectively. Yellow triangles denote four predicted genes (= 1 interaction). Green and blue triangles totally denote 14 predicted genes (the predicted gene set A) (≥2 interactions), and blue triangles denote three candidate genes (gene expression is consistent with OI pathogenic genes), respectively. Red circles denote seven OI pathogenic genes (interact with three candidate genes). Grey lines denote 15 interactions between predicted genes. Green lines and red lines totally denote 78 interactions between OI pathogenic genes and predicted genes, and red lines denote 11 interactions between three candidate genes and seven interacted OI pathogenic genes, respectively. (B) The tissue expression distribution of predicted genes and their interacted pathogenic genes. The green, yellow, red, and blue lines denote candidate gene ADAMTS2, COL5A2, COL8A1 and interacted pathogenic genes (COL1A1, COL1A2, BMP1, PPIB, SERPINH1, P3H1, CRTAP). All these genes are expressed highly in “Cells_Transformed_fibroblasts” and have a similar expression distribution among all human tissues.
Significant pathways with related pathogenic genes and predicted genes.
| Significant Pathways | 20 Pathogenic Genes and 18 Interacted Genes |
|
|---|---|---|
| Epithelial to mesenchymal transition in colorectal cancer |
| 0.00083567 |
| Focal Adhesion-PI3K-Akt-mTOR-signaling pathway |
| 0.0010862 |
| miR-509-3p alteration of YAP1/ECM axis | COL1A1, SPARC | 0.0011611 |
| Focal Adhesion |
| 0.0018166 |
| Inflammatory Response Pathway | COL1A1, COL1A2 | 0.0035058 |
| miRNA targets in ECM and membrane receptors |
| 0.0059006 |
| PI3K-Akt Signaling Pathway |
| 0.012283 |
| Sterol Regulatory Element-Binding Proteins (SREBP) signaling | MBTPS2, SEC24D | 0.01637 |
| Hypothetical Craniofacial Development Pathway | WNT1 | 0.021586 |
| ncRNAs involved in Wnt signaling in hepatocellular carcinoma | SERPINF1, WNT1 | 0.023282 |
| LncRNA involvement in canonical Wnt signaling and colorectal cancer | SERPINF1, WNT1 | 0.031757 |
| Senescence and Autophagy in Cancer | COL1A1, SPARC | 0.034042 |
| EDA Signaling in Hair Follicle Development | BMP1 | 0.037488 |
| Osteoblast Signaling | COL1A1 | 0.037488 |
| Wnt Signaling Pathway | SERPINF1, WNT1 | 0.040029 |
| Canonical and Non-Canonical TGF-B signaling | BMP1 | 0.045347 |
| Regulation of Wnt/B-catenin Signaling by Small Molecule Compounds | WNT1 | 0.047953 |
| Adipogenesis | BMP1, WNT1 | 0.04837 |
Eighteen significant pathways (P < 0.05) were identified with 20 OI causative genes and their 18 interacting gene partners from protein–protein interaction network. Candidate genes (the predicted gene set B) are represented in bold font.
Figure 4(A) Type I collagen synthesis and application. (1) Formation of three alpha chains (two α1(I) chains and one α2(I) chain) (the red dot represents a Gly-substitution missense). (2) Triple helix formation (the blue dashed-line box displays the structure of the triple helix with a smooth loop model, in which all glycine residues are shown in ball-and-stick representation; the red box indicates abnormality of the triple helix after the Gly replacement). (3) Extracellular cleavage of the N-terminus and C-terminus (the red box indicates abnormality of the triple helix after the Gly replacement). (4) Cross-linking of type I collagen molecules. (5) Assembly of collagen fibrils to collagen fibers. (6) Collagen fibers participate in the formation of bone and connective tissues. (B) The chemical structural formula of four amino acids (glycine, serine, arginine, asparagic acid). Glycine, which has the smallest relative molecular mass, is the only amino acid with no sidechain.