| Literature DB >> 34234628 |
Khyber Shinwari1, Liu Guojun2, Svetlana S Deryabina1,3, Mikhail A Bolkov1,4, Irina A Tuzankina1,4, Valery A Chereshnev1,4.
Abstract
Hennekam lymphangiectasia-lymphedema syndrome has been linked to single-nucleotide polymorphisms in the CCBE1 (collagen and calcium-binding EGF domains 1) gene. Several bioinformatics methods were used to find the most dangerous nsSNPs that could affect CCBE1 structure and function. Using state-of-the-art in silico tools, this study examined the most pathogenic nonsynonymous single-nucleotide polymorphisms (nsSNPs) that disrupt the CCBE1 protein and extracellular matrix remodeling and migration. Our results indicate that seven nsSNPs, rs115982879, rs149792489, rs374941368, rs121908254, rs149531418, rs121908251, and rs372499913, are deleterious in the CCBE1 gene, four (G330E, C102S, C174R, and G107D) of which are the highly deleterious, two of them (G330E and G107D) have never been seen reported in the context of Hennekam syndrome. Twelve missense SNPs, rs199902030, rs267605221, rs37517418, rs80008675, rs116596858, rs116675104, rs121908252, rs147974432, rs147681552, rs192224843, rs139059968, and rs148498685, are found to revert into stop codons. Structural homology-based methods and sequence homology-based tools revealed that 8.8% of the nsSNPs are pathogenic. SIFT, PolyPhen2, M-CAP, CADD, FATHMM-MKL, DANN, PANTHER, Mutation Taster, LRT, and SNAP2 had a significant score for identifying deleterious nsSNPs. The importance of rs374941368 and rs200149541 in the prediction of post-translation changes was highlighted because it impacts a possible phosphorylation site. Gene-gene interactions revealed CCBE1's association with other genes, showing its role in a number of pathways and coexpressions. The top 16 deleterious nsSNPs found in this research should be investigated further in the future while researching diseases caused CCBE1 gene specifically HS. The FT web server predicted amino acid residues involved in the ligand-binding site of the CCBE1 protein, and two of the substitutions (R167W and T153N) were found to be involved. These highly deleterious nsSNPs can be used as marker pathogenic variants in the mutational diagnosis of the HS syndrome, and this research also offers potential insights that will aid in the development of precision medicines. CCBE1 proteins from Hennekam syndrome patients should be tested in animal models for this purpose.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34234628 PMCID: PMC8211529 DOI: 10.1155/2021/6642626
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Distribution of all SNPs in CCBE1 gene.
Figure 2(a) Gene-gene interaction of CCBE1 with other genes proposed by GeneMANIA. (b) Coexpression in GeneMANIA. (c) Gene-gene interaction of CCBE1 with other genes proposed by STRING.
Genes coexpressed and sharing a domain with CCBE1.
| Gene symbol | Description | Coexpression | Shared domain |
|---|---|---|---|
|
| Collagen type VI alpha 6 | Yes | No |
|
| Matrix remodeling associated 8 | Yes | No |
|
| Pleckstrin homology and FYVE domain containing 2 | Yes | No |
|
| Reprimo, TP53 dependent G2 arrest mediator candidate | Yes | No |
|
| Cadherin 4 | No | No |
|
| Pleckstrin homology and RhoGEF domain containing G1 | Yes | No |
|
| Cullin associated and neddylation dissociated 1 | Yes | No |
|
| Myosin X | Yes | No |
|
| Leucine rich repeat containing 4C | Yes | No |
|
| Lecithin retinol acyltransferase | Yes | No |
|
| Ankyrin 3, node of Ranvier | Yes | No |
|
| Olfactomedin 1 | Yes | No |
|
| Decorin | Yes | Yes |
|
| Neuralized E3 ubiquitin protein ligase 1B | Yes | No |
|
| Pleckstrin homology, MyTH4, and FERM domain containing H2 | Yes | Yes |
|
| Glioma tumor suppressor candidate region gene 2 | Yes | No |
|
| NDRG family member 2 | Yes | No |
nsSNPs predicted as deleterious by SIFT and PolyPhen2.
| ID of nsSNPs | AA position | SIFT | Score | PolyPhen2 | Score | MAF |
|---|---|---|---|---|---|---|
| rs199902030 | D336N | Deleterious | 0.003 | Probably damaging | 1 | <0.001 (T) |
| rs200149541 | T153N | Deleterious | 0.001 | Probably damaging | 1 | |
| rs372499913 | G107D | Deleterious | 0 | Probably damaging | 1 | |
| rs267605221 | P249S | Deleterious | 0.007 | Probably damaging | 1 | |
| rs374941368 | S19N | Deleterious | 0.004 | Probably damaging | 0.981 | |
| rs375717418 | R301W | Deleterious | 0.004 | Probably damaging | 1 | <0.001 (T) |
| rs80008675 | D41E | Deleterious low | 0.016 | Probably damaging | 0.982 | 0.017 (T) |
| rs116596858 | P181S | Deleterious low | 0.007 | Probably damaging | 0.906 | <0.001 (A) |
| rs116675104 | R167W | Deleterious low | 0.017 | Probably damaging | 0.990 | 0.003 (A) |
| rs121908250 | C75S | Deleterious low | 0.002 | Probably damaging | 0.981 | |
| rs121908251 | C102S | Deleterious low | 0 | Probably damaging | 0.999 | |
| rs121908252 | G327R | Deleterious | 0 | Probably damaging | 1 | |
| rs121908254 | C174R | Deleterious | 0.001 | Probably damaging | 0.984 | |
| rs147974432 | T144M | Deleterious low | 0.002 | Probably damaging | 1 | <0.001 (A) |
| rs192224843 | Q353R | Deleterious | 0.011 | Probably damaging | 0.993 | <0.001 (C) |
| rs115982879 | R118L | Deleterious low | 0.001 | Probably damaging | 0.910 | <0.001 (T) |
| rs139059968 | K355T | Deleterious | 0.002 | Probably damaging | 0.883 | <0.001 (G) |
| rs141125426 | D397Y | Deleterious low | 0.002 | Probably damaging | 0.828 | |
| rs147208835 | R125W | Deleterious low | 0 | Probably damaging | 0.995 | |
| rs147681552 | P290L | Deleterious | 0.005 | Probably damaging | 1 | <0.001 (A) |
| rs148498685 | P87S | Deleterious low | 0.002 | Probably damaging | 1 | |
| rs149531418 | G330E | Deleterious | 0 | Probably damaging | 0.999 | |
| rs149792489 | A96G | Deleterious low | 0.004 | Probably damaging | 1 | <0.001 (C) |
Threshold. SIFT: <0.05; PolyPhen2: >0.8 (PSIC > 0.5) or Benign (PSIC < 0.5).
Confirmation of the deleterious nsSNPs by other prediction software.
| AAS | LRT | Mutation Taster | Mutation Assessor | PROVEAN | FATHMM | VEST3 | MetaLR | M-CAP | CADD | DANN | FATHMM-MKK | PhD-SNP | PANTHER | SNP-GO | SNAP2 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| G330E | D | D | H | D | D | D | D | D | D | D | D | D | D | D | E |
| C102S | D | D | M | D | D | D | D | D | D | D | D | D | D | D | E |
| C174R | D | D | H | D | D | D | D | D | D | D | D | D | D | D | E |
| G107D | D | D | L | D | D | D | D | D | D | D | D | D | D | D | E |
| R125W | D | D | L | D | D | T | D | D | D | D | D | D | D | N | E |
| G327R | D | D | H | D | D | D | D | D | D | D | D | N | D | N | E |
| P290L | D | D | M | D | T | D | D | D | D | D | D | N | D | N | E |
| K355T | D | D | M | N | D | D | D | D | D | D | D | D | D | N | E |
| Q353R | D | D | M | N | D | D | D | D | D | D | D | D | D | N | E |
| D336N | D | D | M | N | D | T | D | D | D | D | D | D | D | N | E |
| T153N | D | D | M | N | D | T | D | D | D | D | D | D | D | N | E |
| C75S | D | D | L | D | D | D | D | D | D | T | D | N | D | N | E |
| P87S | D | D | L | N | D | D | D | D | D | D | D | D | D | N | E |
| T144M | D | D | L | N | D | D | D | D | D | D | D | N | D | N | E |
| R118L | D | D | L | D | D | D | T | D | D | D | D | D | D | N | E |
| D397Y | N | D | M | D | D | T | D | D | D | T | D | D | D | N | E |
| R301W | D | D | M | D | T | D | T | D | D | D | D | N | D | N | E |
| P249S | D | D | M | N | T | T | D | D | D | D | D | N | D | N | N |
| D41E | D | P | L | N | D | T | T | D | D | T | D | D | D | N | N |
| S19N | N | P | L | N | D | T | D | D | D | D | D | N | D | N | N |
| R167W | N | D | L | N | D | T | D | D | D | D | D | N | N | N | E |
| A96G | D | D | L | N | T | D | T | D | D | D | D | N | D | N | N |
| P181S | N | D | L | N | T | D | T | D | D | T | D | N | N | N | N |
D: deleterious; T: tolerated; U: unknown; L: low; N: neutral; M: medium; P: polymorphism; E: effect. Thresholds for all these predication tools are given in the S4 fill.
Figure 3Evolutionary conservation of amino acids in the ADA gene determined by the ConSurf server. Value 1 indicates a high variability region. The value increases as the region becomes more conserved, up to value 9.
Figure 4Amino acid alignment of human CCBE1 (UniProt ID: Q6UXH8) along with its homologues in phylogenetically close species in ABWGB and Q3MI99. Solid horizontal bars indicate conserved sequence motifs, and residues with asterisk (∗) mark indicate evolutionary conserved amino acids. The amino acid identities were colored according the Clustal color scheme, and the conservation index at each alignment position was provided by Jalview.
Predication of disease-related AA substitution and phenotypes by Mutpred2.
| SNPs | Actionable/confident hypothesis | Probability |
|
|---|---|---|---|
| C174R | Gain of intrinsic disorder | 0.39 | 0.009 |
| Loss of disulfide linkage at C174 | 0.21 | 0.020 | |
|
| |||
| D336N | Altered disorder interface | 0.29 | 0.02 |
| Loss of loop | 0.26 | 0.05 | |
| Loss of proteolytic cleavage at D336 | 0.11 | 0.05 | |
| Altered coiled coil | 0.11 | 0.04 | |
|
| |||
| G107D | Altered transmembrane protein | 0.29 | 0.0003 |
| Loss of loop | 0.27 | 0.02 | |
| Loss of disulfide linkage at C102 | 0.26 | 0.004 | |
| Gain of proteolytic cleavage at R108 | 0.15 | 0.01 | |
|
| |||
| C102S | Loss of disulfide linkage at C102 | 0.55 | 0.0003 |
| Loss of helix | 0.28 | 0.03 | |
| Loss of pyrrolidone carboxylic acid at Q100 | 0.19 | 0.002 | |
| Altered metal binding | 0.35 | 0.008 | |
| Altered transmembrane protein | 0.32 | 0.00007 | |
|
| |||
| G330E | Loss of B factor | 0.27 | 0.02 |
| Gain of loop | 0.04 | ||
|
| |||
| G327R | Loss of B factor | 0.27 | 0.02 |
|
| |||
| P290L | Altered disordered interface | 0.36 | 0.008 |
| Loss of B factor | 0.29 | 0.01 | |
|
| |||
| Q353R | Altered disordered interface | 0.29 | 0.03 |
| Altered coiled coil | 012 | 0.04 | |
|
| |||
| T153N | Loss of strand | 0.26 | 0.04 |
| Gain of disulfide linkage at C150 | 0.23 | 0.01 | |
|
| |||
| C75S | Altered metal binding | 0.40 | 0.006 |
| Loss of disulfide linkage at C75 | 0.30 | 0.001 | |
| Loss of helix | 0.27 | 0.05 | |
|
| |||
| P87S | Gain of helix | 0.28 | 0.02 |
| Gain of disulfide linkage at C85 | 0.20 | 0.02 | |
| Altered metal binding | 0.25 | 0.03 | |
| Loss of sulfation at Y90 | 0.09 | 0.003 | |
|
| |||
| R118L | Altered disordered interface | 0.27 | 0.40 |
| Loss of disulfide linkage at C113 | 0.19 | 0.02 | |
| Gain of photolytic cleavage at D120 | 0.16 | 0.009 | |
| Loss of sulfation at Y114 | 0.02 | 0.02 | |
|
| |||
| P181S | Gain of phosphorylation at Y180 | 0.37 | 0.007 |
| Loss of acetylation at K179 | 0.21 | 0.03 | |
| Gain of N-linked glycosylation at N182 | 0.03 | 0.03 | |
|
| |||
| A96G | Altered transmembrane protein | 0.30 | 0.0001 |
| Loss of helix | 0.29 | 0.02 | |
| Gain of disulfide linkage at C98 | 0.25 | 0.006 | |
| Gain of pyrrolidone carboxylic at Q100 | 0.20 | 0.002 | |
|
| |||
| P249S | Gain of loop | 0.31 | 0.004 |
| Loss of B factor | 0.29 | 0.02 | |
| Gain of phosphorylation at Y244 | 0.24 | 0.04 | |
| Gain of o-linked glycosylation at P249 | 0.12 | 0.04 | |
iStable 2.0 result for prediction of CCBE1 protein stability due to the selected nsSNPs in CCBE1.
| AAS | Confidence score | Stability |
|---|---|---|
| G330E | −0.002680719 | Increase |
| C174R | 0.021838337 | Increase |
| C102S | −1.2213084 | Decrease |
| G107D | −0.86388123 | Decrease |
| R125W | −0.85255766 | Decrease |
| G327R | 0.0042461157 | Increase |
| P290L | 0.2298831 | Increase |
| K355T | −0.052274585 | Decrease |
| Q353R | 0.8725257 | Increase |
| D336N | −1.2082165 | Decrease |
| T153N | −0.546193 | Decrease |
| C75S | −1.0542232 | Decrease |
| P87S | −1.9976869 | Decrease |
| T144M | 0.23297998 | Increase |
| R118L | −0.5704589 | Decrease |
| D397Y | 0.071232796 | Increase |
| R301W | −0.3441298 | Decrease |
| P249S | −1.1325055 | Decrease |
| D41E | 0.4703572 | Increase |
| S19N | 0.77003396 | Increase |
| R167W | −0.4350294 | Decrease |
| A96G | −0.041893244 | Increase |
Figure 5Secondary structure predication by NetSurfP-2.0.
TM‐score and RMSD values of 23 selected damaging nsSNPs in CCBE1.
| SNP-ID | Residual change | TM-score | RMSD values | SNP-ID | Residual change | TM-score | RMSD values |
|---|---|---|---|---|---|---|---|
| rs199902030 | D336N | 0.92174 | 1.25 | rs121908252 | G327R | 0.91250 | 1.36 |
| rs200149541 | T153N | 0.95561 | 0.84 | rs121908254 | C174R | 0.92822 | 1.46 |
| rs372499913 | G107D | 0.95388 | 1.13 | rs147974432 | T144M | 0.93348 | 1.13 |
| rs267605221 | P249S | 0.91250 | 1.36 | rs192224843 | Q353R | 0.92957 | 1.32 |
| rs374941368 | S19N | 0.92526 | 1.44 | rs115982879 | R118L | 0.92844 | 1.57 |
| rs375717418 | R301W | 0.93696 | 1.02 | rs139059968 | K355T | 0.93921 | 1.01 |
| rs80008675 | D41E | 0.95466 | 1.12 | rs141125426 | D397Y | 0.96008 | 1.06 |
| rs149792489 | A96G | 0.92689 | 1.50 | rs147208835 | R125W | 0.96213 | 0.89 |
| rs116675104 | R167W | 0.93715 | 1.16 | rs147681552 | P290L | 0.92174 | 1.25 |
| rs121908250 | C75S | 0.96248 | 0.90 | rs148498685 | P87S | 0.93523 | 1.14 |
| rs121908251 | C102S | 0.96432 | 1.22 | rs149531418 | G330E | 0.94082 | 1.12 |
Figure 6(a) Wild-type CCBE1 protein structure. (b) Superimposed structure of CCBE1 and its C174R mutant. (c) Superimposed structure of CCBE1 and its A96G mutant. (d) Superimposed structure of CCBE1 and its R118L mutant.
Figure 7(a) Ligand-binding site prediction by FT site in whole predicated 3D protein model of CCBEI gene. (b) 3 meshes were predicted as ligand-binding site.