| Literature DB >> 32260281 |
Arthavan Selvanathan1,2, Cheng Yee Nixon3, Ying Zhu1,4, Luigi Scietti5, Federico Forneris5, Lina M Moreno Uribe6, Andrew C Lidral7, Peter A Jezewski8, John B Mulliken9,10, Jeffrey C Murray11, Michael F Buckley1, Timothy C Cox12, Tony Roscioli1,13,14,15.
Abstract
Pathogenic variants in CDH1, encoding epithelial cadherin (E-cadherin), have been implicated in hereditary diffuse gastric cancer (HDGC), lobular breast cancer, and both syndromic and non-syndromic cleft lip/palate (CL/P). Despite the large number of CDH1 mutations described, the nature of the phenotypic consequence of such mutations is currently not able to be predicted, creating significant challenges for genetic counselling. This study collates the phenotype and molecular data for available CDH1 variants that have been classified, using the American College of Medical Genetics and Genomics criteria, as at least 'likely pathogenic', and correlates their molecular and structural characteristics to phenotype. We demonstrate that CDH1 variant type and location differ between HDGC and CL/P, and that there is clustering of CL/P variants within linker regions between the extracellular domains of the cadherin protein. While these differences do not provide for exact prediction of the phenotype for a given mutation, they may contribute to more accurate assessments of risk for HDGC or CL/P for individuals with specific CDH1 variants.Entities:
Keywords: cadherin 1; cleft lip; cleft palate; gastric cancer; genotype-phenotype correlation; orofacial clefting
Mesh:
Substances:
Year: 2020 PMID: 32260281 PMCID: PMC7231129 DOI: 10.3390/genes11040391
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Numbers of variants by variant type and phenotype.
| Nonsense | Missense | Splice | Total | |
|---|---|---|---|---|
|
| 175 | 11 | 59 | 245 |
|
| 3 | 1 | 4 | 8 |
|
| 3 | 19 | 5 | 27 |
|
| 181 | 31 | 68 | 280 |
HDGC: hereditary diffuse gastric cancer; CL/P: cleft lip/palate.
Figure 1Location of all variants with respect to the epithelial cadherin (E-cadherin) domain structure grouped by mutation type and phenotype (HDGC: hereditary diffuse gastric cancer; CL/P: cleft lip/palate). Variant positions are marked by vertical lines that correspond on the x-axis to the amino acid residue number. The height of the vertical lines corresponds to the number of variants located at that given residue position (y-axis). The color of the vertical lines represents the type of variant: start lost (blue), truncating (orange), missense (green), and in-frame deletion (purple). For each phenotype, variants are grouped by type: start lost and truncating variant (upper); missense and in-frame deletion variants (lower). For reference, numbers on the schematic of the protein represent the start and end residues of each domain.
Figure 2Location of splice variants in CDH1 grouped by phenotype (HDGC: hereditary diffuse gastric cancer; CL/P: cleft lip/palate). Vertical lines correspond to the approximate location of the variants with respect to the gene structure (schematic; x-axis). Exons are shown as boxes: coding region (blue); untranslated regions (grey). Introns (tan) and exons are not drawn to scale. The height of the vertical lines corresponds to the number of variants located at that given position (y-axis). The color of the vertical lines represents the type of variant: splice donor/acceptor variants (red), new splice donor (light blue).
Missense and in-frame variants in each phenotypic group by location.
| S/PP | ER | TM/IC | Total | |
|---|---|---|---|---|
|
| 2 | 8 | 1 | 11 |
|
| 0 | 1 | 0 | 1 |
|
| 0 | 17 | 2 | 19 |
|
| 2 | 26 | 3 | 31 |
S/PP: signal/pro-peptide region; ER: extracellular region; TM/IC: transmembrane and intracellular region.
Figure 3Homologous location of likely pathogenic and pathogenic human missense and in-frame deletion variants causing CL/P (left panel) and HDGC (right panel) on the three-dimensional ectodomain structure of E-cadherin. The extracellular region of mature mouse E-cadherin (PDB 3Q2V) [3], comprised of five EC domains, is shown in grey. The positions of CL/P variants are shown as red spheres (left image); HDGC variants are shown as pink spheres (right image). The location of the CL/P variant in the tryptophan (W156) that is critical for in trans interaction of E-cadherin is shown in orange. Chelated calcium ions are shown as green spheres. Note apparent clustering of CL/P variants around the linker regions, in contrast to the HDGC variants. LP/P—likely pathogenic/pathogenic.
Figure 4Most pathogenic/likely pathogenic CL/P missense variants, but very few HDGC variants, cluster in the linker regions between the extracellular domains of E-cadherin. (A) Residues contributing to each linker are colored on the 3D structure (upper panel in (A)) in teal (EC1-EC2 linker), orange (EC2-EC3 linker), blue (EC3-EC4 linker), and pink (EC4-EC5 linker). The locations of pathogenic/likely pathogenic missense CL/P variants (middle panel) and missense HDGC variants (lower panel) that map to near the respective 3D linker region structures are shown (colored and labeled). (B) Schematic of the E-cadherin primary sequence showing the different domains (rectangles): SS—single sequence; PRO—pro domain; EC1–5—extracellular domains 1–5; TM—transmembrane domain; CYTO—cytoplasmic (intracellular) domain. A total of 15 of the 17 CL/P missense variants found within the extracellular region (and 15 of 19 total) are part of, or one residue adjacent to, the linker regions marked by arrows, as compared to only three (L583R and both F626V) of the 11 HDGC variants. Note: five distinct clusters of amino acids (horizontal colored lines joined by arcs in the schematic), spread across the primary sequence of two adjacent EC domains (grey rectangles) contribute to each of the respective linker regions in the 3D protein structure. As in (A), the clusters of amino acid residues in the schematic are colored in teal, orange, blue, and pink for those contributing to the EC1-EC2 linker, EC2-EC3 linker, EC3-EC4 linker, and EC4-EC5 linker, respectively. The variants indicated in brackets in (B), reside immediately adjacent a defined cluster.