| Literature DB >> 18547406 |
Lise Lotte Christensen1, Bo E Madsen, Friedrik P Wikman, Carsten Wiuf, Karen Koed, Anne Tjønneland, Anja Olsen, Ann-Christine Syvänen, Claus L Andersen, Torben F Orntoft.
Abstract
BACKGROUND: Mutations in the mismatch repair genes hMLH1 and hMSH2 predispose to hereditary non-polyposis colorectal cancer (HNPCC). Genetic screening of more than 350 Danish patients with colorectal cancer (CRC) has led to the identification of several new genetic variants (e.g. missense, silent and non-coding) in hMLH1 and hMSH2. The aim of the present study was to investigate the frequency of these variants in hMLH1 and hMSH2 in Danish patients with sporadic colorectal cancer and in the healthy background population. The purpose was to reveal if any of the common variants lead to increased susceptibility to colorectal cancer.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18547406 PMCID: PMC2438340 DOI: 10.1186/1471-2350-9-52
Source DB: PubMed Journal: BMC Med Genet ISSN: 1471-2350 Impact factor: 2.103
Characteristics of the cohort
| Cases (n = 380) | Sub-cohort (n = 770) | |
| Sex (Number (%)) | ||
| Men | 213 (56%) | 427 (55%) |
| Women | 167 (44%) | 343 (45%) |
| Age* (Years (sd)) | ||
| Men | 58.4 (4.1) | 56.9 (4.5) |
| Women | 58.5 (4.5) | 56.4 (4.4) |
| Time observed (Years (sd)) | ||
| Men | - | 6.6 (1.2) |
| Women | - | 6.6 (1.0) |
*Age at inclusion into the cohort given as mean (sd) in years
MLH1 and MSH2 variants
| Id | Variant | Amino acid change | Identified in the present study | Phylogeny | Pathogenic status | References |
| 23 | c.307-29 C>A | Intronic | + | - | - | - |
| 24 | c.350 C>T | p.Thr117Met | - | Conserved | Pathogen | [11,50] |
| 25 | c.453+79 A>G | Intronic | + | - | - | - |
| 26 | c.545+43 C>G | Intronic | + | - | - | - |
| 27 | c.655 A>G | p.Val219Ile | + | Ile other species | Neutral | [11,40,51] |
| 28 | c.790+10 A>G | Intronic | + | - | - | - |
| 29 | c.884+39 G>A | Intronic | - | - | - | - |
| 30 | c.884+83_84 ins T | Intronic | - | - | - | - |
| 85 | c.1217 G>A | p.Ser406Asn | + | Non-conserved | Neutral | [11,41,52] |
| 31 | c.1379 A>C | p.Glu460Ala | - | Non-conserved | - | - |
| 86* | c.1558+11 G>A | Intronic | - | - | Neutral | [53] |
| 32 | c.1558+14 G>A | Intronic | + | - | Neutral | [40] |
| 37 | c.1668-19 A>G | Intronic | + | - | Neutral | [40] |
| 101 | c.1689 A>G | p.Ile563Met | - | Non-conserved | - | - |
| 33 | c.1732-2 A>T | Intronic | - | - | Pathogen | [54] |
| 34 | c.1852_1853 AA>GC | p.Lys618Ala | + | Non-conserved | Neutral/Pathogen | [9,11,45,46,48] |
| 100 | c.1942 C>T | p.Pro648Ser | - | Conserved | Pathogen | [9,45,55] |
| 35 | c.1959 G>T | p.Leu653Leu | + | - | Neutral | [42] |
| 36 | c.2152 C>T | p.His718Tyr | - | Conserved | Neutral | [11,56] |
| 89 | c.-118 T>C | promoter | + | - | - | [32,33] |
| 87 | c.131 C>T | p.Thr44Met | - | Conserved | Pathogen | [55] |
| 88 | c.134 C>T | p.Ala45Val | - | Val other species | Neutral | [55] |
| 39 | c.212-23 A>C | Intronic | - | - | - | - |
| 90* | c.287 G>A | p.Arg96His | - | Non-conserved | Neutral | [57,58] |
| 91* | c.329 A>G | p.Lys110Arg | - | Non-conserved | Neutral | [57] |
| 92* | c.380 A>G | p.Asn127Ser | - | Conserved | Neutral/Pathogen | [48,59] |
| 93 | c.560 T>G | p.Leu187Arg | - | Conserved | Neutral/Pathogen | [60] |
| 41 | c.965 G>A | p.Gly322Asp | + | Conserved | Neutral | [51,61] |
| 43 | c.1511-9 A>T | Intronic | + | - | - | - |
| 48 | c.1786_1788 del AAT | p.Asn596del | - | Non-conserved | Pathogen | [62,63] |
| 95* | c.2006-6 T>C/G | Intronic | - | - | Neutral | [64] |
| 96 | c.2062 A>G | p.Met688Val | - | Conserved | - | - |
| 97* | c.2139 G>C | p.Gly713Gly | - | - | Neutral | [57,65] |
| 50 | c.2500 G>A | p.Ala834Thr | - | Non-conserved | Neutral | [10,66] |
| 51 | c.2542 G>T | p.Ala848Ser | - | Conserved | - | - |
* identified in CRC families from other Caucasian populations than the Danish
Genotype frequencies of the variants in the analyzed cohorts
| Id | Variant | Sporadic cases | Familiar CRC cohort | Sub-cohort | ||||||
| HoWt | He | HoMut | HoWt | He | HoMut | HoWt | He | HoMut | ||
| 23 | ||||||||||
| 24 | c. 350 C>T | 1.000 | 0.000 | 0.000 | 0.997 | 0.003 | 0.000 | 1.000 | 0.000 | 0.000 |
| 25 | ||||||||||
| 26 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | ||||
| 27 | ||||||||||
| 28 | ||||||||||
| 29 | c.884+39 G>A | 1.000 | 0.000 | 0.000 | 0.997 | 0.003 | 0.000 | 1.000 | 0.000 | 0.000 |
| 30 | c.884+83_84 ins T | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 85 | 1.000 | 0.000 | 0.000 | |||||||
| 31 | c.1379 A>C | 1.000 | 0.000 | 0.000 | 0.994 | 0.006 | 0.000 | 1.000 | 0.000 | 0.000 |
| 86* | c.1558+11 G>A | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 32 | ||||||||||
| 37 | ||||||||||
| 101 | c.1689 A>G | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 33 | c.1732-2 A>T | 1.000 | 0.000 | 0.000 | 0.997 | 0.003 | 0.000 | 1.000 | 0.000 | 0.000 |
| 34 | ||||||||||
| 100 | c.1942 C>T | 1.000 | 0.000 | 0.000 | 0.997 | 0.003 | 0.000 | 1.000 | 0.000 | 0.000 |
| 35 | ||||||||||
| 36 | c.2152 C>T | 1.000 | 0.000 | 0.000 | 0.994 | 0.006 | 0.000 | 1.000 | 0.000 | 0.000 |
| 89 | na | na | na | |||||||
| 87 | c.131 C>T | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 88 | c.134 C>T | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 39 | c.212-23 A>C | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 90* | c.287 G>A | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 91* | c.329 A>G | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 92* | c.380 A>G | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 93 | c.560 T>G | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 41 | ||||||||||
| 43 | ||||||||||
| 48 | c.1786_1788 del AAT | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 95* | c.2006-6 T>C/G | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 96 | c.2062 A>G | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 97* | c.2139 G>C | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 50 | c.2500 G>A | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 51 | c.2542 G>T | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
* identified in CRC families from other populations than the Danish
§ na: not analyzed
Variants that where polymorphic in sporadic CRC cohort or in the sub-cohort are in bold
The p-values of marginal Fisher's exact tests between groups
| Id | Variant | Sporadic-Sub-cohort | Familiar-Sub-cohort | Sporadic-familiar | All |
| 23 | c.307-29 C>A | 0.7259 | 1.0000 | 0.6673 | 0.8632 |
| 25 | c.453+79 A>G | 0.7908 | - | - | - |
| 24 | c.350 C>T | - | 0.2988 | - | - |
| 26 | c.545+43 C>G | 0.3263 | 1.000 | 1.000 | 0.4741 |
| 27 | c.655 A>G | 0.7715 | 0.5619 | 0.4869 | 0.7489 |
| 28 | c.790+10 A>G | 1.0000 | - | - | - |
| 29 | C884+39G>A | - | 0.2943 | - | - |
| 85 | c.1217 G>A | 1.0000 | 0.5359 | 0.4858 | 0.4833 |
| 31 | c.1379 A>C | - | 0.1880 | - | - |
| 32 | c.1558+14 G>A | 1.0000 | 0.6063 | 0.7685 | 0.8722 |
| 37 | 0.5382 | 0.0329 | 0.0148 | ||
| 33 | c.1732-2 A>T | - | 0.3109 | - | - |
| 34 | c.1852_1853 AA>GC | 0.2408 | 0.5672 | 0.6760 | 0.3569 |
| 100 | c.1942 C>T | - | 0.4747 | ||
| 35 | c.1959 G>T | 0.2473 | 1.0000 | 0.3642 | 0.4493 |
| 36 | c.2152 C>T | - | 0.0840 | - | - |
| 89 | - | - | - | ||
| 41 | c.965 G>A | 0.2574 | 0.0244 | 0.3609 | 0.0582 |
| 43 | c.1511-9 A>T | 0.0958 | 0.7314 | 0.0796 | 0.1617 |
The Bonferroni corrected p-value of sporadic versus sub-cohort is 0.05/13 = 0.0038
The Bonferroni corrected p-value of familiar-versus sub-cohort is 0.05/15 = 0.0033
Polymorphic variants with borderline significant p-values are in bold
Figure 1LD plots of the sub-cohort in the genomic regions covering . Each square indicates the level of LD between two variants. The colours are defined as: red (high LD), LOD ≥ 2 and D' = 1; blue, LOD<2 and D' = 1; white, LOD<2 and D'<1. The top of the figure shows the genomic position and the known genes in the region.
In silico functional characterization of the missense variants
| Id | Variant | PMUT | SIFT | PolyPhen | Phylogeny | Activity in functional assays | References |
| MLH1 | |||||||
| 24 | p.Thr117Met | Neutral | Not tolerated | Possible damaging | Conserved | Aberrant | [11,12] |
| 27 | p.Val219Ile | Neutral | Tolerated | Benign | Ile other species | Normal | [11,12] |
| 85 | p.Ser406Asn | Neutral | Tolerated | Benign | Non-conserved | Normal | [11] |
| Pathogen | Tolerated | Benign | Non-conserved | NA | - | ||
| Neutral | Tolerated | Possible damaging | Non-conserved | NA | - | ||
| 34 | p.Lys618Ala | Pathogen | Not tolerated | Possible damaging | Non-conserved | Normal/aberrant | [9,11,45–47] |
| 100 | p.Pro648Ser | Pathogen | Not tolerated | Probably damaging | Conserved | Normal (aberrant protein stability) | [9,67] |
| 36 | p.His718Tyr | Neutral | Not tolerated | Probably damaging | Conserved | Normal | [11] |
| MSH2 | |||||||
| 89 | p.Thr44Met | Neutral | Not tolerated | Possible damaging | Conserved | NA | - |
| 87 | p.Ala45Val | Neutral | Tolerated | Benign | Val other species | NA | - |
| 90 | p.Arg96His | Pathogen | Tolerated | Probably damaging | Non-conserved | NA | - |
| 91 | p.Lys110Arg | Neutral | Tolerated | Benign | Non-conserved | NA | - |
| 92 | p.Asn127Ser | Neutral | Not tolerated | Probably damaging | Conserved | NA | - |
| 93 | p.Leu187Arg | Neutral | Not tolerated | Probably damaging | Conserved | NA | - |
| 41 | p.Gly322Asp | Neutral | Tolerated | Benign | Conserved | Slightly reduced | [8,44] |
| Neutral | Not tolerated | Probably damaging | Conserved | NA | - | ||
| 50 | p.Ala834Thr | Pathogen | Tolerated | Possible damaging | Non-conserved | Normal | [10] |
| Neutral | Not tolerated | Possible damaging | Conserved | NA | - |
NA, not available
Danish variants that have not been described previously are in bold
Figure 2Phylogeny of the MSH2 Met688Val and Ala848Ser variants. Multiple sequence alignment of the hMSH2 polypeptide sequence and orthologues from other species were generated using the ClustalW algorithm [31]. The following polypeptide sequences were used in the alignment: P43246 (Human; Homo sapiens), CAA57049 (Mouse; Mus musculus), S53609 (Frog; Xenopus laevis), XP 426110 (Chicken; Gallus gallus) and CAB42554 (Maize; Zea mays).